ABSTRACT
BACKGROUND: Chaetognaths, or arrow worms, are small marine, bilaterally symmetrical metazoans. The objective of this study was to analyse ribosomal protein (RP) coding sequences from a published collection of expressed sequence tags (ESTs) from a chaetognath (Spadella cephaloptera) and to use them in phylogenetic studies. RESULTS: This analysis has allowed us to determine the complete primary structures of 23 out of 32 RPs from the small ribosomal subunit (SSU) and 32 out of 47 RPs from the large ribosomal subunit (LSU). Ten proteins are partially determined and 14 proteins are missing. Phylogenetic analyses of concatenated RPs from six animal taxa (chaetognath, echinoderm, mammal, insect, mollusc and sponge) and one fungal taxon do not resolve the phylogenetic position of chaetognaths, although each mega-sequence comprises approximately 5,000 amino acid residues. This is probably due to the extremely biased base composition and to the high evolutionary rates in chaetognaths. However, the analysis of chaetognath RP genes revealed three features that are unique in the animal kingdom. First, whereas in animals each RP generally appears to have a single type of mRNA, two or more genes are generally transcribed for each RP type in chaetognaths. Second, cDNAs with complete 5'-ends encoding a given protein sequence can be divided into two sub-groups according to a short region in their 5'-ends: two novel and highly conserved elements have been identified (5'-TAATTGAGTAGTTT-3' and 5'-TATTAAGTACTAC-3'), which could correspond to different transcription factor binding sites on paralogous RP genes. Third, the overall number of deduced paralogous RPs is very high compared with the numbers published for other animals. CONCLUSION: These results suggest that in chaetognaths the deleterious effects of the presence of paralogous RPs, such as apoptosis or cancer, are avoided, and also that in each protein family some of the members could have tissue-specific and extra-ribosomal functions. These results are congruent with the hypotheses of an allopolyploid origin of this phylum and of ribosome heterogeneity.
Subjects
Invertebrates/genetics , Protein Biosynthesis , Ribosomal Proteins/genetics , Amino Acid Sequence , Animals , Complementary DNA , Molecular Evolution , Expressed Sequence Tags , Invertebrates/classification , Phylogeny , Protein Isoforms/genetics , Sequence Alignment
ABSTRACT
Chaetognaths constitute a small marine phylum exhibiting several characteristics that are highly unusual in animal genomes, including two classes of both rRNA and ribosomal protein genes. Because the presence of retrovirus-like elements has never been documented in this phylum, a published expressed sequence tag (EST) collection of the chaetognath Spadella cephaloptera was analysed. Twelve sequences representing transcript sections of the reverse transcriptase domain of active retrotransposons were isolated from approximately 11,000 ESTs. Five of them originate from Gypsy retrovirus-like elements, whereas the others are transcripts from a Bel-Pao LTR-retrotransposon, a Penelope-like element and LINE retrotransposons. Moreover, part of a putative integrase has also been found. Phylogenetic analyses suggest a deep-branching clade of the retrovirus-like elements, which is in agreement with the probable Cambrian origin of the phylum. Moreover, retrotransposons have not been found in telomeric-like transcripts, which probably consist of both vertebrate and arthropod canonical repeats.