RESUMO
BACKGROUND: Candida glabrata follows C. albicans as the second or third most prevalent cause of candidemia worldwide. These two pathogenic yeasts are distantly related, C. glabrata being part of the Nakaseomyces, a group more closely related to Saccharomyces cerevisiae. Although C. glabrata was thought to be the only pathogenic Nakaseomyces, two new pathogens have recently been described within this group: C. nivariensis and C. bracarensis. To gain insight into the genomic changes underlying the emergence of virulence, we sequenced the genomes of these two, and three other non-pathogenic Nakaseomyces, and compared them to other sequenced yeasts. RESULTS: Our results indicate that the two new pathogens are more closely related to the non-pathogenic N. delphensis than to C. glabrata. We uncover duplications and accelerated evolution that specifically affected genes in the lineage preceding the group containing N. delphensis and the three pathogens, which may provide clues to the higher propensity of this group to infect humans. Finally, the number of Epa-like adhesins is specifically enriched in the pathogens, particularly in C. glabrata. CONCLUSIONS: Remarkably, some features thought to be the result of adaptation of C. glabrata to a pathogenic lifestyle, are present throughout the Nakaseomyces, indicating these are rather ancient adaptations to other environments. Phylogeny suggests that human pathogenesis evolved several times, independently within the clade. The expansion of the EPA gene family in pathogens establishes an evolutionary link between adhesion and virulence phenotypes. Our analyses thus shed light onto the relationships between virulence and the recent genomic changes that occurred within the Nakaseomyces. SEQUENCE ACCESSION NUMBERS: Nakaseomyces delphensis: CAPT01000001 to CAPT01000179Candida bracarensis: CAPU01000001 to CAPU01000251Candida nivariensis: CAPV01000001 to CAPV01000123Candida castellii: CAPW01000001 to CAPW01000101Nakaseomyces bacillisporus: CAPX01000001 to CAPX01000186.
Assuntos
Candida glabrata/classificação , Genoma Fúngico , Filogenia , Saccharomycetales/classificação , Candida glabrata/genética , DNA Fúngico/genética , Evolução Molecular , Saccharomycetales/genética , Seleção Genética , Análise de Sequência de DNARESUMO
In eubacteria, the post-transcriptional modification of the wobble cytidine of the CAU anticodon in a precursor tRNA(Ile2) to a lysidine residue (2-lysyl-cytidine, abbreviated as L) allows the amino acid specificity to change from methionine to isoleucine and the codon decoding specificity to shift from AUG to AUA. The tilS gene encoding the enzyme that catalyses this modification is widely distributed. However, some microbial species lack a tilS gene, indicating that an alternative strategy exists to accurately translate the AUA codon into Ile. To determine whether a TilS-dependent bacterium, such as Bacillus subtilis, can overcome the absence of lysidine in its tRNA(Ile2) (CAU), we analysed the suppressor mutants of a tilS-thermosensitive allele. These tilS-suppressor mutants carry a substitution of the wobble guanosine into thymidine in one of the tRNA(Ile1) genes (the original GAT anticodon is changed to a TAT). In absence of TilS activity, the AUA codons are translated into isoleucine by the suppressor tRNA(Ile1), although a low level of AUA codons is also mistranslated into methionine. Results are in agreement with rare cases of eubacteria (and archaea), which naturally lack the tilS gene (or tiaS in archaea) but contain a tRNA(Ile2) gene containing a TAT instead of a CAT anticodon.
Assuntos
Aminoacil-tRNA Sintetases/metabolismo , Bacillus subtilis/enzimologia , RNA de Transferência/genética , Substituição de Aminoácidos , Aminoacil-tRNA Sintetases/genética , Anticódon/genética , Bacillus subtilis/genética , Temperatura Alta , Lisina/análogos & derivados , Lisina/química , Lisina/genética , Biossíntese de Proteínas/genética , Nucleosídeos de Pirimidina/química , Nucleosídeos de Pirimidina/genética , Supressão GenéticaRESUMO
The 11.3-Mb genome of the yeast Lachancea (Saccharomyces) kluyveri displays an intriguing compositional heterogeneity: a region of approximately 1 Mb, covering almost the whole left arm of chromosome C (C-left), has an average GC content of 52.9%, which is significantly higher than the 40.4% global GC content of the rest of the genome. This region contains the MAT locus, which remains normal in composition. The excess of GC base pairs affects both coding and noncoding sequences, and thus is not due to selective pressure acting on protein sequences. It leads to a strong codon usage bias and alters the amino acid composition of the 457 proteins encoded on C-left that do not show obvious bias for functional categories, or the presence of paralogs or orthologs of essential genes of Saccharomyces cerevisiae. They share significant synteny conservation with other species of the Saccharomycetaceae, and phylogenetic analysis indicates that C-left originates from a Lachancea species. In contrast, there is a complete absence of transposable elements in C-left, whereas 18 elements per megabase are distributed across the rest of the genome. Comparative hybridization of synchronized cells using high-density genome arrays reveals that C-left is replicated later during S phase than the rest of the genome. Two possible primary causes of this major compositional heterogeneity are discussed: an ancient hybridization of two related species with very distinct GC composition, or an intrinsic mechanism, possibly associated with the loss of the silent cassettes from C-left that progressively increased the GC content and generated the delayed replication of this chromosomal arm.
Assuntos
Composição de Bases/fisiologia , Cromossomos Fúngicos/genética , Período de Replicação do DNA/genética , Saccharomyces/genética , Composição de Bases/genética , Cromossomos Fúngicos/química , Códon/genética , Elementos de DNA Transponíveis/genética , Genoma Fúngico , Dados de Sequência Molecular , Filogenia , SinteniaRESUMO
Our knowledge of yeast genomes remains largely dominated by the extensive studies on Saccharomyces cerevisiae and the consequences of its ancestral duplication, leaving the evolution of the entire class of hemiascomycetes only partly explored. We concentrate here on five species of Saccharomycetaceae, a large subdivision of hemiascomycetes, that we call "protoploid" because they diverged from the S. cerevisiae lineage prior to its genome duplication. We determined the complete genome sequences of three of these species: Kluyveromyces (Lachancea) thermotolerans and Saccharomyces (Lachancea) kluyveri (two members of the newly described Lachancea clade), and Zygosaccharomyces rouxii. We included in our comparisons the previously available sequences of Kluyveromyces lactis and Ashbya (Eremothecium) gossypii. Despite their broad evolutionary range and significant individual variations in each lineage, the five protoploid Saccharomycetaceae share a core repertoire of approximately 3300 protein families and a high degree of conserved synteny. Synteny blocks were used to define gene orthology and to infer ancestors. Far from representing minimal genomes without redundancy, the five protoploid yeasts contain numerous copies of paralogous genes, either dispersed or in tandem arrays, that, altogether, constitute a third of each genome. Ancient, conserved paralogs as well as novel, lineage-specific paralogs were identified.
Assuntos
Genoma Fúngico , Genômica/métodos , Saccharomycetales/genética , Elementos de DNA Transponíveis/genética , Elementos de DNA Transponíveis/fisiologia , Eremothecium/genética , Duplicação Gênica , Genes Fúngicos/genética , Inteínas/genética , Kluyveromyces/genética , Dados de Sequência Molecular , Fases de Leitura Aberta/genética , Filogenia , RNA não Traduzido/genética , Saccharomyces/genética , Spliceossomos/metabolismo , Zygosaccharomyces/genéticaRESUMO
Identifying the mechanisms of eukaryotic genome evolution by comparative genomics is often complicated by the multiplicity of events that have taken place throughout the history of individual lineages, leaving only distorted and superimposed traces in the genome of each living organism. The hemiascomycete yeasts, with their compact genomes, similar lifestyle and distinct sexual and physiological properties, provide a unique opportunity to explore such mechanisms. We present here the complete, assembled genome sequences of four yeast species, selected to represent a broad evolutionary range within a single eukaryotic phylum, that after analysis proved to be molecularly as diverse as the entire phylum of chordates. A total of approximately 24,200 novel genes were identified, the translation products of which were classified together with Saccharomyces cerevisiae proteins into about 4,700 families, forming the basis for interspecific comparisons. Analysis of chromosome maps and genome redundancies reveal that the different yeast lineages have evolved through a marked interplay between several distinct molecular mechanisms, including tandem gene repeat formation, segmental duplication, a massive genome duplication and extensive gene loss.
Assuntos
Evolução Molecular , Genes Fúngicos/genética , Genoma Fúngico , Leveduras/classificação , Leveduras/genética , Cromossomos Fúngicos/genética , Sequência Conservada/genética , Duplicação Gênica , Dados de Sequência Molecular , RNA Ribossômico/genética , RNA de Transferência/genética , Proteínas de Saccharomyces cerevisiae/genética , Sintenia/genética , Sequências de Repetição em Tandem/genéticaRESUMO
In eukaryotes, genes transcribed by RNA polymerase III (Pol III) carry their own internal promoters and as such, are transcribed as individual units. Indeed, a very few cases of dicistronic Pol III genes are yet known. In contrast to other hemiascomycetes, 5S rRNA genes of Yarrowia lipolytica are not embedded into the tandemly repeated rDNA units, but appear scattered throughout the genome. We report here an unprecedented genomic organization: 48 over the 108 copies of the 5S rRNA genes are located 3' of tRNA genes. We show that these peculiar tRNA-5S rRNA dicistronic genes are expressed in vitro and in vivo as Pol III transcriptional fusions without the need of the 5S rRNA gene-specific factor TFIIIA, the deletion of which displays a viable phenotype. We also report the existence of a novel putative non-coding Pol III RNA of unknown function about 70 nucleotide-long (RUF70), the 13 genes of which are devoid of internal Pol III promoters and located 3' of the 13 copies of the tDNA-Trp (CCA). All genes embedded in the various dicistronic genes, fused 5S rRNA genes, RUF70 genes and their leader tRNA genes appear to be efficiently transcribed and their products correctly processed in vivo.
Assuntos
Genes de RNAr , RNA Ribossômico 5S/genética , RNA de Transferência/genética , Fator de Transcrição TFIIIA/metabolismo , Yarrowia/genética , Sequência de Bases , Evolução Molecular , Dosagem de Genes , Expressão Gênica , Fusão Gênica , Variação Genética , Genoma Fúngico , Dados de Sequência Molecular , Fenótipo , Precursores de RNA/química , Precursores de RNA/metabolismo , RNA Ribossômico 5S/química , RNA Ribossômico 5S/metabolismo , RNA de Transferência/química , RNA de Transferência de Triptofano/genética , Fator de Transcrição TFIIIA/antagonistas & inibidores , Fator de Transcrição TFIIIA/química , Yarrowia/metabolismoRESUMO
BACKGROUND: Naturally occurring RNAs contain numerous enzymatically altered nucleosides. Differences in RNA populations (RNomics) and pattern of RNA modifications (Modomics) depends on the organism analyzed and are two of the criteria that distinguish the three kingdoms of life. If the genomic sequences of the RNA molecules can be derived from whole genome sequence information, the modification profile cannot and requires or direct sequencing of the RNAs or predictive methods base on the presence or absence of the modifications genes. RESULTS: By employing a comparative genomics approach, we predicted almost all of the genes coding for the t+rRNA modification enzymes in the mesophilic moderate halophile Haloferax volcanii. These encode both guide RNAs and enzymes. Some are orthologous to previously identified genes in Archaea, Bacteria or in Saccharomyces cerevisiae, but several are original predictions. CONCLUSION: The number of modifications in t+rRNAs in the halophilic archaeon is surprisingly low when compared with other Archaea or Bacteria, particularly the hyperthermophilic organisms. This may result from the specific lifestyle of halophiles that require high intracellular salt concentration for survival. This salt content could allow RNA to maintain its functional structural integrity with fewer modifications. We predict that the few modifications present must be particularly important for decoding, accuracy of translation or are modifications that cannot be functionally replaced by the electrostatic interactions provided by the surrounding salt-ions. This analysis also guides future experimental validation work aiming to complete the understanding of the function of RNA modifications in Archaeal translation.
Assuntos
Genômica , Haloferax volcanii/genética , RNA Arqueal/genética , Genes de RNAr , Biossíntese de Proteínas , RNA Ribossômico 5S/genética , RNA de Transferência/genéticaRESUMO
We present the first comprehensive analysis of RNA polymerase III (Pol III) transcribed genes in ten yeast genomes. This set includes all tRNA genes (tDNA) and genes coding for SNR6 (U6), SNR52, SCR1 and RPR1 RNA in the nine hemiascomycetes Saccharomyces cerevisiae, Saccharomyces castellii, Candida glabrata, Kluyveromyces waltii, Kluyveromyces lactis, Eremothecium gossypii, Debaryomyces hansenii, Candida albicans, Yarrowia lipolytica and the archiascomycete Schizosaccharomyces pombe. We systematically analysed sequence specificities of tRNA genes, polymorphism, variability of introns, gene redundancy and gene clustering. Analysis of decoding strategies showed that yeasts close to S.cerevisiae use bacterial decoding rules to read the Leu CUN and Arg CGN codons, in contrast to all other known Eukaryotes. In D.hansenii and C.albicans, we identified a novel tDNA-Leu (AAG), reading the Leu CUU/CUC/CUA codons with an unusual G at position 32. A systematic 'p-distance tree' using the 60 variable positions of the tRNA molecule revealed that most tDNAs cluster into amino acid-specific sub-trees, suggesting that, within hemiascomycetes, orthologous tDNAs are more closely related than paralogs. We finally determined the bipartite A- and B-box sequences recognized by TFIIIC. These minimal sequences are nearly conserved throughout hemiascomycetes and were satisfactorily retrieved at appropriate locations in other Pol III genes.
Assuntos
Ascomicetos/genética , Genes Fúngicos , RNA Polimerase III/metabolismo , RNA de Transferência/genética , Ascomicetos/enzimologia , Sequência de Bases , Códon , Sequência Conservada , DNA Fúngico/química , Evolução Molecular , Genoma Fúngico , Genômica , Íntrons , Dados de Sequência Molecular , Família Multigênica , Polimorfismo Genético , Regiões Promotoras Genéticas , RNA de Transferência/metabolismo , RNA não Traduzido/genética , Fatores de Transcrição TFIII/metabolismo , Transcrição GênicaRESUMO
Yeast transcription factor IIIC (TFIIIC) plays a key role in assembling the transcription initiation factor TFIIIB on class III genes after TFIIIC-DNA binding. The second largest subunit of TFIIIC, tau131, is thought to initiate TFIIIB assembly by interacting with Brf1/TFIIIB70. In this work, we have analyzed a TFIIIC mutant (tau131-DeltaTPR2) harboring a deletion in tau131 removing the second of its 11 tetratricopeptide repeats. Remarkably, this thermosensitive mutation was selectively suppressed in vivo by overexpression of B"/TFIIIB90, but not Brf1 or TATA-binding protein. In vitro, the mutant factor preincubated at restrictive temperature bound DNA efficiently but lost transcription factor activity. The in vitro transcription defect was abolished at high concentrations of B" but not Brf1. Copurification experiments of baculovirus-expressed proteins confirmed a direct physical interaction between tau131 and B". tau131, therefore, appears to be involved in the recruitment of both Brf1 and B".
Assuntos
Saccharomyces cerevisiae/metabolismo , Fatores Associados à Proteína de Ligação a TATA , Fatores de Transcrição TFIII/metabolismo , Fatores de Transcrição/metabolismo , Motivos de Aminoácidos , Sequência de Aminoácidos , Animais , Proteínas de Ligação a DNA/genética , Proteínas de Ligação a DNA/metabolismo , Genes Reporter , Heparina/farmacologia , Humanos , Dados de Sequência Molecular , Mutação , Subunidades Proteicas , Proteínas Recombinantes de Fusão/metabolismo , Saccharomyces cerevisiae/efeitos dos fármacos , Saccharomyces cerevisiae/crescimento & desenvolvimento , Proteínas de Saccharomyces cerevisiae/química , Proteínas de Saccharomyces cerevisiae/genética , Proteínas de Saccharomyces cerevisiae/metabolismo , Alinhamento de Sequência , Temperatura , Fator de Transcrição TFIIIB , Fatores de Transcrição/química , Fatores de Transcrição/genética , Fatores de Transcrição TFIII/química , Fatores de Transcrição TFIII/genéticaRESUMO
A search has been performed on 2878 tRNA sequences from 60 different genomes in order to detect the existence of closely related 'alloacceptor' tRNAs accepting dissimilar amino acids that could be paralogs generated by gene duplications. This has led to the identification of extremely conserved tRNA(Phe)-tRNA(Tyr) pairs displaying as high as 94% identity between them, and also other potentially paralogous tRNA pairs in archaeal species. These paralogous pairs are enriched for amino acid pairs belonging to the same amino acid biosynthetic family, thus providing evidence for the coevolution of genetic code and amino acid biosynthesis. Overall, the genetic distances between alloacceptor tRNAs yield estimates of how closely clustered in sequence space are the tRNAs in a genome. Among 34 Bacteria, 18 Archaea and 8 Eukarya, Methanopyrus kandleri and Aeropyrum pernix have yielded the lowest alloacceptor distances and largest number of paralogous pairs. Based on a cluster-dispersion model of tRNA evolution, such tight alloacceptor clustering is a measure of primitiveness of tRNA genotypes, and places last universal common ancestor (LUCA) between the branches leading to these two archaea in the tRNA phylogenetic tree.
Assuntos
Filogenia , RNA de Transferência/genética , Aminoácidos/biossíntese , Animais , Archaea/genética , Bactérias/genética , Sequência de Bases , Evolução Molecular , Código Genético/genética , Genoma , Humanos , Modelos Genéticos , Dados de Sequência Molecular , RNA de Transferência/metabolismo , Homologia de Sequência do Ácido NucleicoRESUMO
BACKGROUND: The industrially important yeast Blastobotrys (Arxula) adeninivorans is an asexual hemiascomycete phylogenetically very distant from Saccharomyces cerevisiae. Its unusual metabolic flexibility allows it to use a wide range of carbon and nitrogen sources, while being thermotolerant, xerotolerant and osmotolerant. RESULTS: The sequencing of strain LS3 revealed that the nuclear genome of A. adeninivorans is 11.8 Mb long and consists of four chromosomes with regional centromeres. Its closest sequenced relative is Yarrowia lipolytica, although mean conservation of orthologs is low. With 914 introns within 6116 genes, A. adeninivorans is one of the most intron-rich hemiascomycetes sequenced to date. Several large species-specific families appear to result from multiple rounds of segmental duplications of tandem gene arrays, a novel mechanism not yet described in yeasts. An analysis of the genome and its transcriptome revealed enzymes with biotechnological potential, such as two extracellular tannases (Atan1p and Atan2p) of the tannic-acid catabolic route, and a new pathway for the assimilation of n-butanol via butyric aldehyde and butyric acid. CONCLUSIONS: The high-quality genome of this species that diverged early in Saccharomycotina will allow further fundamental studies on comparative genomics, evolution and phylogenetics. Protein components of different pathways for carbon and nitrogen source utilization were identified, which so far has remained unexplored in yeast, offering clues for further biotechnological developments. In the course of identifying alternative microorganisms for biotechnological interest, A. adeninivorans has already proved its strengthened competitiveness as a promising cell factory for many more applications.
RESUMO
A comparative genomic analysis of the recently sequenced human body louse unicellular endosymbiont Candidatus Riesia pediculicola with a reduced genome (582 Kb), revealed that it is the only known organism that might have lost all post-transcriptional base and ribose modifications of the tRNA body, retaining only modifications of the anticodon-stem-loop essential for mRNA decoding. Such a minimal tRNA modification set was not observed in other insect symbionts or in parasitic unicellular bacteria, such as Mycoplasma genitalium (580 Kb), that have also evolved by considerably reducing their genomes. This could be an example of a minimal tRNA modification set required for life, a question that has been at the center of the field for many years, especially for understanding the emergence and evolution of the genetic code.
RESUMO
Polyploidization is an important process in the evolution of eukaryotic genomes, but ensuing molecular mechanisms remain to be clarified. Autopolyploidization or whole-genome duplication events frequently are resolved in resulting lineages by the loss of single genes from most duplicated pairs, causing transient gene dosage imbalance and accelerating speciation through meiotic infertility. Allopolyploidization or formation of interspecies hybrids raises the problem of genetic incompatibility (Bateson-Dobzhansky-Muller effect) and may be resolved by the accumulation of mutational changes in resulting lineages. In this article, we show that an osmotolerant yeast species, Pichia sorbitophila, recently isolated in a concentrated sorbitol solution in industry, illustrates this last situation. Its genome is a mosaic of homologous and homeologous chromosomes, or parts thereof, that corresponds to a recently formed hybrid in the process of evolution. The respective parental contributions to this genome were characterized using existing variations in GC content. The genomic changes that occurred during the short period since hybrid formation were identified (e.g., loss of heterozygosity, unilateral loss of rDNA, reciprocal exchange) and distinguished from those undergone by the two parental genomes after separation from their common ancestor (i.e., NUMT (NUclear sequences of MiTochondrial origin) insertions, gene acquisitions, gene location movements, reciprocal translocation). We found that the physiological characteristics of this new yeast species are determined by specific but unequal contributions of its two parents, one of which could be identified as very closely related to an extant Pichia farinosa strain.
RESUMO
Whatever their abundance in genomes, spliceosomal introns are the signature of eukaryotic genes. The sequence of Saccharomyces cerevisiae, achieved fifteen years ago, revealed that this yeast has very few introns, but conserved intron boundaries typical for an intron definition mechanism. With the improvement and the development of new sequencing technologies, yeast genomes have been extensively sequenced during the last decade. We took advantage of this plethora of data to compile and assess the intron content of the protein-coding genes of 13 genomes representative of the evolution of hemiascomycetous yeasts. We first observed that intron paucity is a general rule and that the fastest evolving genomes tend to lose their introns more rapidly (e.g. S. cerevisiae versus Yarrowia lipolytica). Noticeable differences were also confirmed for 5' splice sites and branch point sites (BP) as well as for the relative position of the BP. These changes seemed to be correlated with the lineage specific evolution of splicing factors.
Assuntos
Genoma Fúngico/genética , Íntrons/genética , Saccharomycetales/genética , Processamento Alternativo/genética , Bases de Dados Genéticas , Evolução Molecular , Filogenia , Proteínas Ribossômicas/metabolismo , Saccharomyces cerevisiae/genéticaRESUMO
The strategies organisms use to decode synonymous codons in cytosolic protein synthesis are not uniform. The complete isoacceptor tRNA repertoire and the type of modified nucleoside found at the wobble position 34 of their anticodons were analyzed in all kingdoms of life. This led to the identification of four main decoding strategies that are diversely used in Bacteria, Archaea and Eukarya. Many of the modern tRNA modification enzymes acting at position 34 of tRNAs are present only in specific domains and obviously have arisen late during evolution. In an evolutionary fine-tuning process, these enzymes must have played an essential role in the progressive introduction of new amino acids, and in the refinement and standardization of the canonical nuclear genetic code observed in all extant organisms (functional convergent evolutionary hypothesis).
Assuntos
Archaea/genética , Bactérias/genética , Códon/genética , Eucariotos/genética , Código Genético , RNA de Transferência/genética , Archaea/enzimologia , Bactérias/enzimologia , Enzimas/genética , Enzimas/metabolismo , Eucariotos/enzimologia , Evolução Molecular , Biossíntese de Proteínas/genéticaRESUMO
Over a thousand of cytoplasmic, non organellar tRNA genes were extracted from the whole-genomes of more than 100 organisms spanning the Bacteria, Eukarya and Archaea (tRNomics). Also, whenever possible, the genes coding for modification enzymes acting on tRNA, particularly those involved in modification of nucleotides in the anticodon loop, were identified (Modomics). Combining these two data sets, we were able to reveal three main decoding strategies used by individual contemporary organisms to read the 62 (61+1 initiator) sense codons of mRNA. Based on the known phylogenetic relationships of the different organisms analyzed, this work allows to predict which RNA modification enzymes are essential for an accurate and efficient translation process, as well as to shed light on when these complex and diverse tRNA maturation processes probably emerged during cellular evolution.
Assuntos
Anticódon/química , Códon/química , Evolução Molecular , RNA de Transferência/genética , Animais , Ascomicetos/genética , Genoma Arqueal , Genoma Bacteriano , Genoma Fúngico , Genômica , RNA de Transferência/química , Tenericutes/genéticaRESUMO
Stable RNAs are central to protein synthesis. Ribosomal RNAs make the core of the ribosome and provide the scaffold for accurate translation of mRNAs by a set of tRNA molecules each carrying an activated amino acid. To fulfill these important cellular functions, both rRNA and tRNA molecules require more than the four canonical bases and have recruited enzymes that introduce numerous modifications on nucleosides. Mollicutes are parasitic unicellular bacteria that originated from gram-positive bacteria by considerably reducing their genome, reaching a minimal size of 480 kb in Mycoplasma genitalium. By analyzing the complete set of tRNA isoacceptors (tRNomics) and predicting the tRNA/rRNA modification enzymes (Modomics) among all sequenced Mollicutes (15 in all), our goal is to predict the minimal set of RNA modifications needed to sustain accurate translation of the cell's genetic information. Building on the known phylogenetic relationship of the 15 Mollicutes analyzed, we demonstrate that the solutions to reducing the RNA component of the translation apparatus vary from one Mollicute to the other and often rely on co-evolution of specific tRNA isoacceptors and RNA modification enzymes. This analysis also reveals that only a few modification enzymes acting on nucleotides of the anticodon loop in tRNA (the wobble position 34 as well as in position 37, 3'-adjacent to anticodon) and of the peptidyltransferase center of 23S rRNA appear to be absolutely essential and resistant to gene loss during the evolutionary process of genome reduction.
Assuntos
Evolução Molecular , Genes Bacterianos/fisiologia , Genoma Bacteriano/fisiologia , RNA Bacteriano/fisiologia , RNA Ribossômico/fisiologia , RNA de Transferência/fisiologia , Tenericutes/enzimologia , Tenericutes/genética , Sequência de Bases , Códon/genética , Dados de Sequência Molecular , Mycoplasma capricolum/enzimologia , Mycoplasma capricolum/genética , Mycoplasma capricolum/fisiologia , Conformação de Ácido Nucleico , RNA Bacteriano/química , RNA Ribossômico/química , RNA de Transferência/química , Tenericutes/fisiologia , tRNA Metiltransferases/química , tRNA Metiltransferases/genéticaRESUMO
TFIIIC in yeast and humans is required for transcription of tRNA and 5 S RNA genes by RNA polymerase III. In the yeast Saccharomyces cerevisiae, TFIIIC is composed of six subunits, five of which are conserved in humans. We report the identification, molecular cloning, and characterization of the sixth subunit of human TFIIIC, TFIIIC35, which is related to the smallest subunit of yeast TFIIIC. Human TFIIIC35 does not contain the phosphoglycerate mutase domain of its yeast counterpart, and these two proteins display only limited homology within a 34-amino acid domain. Homologs of the sixth TFIIIC subunit are also identified in other eukaryotes, and their phylogenic evolution is analyzed. Affinity-purified human TFIIIC from an epitope-tagged TFIIIC35 cell line is active in binding to and in transcription of the VA1 gene in vitro. Furthermore, TFIIIC35 specifically interacts with the human TFIIIC subunits TFIIIC63 and, to a lesser extent, TFIIIC90 in vitro. Finally, we determined a limited region in the smallest subunit of yeast TFIIIC that is sufficient for interacting with the yeast TFIIIC subunit ScTfc1 (orthologous to TFIIIC63) and found it to be adjacent to and overlap the 34-amino acid domain that is conserved from yeast to humans.
Assuntos
Fatores de Transcrição TFIII/metabolismo , Sequência de Aminoácidos , Sequência de Bases , Clonagem Molecular , Primers do DNA , Imunofluorescência , Humanos , Dados de Sequência Molecular , Filogenia , RNA Polimerase III/metabolismo , Proteínas Recombinantes/química , Proteínas Recombinantes/classificação , Proteínas Recombinantes/genética , Proteínas Recombinantes/metabolismo , Homologia de Sequência de Aminoácidos , Fatores de Transcrição TFIII/química , Fatores de Transcrição TFIII/classificação , Fatores de Transcrição TFIII/genéticaRESUMO
From 50 genomes of the three domains of life (7 eukarya, 13 archaea, and 30 bacteria), we extracted, analyzed, and compared over 4,000 sequences corresponding to cytoplasmic, nonorganellar tRNAs. For each genome, the complete set of tRNAs required to read the 61 sense codons was identified, which permitted revelation of three major anticodon-sparing strategies. Other features and sequence peculiarities analyzed are the following: (1) fit to the standard cloverleaf structure, (2) characteristic consensus sequences for elongator and initiator tDNAs, (3) frequencies of bases at each sequence position, (4) type and frequencies of conserved 2D and 3D base pairs, (5) anticodon/tDNA usages and anticodon-sparing strategies, (6) identification of the tRNA-Ile with anticodon CAU reading AUA, (7) size of variable arm, (8) occurrence and location of introns, (9) occurrence of 3'-CCA and 5'-extra G encoded at the tDNA level, and (10) distribution of the tRNA genes in genomes and their mode of transcription. Among all tRNA isoacceptors, we found that initiator tDNA-iMet is the most conserved across the three domains, yet domain-specific signatures exist. Also, according to which tRNA feature is considered (5'-extra G encoded in tDNAs-His, AUA codon read by tRNA-Ile with anticodon CAU, presence of intron, absence of "two-out-of-three" reading mode and short V-arm in tDNA-Tyr) Archaea sequester either with Bacteria or Eukarya. No common features between Eukarya and Bacteria not shared with Archaea could be unveiled. Thus, from the tRNomic point of view, Archaea appears as an "intermediate domain" between Eukarya and Bacteria.
Assuntos
Archaea/genética , Bactérias/genética , Células Eucarióticas/fisiologia , Genoma , RNA de Transferência/genética , Sequência de Aminoácidos , Animais , Composição de Bases , Pareamento de Bases , Sequência de Bases , Códon , Sequência Conservada , Humanos , Dados de Sequência Molecular , Conformação de Ácido Nucleico , Filogenia , Processamento Pós-Transcricional do RNA , RNA de Transferência/química , RNA de Transferência de Isoleucina , Transcrição GênicaRESUMO
Most introns of archaeal tRNA genes (tDNAs) are located in the anticodon loop, between nucleotides 37 and 38, the unique location of their eukaryotic counterparts. However, in several Archaea, mostly in Crenarchaeota, introns have been found at many other positions of the tDNAs. In the present work, we revisit and extend all previous findings concerning the identification, exact location, size, and possible fit to the proposed bulge-helix-bulge structural motif (BHB, now renamed hBHBh') of the sequences spanning intron-exon junctions in intron-containing tRNAs of 18 archaea. A total of 103 introns were found located at the usual position 37/38 and 33 introns at 14 other different positions, that is, in the anticodon stem and loop, in the D-and T-loops, in the V-arm, or in the amino acid arm. For introns located at 37/38 and elsewhere in the pre-tRNA, canonical hBHBh' motifs were not always found. Instead, a relaxed hBH or HBh' motif including the constant central 4-bp helix H flanked by one helix (h or h') on either side generating only one bulge could be disclosed. Also, for introns located elsewhere than at position 37/38, the hBHBh' (or HBh') structure competes with the three-dimensional structure of the mature tRNA, attesting to important structural rearrangements during the complex multistep maturation-splicing processes. A homotetramer-type of splicing endonuclease (like in all Crenarchaeota) instead of a homodimeric-type of enzyme (as in most Euryarchaeota) appears to best fit the requirement for splicing introns at relaxed hBH or HBh' motifs, and may represent the most primitive form of this enzyme.