Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 49
Filtrar
1.
Stud Mycol ; 105: 1-22, 2023 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-38895705

RESUMO

The subphylum Saccharomycotina is a lineage in the fungal phylum Ascomycota that exhibits levels of genomic diversity similar to those of plants and animals. The Saccharomycotina consist of more than 1 200 known species currently divided into 16 families, one order, and one class. Species in this subphylum are ecologically and metabolically diverse and include important opportunistic human pathogens, as well as species important in biotechnological applications. Many traits of biotechnological interest are found in closely related species and often restricted to single phylogenetic clades. However, the biotechnological potential of most yeast species remains unexplored. Although the subphylum Saccharomycotina has much higher rates of genome sequence evolution than its sister subphylum, Pezizomycotina, it contains only one class compared to the 16 classes in Pezizomycotina. The third subphylum of Ascomycota, the Taphrinomycotina, consists of six classes and has approximately 10 times fewer species than the Saccharomycotina. These data indicate that the current classification of all these yeasts into a single class and a single order is an underappreciation of their diversity. Our previous genome-scale phylogenetic analyses showed that the Saccharomycotina contains 12 major and robustly supported phylogenetic clades; seven of these are current families (Lipomycetaceae, Trigonopsidaceae, Alloascoideaceae, Pichiaceae, Phaffomycetaceae, Saccharomycodaceae, and Saccharomycetaceae), one comprises two current families (Dipodascaceae and Trichomonascaceae), one represents the genus Sporopachydermia, and three represent lineages that differ in their translation of the CUG codon (CUG-Ala, CUG-Ser1, and CUG-Ser2). Using these analyses in combination with relative evolutionary divergence and genome content analyses, we propose an updated classification for the Saccharomycotina, including seven classes and 12 orders that can be diagnosed by genome content. This updated classification is consistent with the high levels of genomic diversity within this subphylum and is necessary to make the higher rank classification of the Saccharomycotina more comparable to that of other fungi, as well as to communicate efficiently on lineages that are not yet formally named. Taxonomic novelties: New classes: Alloascoideomycetes M. Groenew., Hittinger, Opulente & A. Rokas, Dipodascomycetes M. Groenew., Hittinger, Opulente & A. Rokas, Lipomycetes M. Groenew., Hittinger, Opulente, A. Rokas, Pichiomycetes M. Groenew., Hittinger, Opulente & A. Rokas, Sporopachydermiomycetes M. Groenew., Hittinger, Opulente & A. Rokas, Trigonopsidomycetes M. Groenew., Hittinger, Opulente & A. Rokas. New orders: Alloascoideomycetes: Alloascoideales M. Groenew., Hittinger, Opulente & A. Rokas; Dipodascomycetes: Dipodascales M. Groenew., Hittinger, Opulente & A. Rokas; Lipomycetes: Lipomycetales M. Groenew., Hittinger, Opulente & A. Rokas; Pichiomycetes: Alaninales M. Groenew., Hittinger, Opulente & A. Rokas, Pichiales M. Groenew., Hittinger, Opulente & A. Rokas, Serinales M. Groenew., Hittinger, Opulente & A. Rokas; Saccharomycetes: Phaffomycetales M. Groenew., Hittinger, Opulente & A. Rokas, Saccharomycodales M. Groenew., Hittinger, Opulente & A. Rokas; Sporopachydermiomycetes: Sporopachydermiales M. Groenew., Hittinger, Opulente & A. Rokas; Trigonopsidomycetes: Trigonopsidales M. Groenew., Hittinger, Opulente & A. Rokas. New families: Alaninales: Pachysolenaceae M. Groenew., Hittinger, Opulente & A. Rokas; Pichiales: Pichiaceae M. Groenew., Hittinger, Opulente & A. Rokas; Sporopachydermiales: Sporopachydermiaceae M. Groenew., Hittinger, Opulente & A. Rokas. Citation: Groenewald M, Hittinger CT, Bensch K, Opulente DA, Shen X-X, Li Y, Liu C, LaBella AL, Zhou X, Limtong S, Jindamorakot S, Gonçalves P, Robert V, Wolfe KH, Rosa CA, Boekhout T, Cadez N, Péter G, Sampaio JP, Lachance M-A, Yurkov AM, Daniel H-M, Takashima M, Boundy-Mills K, Libkind D, Aoki K, Sugita T, Rokas A (2023). A genome-informed higher rank classification of the biotechnologically important fungal subphylum Saccharomycotina. Studies in Mycology 105: 1-22. doi: 10.3114/sim.2023.105.01 This study is dedicated to the memory of Cletus P. Kurtzman (1938-2017), a pioneer of yeast taxonomy.

2.
Science ; 287(5456): 1283-6, 2000 Feb 18.
Artigo em Inglês | MEDLINE | ID: mdl-10678838

RESUMO

Point mutations are generally assumed to involve changes of single nucleotides. Nevertheless, the nature and known mechanisms of mutation do not exclude the possibility that several adjacent nucleotides may change simultaneously in a single mutational event. Two independent approaches are used here to estimate the frequency of simultaneous double-nucleotide substitutions. The first examines switches between TCN and AGY (where N is any nucleotide and Y is a pyrimidine) codons encoding absolutely conserved serine residues in a number of proteins from diverse organisms. The second reveals double-nucleotide substitutions in primate noncoding sequences. These two complementary approaches provide similar high estimates for the rate of doublet substitutions, on the order of 0.1 per site per billion years.


Assuntos
Códon , Globinas/genética , Mutação , Nucleotídeos/genética , Serina/genética , Alelos , Substituição de Aminoácidos , Animais , Evolução Molecular , Haplorrinos/genética , Humanos , Modelos Genéticos , Filogenia
3.
Science ; 246(4931): 808-10, 1989 Nov 10.
Artigo em Inglês | MEDLINE | ID: mdl-2683084

RESUMO

The basal rate of DNA sequence evolution in enterobacteria, as seen in the extent of divergence between Escherichia coli and Salmonella typhimurium, varies greatly among genes, even when only "silent" sites are considered. The degree of divergence is clearly related to the level of gene expression, reflecting constraints on synonymous codon choice. However, where this constraint is weak, among genes not expressed at high levels, divergence is also related to the chromosomal location of the gene; it appears that genes furthest away from oriC, the origin of replication, have a mutation rate approximately two times that of genes near oriC.


Assuntos
Evolução Biológica , Cromossomos Bacterianos , Enterobacteriaceae/genética , Mutação , Viés , Códon/genética , Reparo do DNA , Replicação do DNA , DNA Bacteriano/genética , Enterobacteriaceae/ultraestrutura , Escherichia coli/genética , Escherichia coli/ultraestrutura , Regulação Bacteriana da Expressão Gênica , Genes Bacterianos , Análise de Regressão , Salmonella typhimurium/genética , Salmonella typhimurium/ultraestrutura
4.
Curr Opin Genet Dev ; 8(6): 694-700, 1998 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-9914206

RESUMO

Several eukaryotes, including maize, yeast and Xenopus, are degenerate polyploids formed by relatively recent whole-genome duplications. Ohno's conjecture that more ancient genome duplications occurred in an ancestor of vertebrates is probably at least partly true but the present shortage of gene sequence and map information from vertebrates makes it difficult to either prove or disprove this hypothesis. Candidate paralogous segments in mammalian genomes have been identified but the lack of statistical rigour means that many of the proposals in the literature are probably artefacts.


Assuntos
Genoma , Poliploidia , Animais , Humanos , Saccharomyces cerevisiae , Xenopus , Zea mays
5.
Curr Opin Genet Dev ; 1(4): 523-9, 1991 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-1822284

RESUMO

Recent findings have established cracks in the straight-laced image of the plastid genome as a molecule whose sole function is photosynthesis and whose gene content is highly conserved. Genes for numerous non-photosynthetic functions have been identified. Algal plastid genomes contain many genes with no homologs in angiosperms, and the recent transfer of genes from the plastid to the nuclear genome has been described. Wholesale abandonment of genes encoding photosynthetic and gene-expression functions has occurred in the plastid genomes of a non-green plant and alga. The origins of plastid DNA, its use in phylogenetic studies, and the origins of plastid introns are also reviewed.


Assuntos
Evolução Biológica , Genoma , Organelas , Plantas/genética , Cloroplastos , Eucariotos/classificação , Eucariotos/genética , Expressão Gênica , Genes de Plantas , Filogenia , Plantas/classificação , Plantas Tóxicas , Simbiose , Nicotiana/genética
6.
Curr Opin Microbiol ; 2(5): 548-54, 1999 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-10508730

RESUMO

The Saccharomyces cerevisiae genome sequence, augmented by new data on gene expression and function, continues to yield new findings about eukaryote genome evolution. Analysis of the duplicate gene pairs formed by whole-genome duplication indicates that selection for increased levels of gene expression was a significant factor determining which genes were retained as duplicates and which were returned to a single-copy state, possibly in addition to selection for novel gene functions. Proteome comparisons between worm and yeast show that genes for core metabolic processes are shared among eukaryotes and unchanging in function, while comparisons between different yeast species identify 'orphan' genes as the most rapidly evolving fraction of the proteome. Natural hybridisation among yeast species is frequent, but its long-term evolutionary significance is unknown.


Assuntos
Evolução Molecular , Genoma Fúngico , Saccharomyces cerevisiae/genética , Expressão Gênica , Genes Duplicados , Íntrons , Proteoma
7.
Biochim Biophys Acta ; 1219(3): 711-2, 1994 Nov 22.
Artigo em Inglês | MEDLINE | ID: mdl-7948031

RESUMO

A Saccharomyces cerevisiae gene (HTN1) that encodes a homologue of mouse Ran binding protein 1 (RanBP1, also known as HTF9A) was identified, cloned and sequenced. The two proteins are 51% identical in sequence. The HTN1 protein may interact with yeast GSP1, GSP2, and PRP20 proteins in an intracellular signalling pathway equivalent to the mammalian RanBP1-Ran/TC4-RCC1 pathway. RanBP1 homologues also exist in worms and rice.


Assuntos
Proteínas de Ligação ao GTP/genética , Proteínas Ativadoras de GTPase , Genes Fúngicos/genética , Proteínas Nucleares , Proteínas de Saccharomyces cerevisiae , Saccharomyces cerevisiae/genética , Proteína ran de Ligação ao GTP , Sequência de Aminoácidos , Sequência de Bases , Dados de Sequência Molecular , Homologia de Sequência de Aminoácidos
8.
J Mol Biol ; 223(1): 95-104, 1992 Jan 05.
Artigo em Inglês | MEDLINE | ID: mdl-1731088

RESUMO

We have determined the nucleotide sequence of a 7 kb (1 kb = 10(3) base-pairs) region that includes the entire small single-copy region (SSC) of the plastid genome of Epifagus virginiana, a non-photosynthetic, parasitic flowering plant. The SSC (4.8 kb) is considerably smaller than those of photosynthetic plants due to the complete deletion of all photosynthetic, chlororespiratory and ribosomal protein genes. This leaves only two genes: a protein gene of 1738 codons whose product is unlikely to be involved in bioenergetic processes and a leucine tRNA gene (trn(LUAG)). Both genes span junctions between the inverted repeat and the SSC, with the consequence that the terminal 20 base-pairs of the repeat is transcribed in both directions and functions both as the 3' end of the tRNA gene and as an internal segment of orf1738. We find that the region of tobacco plastid DNA homologous to Epifagus orf1738 contains a single open reading frame (ORF) of 1901 codons rather than the three ORFs of 1244, 273 and 228 codons originally reported. However, we confirm that the equivalent region of the bryophyte Marchantia contains two genes (1068 and 464 codons) corresponding to the N and C-terminal portions of the dicot protein. In contrast, rice plastid DNA contains a severely truncated pseudogene at this locus.


Assuntos
Cloroplastos/fisiologia , Genes de Plantas , Plantas/genética , Sequência de Aminoácidos , Sequência de Bases , Evolução Biológica , Clonagem Molecular , Dados de Sequência Molecular , Oligonucleotídeos/química , Filogenia , RNA de Transferência/genética
9.
Genetics ; 150(1): 425-33, 1998 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-9725858

RESUMO

The plastid genome of the nonphotosynthetic parasitic plant Epifagus virginiana contains only 17 of the 30 tRNA genes normally found in angiosperm plastid DNA. Although this is insufficient for translation, the genome is functional, so import of cytosolic tRNAs into plastids has been suggested. This raises the question of whether the tRNA genes that remain in E. virginiana plastid DNA are active or have just fortuitously escaped deletion. We report the sequences of 20 plastid tRNA loci from Orobanche minor, which shares a nonphotosynthetic ancestor with E. virginiana. The two species have 9 intact tRNA genes in common, the others being defunct in one or both species. The intron-containing trnLUAA gene is absent from E. virginiana, but it is intact, transcribed, and spliced in O. minor. The shared intact genes are better conserved than intergenic sequences, which indicates that these genes are being maintained by natural selection and, therefore, must be functional. For the most part, the tRNA species conserved in nonphotosynthetic plastids are also those that have never been found to be imported in plant mitochondria, which suggests that the same rules may govern tRNA import in the two organelles. A small photosynthesis gene, psbI, is still intact in O. minor, and computer simulations show that some small nonessential genes have an appreciable chance of escaping deletion.


Assuntos
DNA de Plantas/genética , Genes de Plantas , Plantas/genética , Plastídeos/metabolismo , RNA de Transferência/genética , Sequência de Bases , Conformação de Ácido Nucleico , RNA de Transferência/química , Deleção de Sequência
10.
Gene ; 66(2): 215-22, 1988 Jun 30.
Artigo em Inglês | MEDLINE | ID: mdl-3169573

RESUMO

We have used a rapid computer dot-matrix comparison method to identify all DNA regions which have been evolutionarily conserved between the completely sequenced chloroplast genomes of tobacco and a liverwort. Analysis of these regions reveals 74 homologous open reading frames (ORFs) which have been conserved as to length and amino acid sequence; these ORFs also have an excess of nucleotide substitutions at silent sites of codons. Since the nonfunctional parts of these genomes have become saturated with mutations and show no sequence similarity whatsoever, the homologous ORFs are almost certainly functional. A further four pairs of ORFs show homology limited to only a short part of their putative gene products. Amino acid sequence identities range between 50 and 99%; some chloroplast proteins are seen to be among the most slowly evolving of all known proteins. A search of the nucleotide and amino acid sequence databanks has revealed several previously unidentified genes in chloroplast sequences from other species, but no new homologies to prokaryotic genes.


Assuntos
Cloroplastos , Plantas/genética , Evolução Biológica , DNA/genética , Mutação , Plantas Tóxicas , Homologia de Sequência do Ácido Nucleico , Nicotiana/genética
11.
Gene ; 238(1): 253-61, 1999 Sep 30.
Artigo em Inglês | MEDLINE | ID: mdl-10571001

RESUMO

We have updated the map of duplicated chromosomal segments in the Saccharomyces cerevisiae genome originally published by Wolfe and Shields in 1997 (Nature 387, 708-713). The new analysis is based on the more sensitive Smith Waterman search method instead of BLAST. The parameters used to identify duplicated chromosomal regions were optimized such as to maximize the amount of the genome placed into paired regions, under the assumption that the hypothesis that the entire genome was duplicated in a single event is correct. The core of the new map, with 52 pairs of regions containing three or more duplicated genes, is largely unchanged from our original map. 39 tRNA gene pairs and one snRNA pair have been added. To find additional pairs of genes that may have been formed by whole genome duplication, we searched through the parts of the genome that are not covered by this core map, looking for putative duplicated chromosomal regions containing only two duplicate genes instead of three, or having lower-scoring gene pairs. This approach identified a further 32 candidate paired regions, bringing the total number of protein-coding genes on the duplication map to 905 (16% of the proteome). The updated map suggests that a second copy of the ribosomal DNA array has been deleted from chromosome IV.


Assuntos
Duplicação Gênica , Genoma Fúngico , Saccharomyces cerevisiae/genética , Cromossomos Fúngicos , Kluyveromyces/genética
12.
Gene ; 71(1): 135-46, 1988 Nov 15.
Artigo em Inglês | MEDLINE | ID: mdl-3145907

RESUMO

Eight bovine genomic clones have been identified as members of the aspartyl protease gene family. The clones were prepared in phage vector lambda EMBL3 from Sau3AI partial digests of DNA from a single animal. Restriction maps show that seven of these clones are related and comprise at least five non-overlapping sequences. Allowing for allelic variation these probably represent three or more different genes. The nucleotide sequences show open reading frames (ORFs) corresponding closely to exons 6, 7 and 8 of human and porcine pepsin A. Comparison with other aspartyl proteases shows that these are multiple bovine pepsin A genes. The seven clones would encode at least two different but closely related forms of pepsin A. The 5' splice site at the end of exon 7 in all seven clones is the unusual sequence GC. The eighth clone contains an ORF homologous to exon 2 of the mammalian aspartyl proteases. The corresponding amino acid sequence is more closely related to bovine chymosin than to any of the other known sequences; it may be functionally homologous to chymosin but could be a novel mammalian aspartyl protease. The intron/exon boundaries seen in both this clone and in the bovine pepsin A clones are at the same positions as found in human pepsin A, bovine chymosin and human and mouse renins, further evidence that the general structure of mammalian aspartyl protease genes has been strongly conserved.


Assuntos
Endopeptidases/genética , Família Multigênica , Sequência de Aminoácidos , Animais , Ácido Aspártico Endopeptidases , Sequência de Bases , Southern Blotting , Bovinos , Quimosina/genética , Clonagem Molecular , DNA/genética , Dados de Sequência Molecular , Pepsina A/genética , Filogenia , Mapeamento por Restrição
13.
Adv Parasitol ; 43: 161-266, 1999.
Artigo em Inglês | MEDLINE | ID: mdl-10214692

RESUMO

Many parasites have deployed proteinases to accomplish some of the tasks imposed by a parasitic life style, including tissue penetration, digestion of host tissue for nutrition and evasion of host immune responses. Information on proteinases from trematodes, cestodes and nematode parasites is reviewed, concentrating on those worms of major medical and economical importance. Their biochemical characterization is discussed, along with their putative biological roles and, where available, their associated genes. For example, proteinases expressed by the various stages of the schistosome life-cycle, in particular the well-characterized cercarial elastase which is involved in the penetration of the host skin and the variety of proteinases, such as cathepsin B (Sm31), cathepsin L1, cathepsin L2, cathepsin D, cathepsin C and legumain (Sm32), which are believed to be involved in the catabolism of host haemoglobin. The various endo- and exoproteinases of Fasciola hepatica, the causative agent of liver fluke disease, are reviewed, and recent reports of how these enzymes have been successfully employed in cocktail vaccines are discussed. The various proteinases of cestodes and of the diverse superfamilies of parasitic nematodes are detailed, with special attention being given to those parasites for which most is known, including species of Taenia, Echinococcus, Spirometra, Necator, Acylostoma and Haemonchus. By far the largest number of papers in the literature and entries to the sequence data bases dealing with proteinases of parasitic helminths report on enzymes belonging to the papain superfamily of cysteine proteinases. Accordingly, the final section of the review is devoted to a phylogenetic analysis of this superfamily using over 150 published sequences. This analysis shows that the papain superfamily can be divided into two major branches. Branch A contains the cathepin Bs, the cathepsin Cs and a novel family termed cathepsin Xs, while Branch B contains the cruzipains, cathepsin Ls, papain-like and aleurain/cathepsin H-like proteinases. The relationships of the helminth proteinases, and similar proteinases from protozoan parasites and other organisms, within these groups are discussed.


Assuntos
Endopeptidases/genética , Endopeptidases/metabolismo , Genes de Helmintos , Helmintíase/parasitologia , Helmintos/genética , Sequência de Aminoácidos , Animais , Endopeptidases/química , Endopeptidases/classificação , Helmintos/enzimologia , Humanos , Dados de Sequência Molecular , Filogenia
14.
Curr Genet ; 25(4): 379-83, 1994 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-8082182

RESUMO

Plastid ORF2280 proteins from five species of land plant are shown to have limited amino-acid sequence similarity to a family of proteins that includes the yeast CDC48, SEC18, PAS1 and SUG1 proteins, three subunits of the mammalian 26S protease, and the Escherichia coli FtsH protein. These proteins all contain one or two ATPase domains and many are involved in cell division, transport of proteins across membranes, or proteolysis. Similarity with the ORF2280 proteins is restricted to a single region of about 130 amino acids that contains: (1) sequences resembling a nucleotide binding site but lacking two normally conserved residues, and (2) a downstream conserved motif with the consensus sequence VIX2TX2PX3DPALX2P. Most of the rest of ORF2280 is very poorly conserved among land plants, even though other family members such as CDC48 have slow rates of protein sequence evolution. In contrast, a protein encoded by plastid DNA of the rhodophyte alga Porphyra purpurea is very similar to E. coli FtsH. Phylogenetic analysis suggests that the red and green plastid genes are not true homologues (orthologues) but distinct members of an ancient gene family.


Assuntos
Adenosina Trifosfatases/genética , Trifosfato de Adenosina/metabolismo , Proteínas de Bactérias/genética , Proteínas de Ciclo Celular , Proteínas Fúngicas/genética , Proteínas de Membrana/genética , Proteínas de Plantas/genética , Plastídeos/metabolismo , Proteases Dependentes de ATP , Sequência de Aminoácidos , Sítios de Ligação , Sequência Consenso , Proteínas de Escherichia coli , Dados de Sequência Molecular , Família Multigênica , Homologia de Sequência de Aminoácidos , Proteína com Valosina
15.
J Theor Biol ; 149(4): 441-51, 1991 Apr 21.
Artigo em Inglês | MEDLINE | ID: mdl-2062104

RESUMO

Experimental studies have shown that the fidelity of DNA replication can be affected by the concentrations of free deoxyribonucleotides present in the cell. Replication of mammalian chromosomes is achieved using pools of newly-synthesized deoxyribonucleotides which fluctuate during the cell cycle. Since regions of mammalian chromosomes are replicated sequentially, there is the potential for differences among mammalian loci in both the relative and absolute frequencies of the various transitional and transversional mutations which may occur. Where these mutations are effectively neutral, at silent sites in genes and in non-coding sequences, this may result in different rates of evolution and in different base compositions, as have been observed in data from mammalian genes. A simple model of the DNA replication process is developed to describe how the mutation rate could be affected by the G + C contents of the deoxyribonucleotide pools and of the replicating DNA. Mutation rates are predicted to vary from locus to locus; only in the particular case of identical G + C contents in the DNA locus and the deoxyribonucleotide pools, and no proofreading, will the mutation rate be uniform over all loci.


Assuntos
Replicação do DNA , Animais , Sequência de Bases , Ciclo Celular , Desoxirribonucleotídeos/metabolismo , Matemática , Modelos Genéticos , Mutação
16.
Curr Genet ; 13(1): 97-9, 1988.
Artigo em Inglês | MEDLINE | ID: mdl-3359497

RESUMO

Analysis of a nucleotide sequence from pea chloroplast DNA which spans the site of deletion of one copy of the large inverted repeat (IR) element suggests that the IR unit has been entirely deleted, but that the single-copy coding sequences which flanked it (trnH and ndh5) have been left intact. However, these flanking sequences have not simply been ligated together: between them there is instead a novel approximately 200 bp DNA sequence which includes two regions apparently derived by the duplication of fragments of the psbA and rbcL genes.


Assuntos
Cloroplastos/metabolismo , Deleção Cromossômica , DNA/genética , Genes , Plantas/genética , Sequência de Bases , Fabaceae/genética , Dados de Sequência Molecular , Plantas Medicinais , Sequências Repetitivas de Ácido Nucleico
17.
Nat Rev Genet ; 2(5): 333-41, 2001 May.
Artigo em Inglês | MEDLINE | ID: mdl-11331899

RESUMO

Thirty years after Susumu Ohno proposed that vertebrate genomes are degenerate polyploids, the extent to which genome duplication contributed to the evolution of the vertebrate genome, if at all, is still uncertain. Sequence-level studies on model organisms whose genomes show clearer evidence of ancient polyploidy are invaluable because they indicate what the evolutionary products of genome duplication can look like. The greatest mystery is the molecular basis of diploidization, the evolutionary process by which a polyploid genome turns into a diploid one.


Assuntos
Diploide , Evolução Molecular , Genes de Plantas/genética , Genoma de Planta , Plantas/genética , Poliploidia , Saccharomyces cerevisiae/genética , Animais , Duplicação Gênica , Frequência do Gene , Variação Genética , Humanos
18.
Yeast ; 16(12): 1131-45, 2000 Sep 15.
Artigo em Inglês | MEDLINE | ID: mdl-10953085

RESUMO

In 1982, Ikemura reported a strikingly unequal usage of different synonymous codons, in five Saccharomyces cerevisiae nuclear genes having high protein levels. To study this trend in detail, we examined data from three independent studies that used oligonucleotide arrays or SAGE to estimate mRNA concentrations for nearly all genes in the genome. Correlation coefficients were calculated for the relationship of mRNA concentration to four commonly used measures of synonymous codon usage bias: the codon adaptation index (CAI), the codon bias index (CBI), the frequency of optimal codons (F(op)), and the effective number of codons (N(c)). mRNA concentration was best approximated as an exponential function of each of these four measures. Of the four, the CAI was the most strongly correlated with mRNA concentration (r(s)=0.62+/-0.01, n=2525, p<10(-17)). When we controlled for CAI, mRNA concentration and protein length were negatively correlated (partial r(s)=-0.23+/-0.01, n=4765, p<10(-17)). This may result from selection to reduce the size of abundant proteins to minimize transcriptional and translational costs. When we controlled for mRNA concentration, protein length and CAI were positively correlated (partial r(s)=0.16+/-0.01, n=4765, p<10(-17)). This may reflect more effective selection in longer genes against missense errors during translation. The correlation coefficients between the mRNA levels of individual genes, as measured by different investigators and methods, were low, in the range r(s)=0.39-0.68.


Assuntos
Proteínas Fúngicas/metabolismo , Genoma Fúngico , RNA Mensageiro/metabolismo , Saccharomyces cerevisiae/genética , Códon , Perfilação da Expressão Gênica , Análise de Sequência com Séries de Oligonucleotídeos , Fases de Leitura Aberta , Saccharomyces cerevisiae/metabolismo
19.
Yeast ; 10 Suppl A: S41-6, 1994 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-8091860

RESUMO

We report the sequence of a 12 kilobase region spanning the centromere of Saccharomyces cerevisiae chromosome II. The sequence from the left arm includes genes for histones H2A and H2B. The sequence from the right arm includes a gene that probably encodes a novel trehalase, as well as the COQ1 gene (for an enzyme involved in coenzyme Q biosynthesis), and an open reading frame with significant similarity to bacterial genes of unknown function. The trehalase gene (YBR0106) on chromosome II is located beside the centromere and transcribed towards it. This is identical to the arrangement of the neutral trehalase gene (NTH1) beside the centromere of chromosome IV. The centromere regions of chromosomes II and IV may therefore have arisen through a duplication of the centromere region of an ancestral chromosome. The YBR0106 and NTH1 proteins are 77% identical in predicted amino acid sequence, but there is no pronounced sequence similarity between the two centromeres (CEN2 and CEN4) outside of the universally conserved CDE I and CDE III elements. The genes flanking the centromere and trehalase genes differ between the two chromosomes, so the similarity between chromosomes II and IV may be less extensive than that recently reported between chromosomes III and XIV.


Assuntos
Alquil e Aril Transferases , Centrômero , Cromossomos Fúngicos , Genes Fúngicos , Saccharomyces cerevisiae/genética , Sequência de Aminoácidos , Mapeamento Cromossômico , Sequência Conservada , Dimetilaliltranstransferase/genética , Genes Fúngicos/genética , Histonas/genética , Dados de Sequência Molecular , Alinhamento de Sequência , Análise de Sequência de DNA , Homologia de Sequência de Aminoácidos , Trealase/genética
20.
J Mol Evol ; 37(4): 441-56, 1993 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-8308912

RESUMO

As a paradigm of mammalian gene evolution, the nature and extent of DNA sequence divergence between homologous protein-coding genes from mouse and rat have been investigated. The data set examined includes 363 genes totalling 411 kilobases, making this by far the largest comparison conducted between a single pair of species. Mouse and rat genes are on average 93.4% identical in nucleotide sequence and 93.9% identical in amino acid sequence. Individual genes vary substantially in the extent of nonsynonymous nucleotide substitution, as expected from protein evolution studies; here the variation is characterized. The extent of synonymous (or silent) substitution also varies considerably among genes, though the coefficient of variation is about four times smaller than for nonsynonymous substitutions. A small number of genes mapped to the X-chromosome have a slower rate of molecular evolution than average, as predicted if molecular evolution is "male-driven." Base composition at silent sites varies from 33% to 95% G+C in different genes; mouse and rat homologues differ on average by only 1.7% in silent-site G+C, but it is shown that this is not necessarily due to any selective constraint on their base composition. Synonymous substitution rates and silent site base composition appear to be related (genes at intermediate G+C have on average higher rates), but the relationship is not as strong as in our earlier analyses. Rates of synonymous and nonsynonymous substitution are correlated, apparently because of an excess of substitutions involving adjacent pairs of nucleotides. Several factors suggest that synonymous codon usage in rodent genes is not subject to selection.


Assuntos
Evolução Biológica , Variação Genética , Camundongos/genética , Ratos/genética , Sequência de Aminoácidos , Animais , Sequência de Bases , Ligação Genética , Sistemas de Informação , Mamíferos/genética , Mutação , Proteínas/classificação , Proteínas/genética , Cromossomos Sexuais
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA