Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 11 de 11
Filtrar
1.
Genome Res ; 25(5): 762-74, 2015 May.
Artículo en Inglés | MEDLINE | ID: mdl-25840857

RESUMEN

Saccharomyces cerevisiae, a well-established model for species as diverse as humans and pathogenic fungi, is more recently a model for population and quantitative genetics. S. cerevisiae is found in multiple environments-one of which is the human body-as an opportunistic pathogen. To aid in the understanding of the S. cerevisiae population and quantitative genetics, as well as its emergence as an opportunistic pathogen, we sequenced, de novo assembled, and extensively manually edited and annotated the genomes of 93 S. cerevisiae strains from multiple geographic and environmental origins, including many clinical origin strains. These 93 S. cerevisiae strains, the genomes of which are near-reference quality, together with seven previously sequenced strains, constitute a novel genetic resource, the "100-genomes" strains. Our sequencing coverage, high-quality assemblies, and annotation provide unprecedented opportunities for detailed interrogation of complex genomic loci, examples of which we demonstrate. We found most phenotypic variation to be quantitative and identified population, genotype, and phenotype associations. Importantly, we identified clinical origin associations. For example, we found that an introgressed PDR5 was present exclusively in clinical origin mosaic group strains; that the mosaic group was significantly enriched for clinical origin strains; and that clinical origin strains were much more copper resistant, suggesting that copper resistance contributes to fitness in the human host. The 100-genomes strains are a novel, multipurpose resource to advance the study of S. cerevisiae population genetics, quantitative genetics, and the emergence of an opportunistic pathogen.


Asunto(s)
Mapeo Contig/métodos , Genoma Fúngico , Genotipo , Fenotipo , Polimorfismo Genético , Saccharomyces cerevisiae/genética , Alineación de Secuencia/métodos , Filogenia , Saccharomyces cerevisiae/clasificación , Saccharomyces cerevisiae/patogenicidad , Virulencia/genética
2.
FEMS Yeast Res ; 15(8)2015 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-26463005

RESUMEN

We determined that extrachromosomal 2µ plasmid was present in 67 of the Saccharomyces cerevisiae 100-genome strains; in addition to variation in the size and copy number of 2µ, we identified three distinct classes of 2µ. We identified 2µ presence/absence and class associations with populations, clinical origin and nuclear genotypes. We also screened genome sequences of S. paradoxus, S. kudriavzevii, S. uvarum, S. eubayanus, S. mikatae, S. arboricolus and S. bayanus strains for both integrated and extrachromosomal 2µ. Similar to S. cerevisiae, we found no integrated 2µ sequences in any S. paradoxus strains. However, we identified part of 2µ integrated into the genomes of some S. uvarum, S. kudriavzevii, S. mikatae and S. bayanus strains, which were distinct from each other and from all extrachromosomal 2µ. We identified extrachromosomal 2µ in one S. paradoxus, one S. eubayanus, two S. bayanus and 13 S. uvarum strains. The extrachromosomal 2µ in S. paradoxus, S. eubayanus and S. cerevisiae were distinct from each other. In contrast, the extrachromosomal 2µ in S. bayanus and S. uvarum strains were identical with each other and with one of the three classes of S. cerevisiae 2µ, consistent with interspecific transfer.


Asunto(s)
Secuencias Repetitivas Esparcidas , Plásmidos , Saccharomyces/genética , Variación Genética , Saccharomyces/clasificación
3.
Genome Biol ; 25(1): 60, 2024 02 26.
Artículo en Inglés | MEDLINE | ID: mdl-38409096

RESUMEN

Assembled genome sequences are being generated at an exponential rate. Here we present FCS-GX, part of NCBI's Foreign Contamination Screen (FCS) tool suite, optimized to identify and remove contaminant sequences in new genomes. FCS-GX screens most genomes in 0.1-10 min. Testing FCS-GX on artificially fragmented genomes demonstrates high sensitivity and specificity for diverse contaminant species. We used FCS-GX to screen 1.6 million GenBank assemblies and identified 36.8 Gbp of contamination, comprising 0.16% of total bases, with half from 161 assemblies. We updated assemblies in NCBI RefSeq to reduce detected contamination to 0.01% of bases. FCS-GX is available at https://github.com/ncbi/fcs/ or https://doi.org/10.5281/zenodo.10651084 .


Asunto(s)
Bases de Datos de Ácidos Nucleicos , Genoma , Programas Informáticos
4.
G3 (Bethesda) ; 13(10)2023 09 30.
Artículo en Inglés | MEDLINE | ID: mdl-37497616

RESUMEN

We characterized previously identified RNA viruses (L-A, L-BC, 20S, and 23S), L-A-dependent M satellites (M1, M2, M28, and Mlus), and M satellite-dependent killer phenotypes in the Saccharomyces cerevisiae 100-genomes genetic resource population. L-BC was present in all strains, albeit in 2 distinct levels, L-BChi and L-BClo; the L-BC level is associated with the L-BC genotype. L-BChi, L-A, 20S, 23S, M1, M2, and Mlus (M28 was absent) were in fewer strains than the similarly inherited 2µ plasmid. Novel L-A-dependent phenotypes were identified. Ten M+ strains exhibited M satellite-dependent killing (K+) of at least 1 of the naturally M0 and cured M0 derivatives of the 100-genomes strains; in these M0 strains, sensitivities to K1+, K2+, and K28+ strains varied. Finally, to complement our M satellite-encoded killer toxin analysis, we assembled the chromosomal KHS1 and KHR1 killer genes and used naturally M0 and cured M0 derivatives of the 100-genomes strains to assess and characterize the chromosomal killer phenotypes.


Asunto(s)
Virus ARN , Saccharomyces cerevisiae , Saccharomyces cerevisiae/genética , ARN Viral/genética , ARN Bicatenario , Virus ARN/genética , Fenotipo
5.
bioRxiv ; 2023 06 06.
Artículo en Inglés | MEDLINE | ID: mdl-37292984

RESUMEN

Assembled genome sequences are being generated at an exponential rate. Here we present FCS-GX, part of NCBI's Foreign Contamination Screen (FCS) tool suite, optimized to identify and remove contaminant sequences in new genomes. FCS-GX screens most genomes in 0.1-10 minutes. Testing FCS-GX on artificially fragmented genomes demonstrates sensitivity >95% for diverse contaminant species and specificity >99.93%. We used FCS-GX to screen 1.6 million GenBank assemblies and identified 36.8 Gbp of contamination (0.16% of total bases), with half from 161 assemblies. We updated assemblies in NCBI RefSeq to reduce detected contamination to 0.01% of bases. FCS-GX is available at https://github.com/ncbi/fcs/.

6.
BMC Evol Biol ; 11: 80, 2011 Mar 29.
Artículo en Inglés | MEDLINE | ID: mdl-21447149

RESUMEN

BACKGROUND: Urea amidolyase breaks down urea into ammonia and carbon dioxide in a two-step process, while another enzyme, urease, does this in a one step-process. Urea amidolyase has been found only in some fungal species among eukaryotes. It contains two major domains: the amidase and urea carboxylase domains. A shorter form of urea amidolyase is known as urea carboxylase and has no amidase domain. Eukaryotic urea carboxylase has been found only in several fungal species and green algae. In order to elucidate the evolutionary origin of urea amidolyase and urea carboxylase, we studied the distribution of urea amidolyase, urea carboxylase, as well as other proteins including urease, across kingdoms. RESULTS: Among the 64 fungal species we examined, only those in two Ascomycota classes (Sordariomycetes and Saccharomycetes) had the urea amidolyase sequences. Urea carboxylase was found in many but not all of the species in the phylum Basidiomycota and in the subphylum Pezizomycotina (phylum Ascomycota). It was completely absent from the class Saccharomycetes (phylum Ascomycota; subphylum Saccharomycotina). Four Sordariomycetes species we examined had both the urea carboxylase and the urea amidolyase sequences. Phylogenetic analysis showed that these two enzymes appeared to have gone through independent evolution since their bacterial origin. The amidase domain and the urea carboxylase domain sequences from fungal urea amidolyases clustered strongly together with the amidase and urea carboxylase sequences, respectively, from a small number of beta- and gammaproteobacteria. On the other hand, fungal urea carboxylase proteins clustered together with another copy of urea carboxylases distributed broadly among bacteria. The urease proteins were found in all the fungal species examined except for those of the subphylum Saccharomycotina. CONCLUSIONS: We conclude that the urea amidolyase genes currently found only in fungi are the results of a horizontal gene transfer event from beta-, gamma-, or related species of proteobacteria. The event took place before the divergence of the subphyla Pezizomycotina and Saccharomycotina but after the divergence of the subphylum Taphrinomycotina. Urea carboxylase genes currently found in fungi and other limited organisms were also likely derived from another ancestral gene in bacteria. Our study presented another important example showing plastic and opportunistic genome evolution in bacteria and fungi and their evolutionary interplay.


Asunto(s)
Ligasas de Carbono-Nitrógeno/genética , Evolución Molecular , Hongos/enzimología , Hongos/genética , Bacterias/enzimología , Bacterias/genética , Ligasas de Carbono-Nitrógeno/química , Hongos/metabolismo , Transferencia de Gen Horizontal , Filogenia , Estructura Terciaria de Proteína , Homología de Secuencia de Aminoácido
7.
Genetics ; 211(2): 773-786, 2019 02.
Artículo en Inglés | MEDLINE | ID: mdl-30498022

RESUMEN

Mitochondrial genome variation and its effects on phenotypes have been widely analyzed in higher eukaryotes but less so in the model eukaryote Saccharomyces cerevisiae Here, we describe mitochondrial genome variation in 96 diverse S. cerevisiae strains and assess associations between mitochondrial genotype and phenotypes as well as nuclear-mitochondrial epistasis. We associate sensitivity to the ATP synthase inhibitor oligomycin with SNPs in the mitochondrially encoded ATP6 gene. We describe the use of iso-nuclear F1 pairs, the mitochondrial genome equivalent of reciprocal hemizygosity analysis, to identify and analyze mitochondrial genotype-dependent phenotypes. Using iso-nuclear F1 pairs, we analyze the oligomycin phenotype-ATP6 association and find extensive nuclear-mitochondrial epistasis. Similarly, in iso-nuclear F1 pairs, we identify many additional mitochondrial genotype-dependent respiration phenotypes, for which there was no association in the 96 strains, and again find extensive nuclear-mitochondrial epistasis that likely contributes to the lack of association in the 96 strains. Finally, in iso-nuclear F1 pairs, we identify novel mitochondrial genotype-dependent nonrespiration phenotypes: resistance to cycloheximide, ketoconazole, and copper. We discuss potential mechanisms and the implications of mitochondrial genotype and of nuclear-mitochondrial epistasis effects on respiratory and nonrespiratory quantitative traits.


Asunto(s)
Genoma Mitocondrial , Fenotipo , Polimorfismo Genético , Saccharomyces cerevisiae/genética , Antifúngicos/toxicidad , Respiración de la Célula/genética , Cobre/toxicidad , Cicloheximida/toxicidad , Farmacorresistencia Fúngica/genética , Epistasis Genética , Cetoconazol/toxicidad , ATPasas de Translocación de Protón Mitocondriales/genética , Polimorfismo de Nucleótido Simple , Saccharomyces cerevisiae/efectos de los fármacos , Proteínas de Saccharomyces cerevisiae/genética
8.
Database (Oxford) ; 20172017 01 01.
Artículo en Inglés | MEDLINE | ID: mdl-29220466

RESUMEN

The ITS (nuclear ribosomal internal transcribed spacer) RefSeq database at the National Center for Biotechnology Information (NCBI) is dedicated to the clear association between name, specimen and sequence data. This database is focused on sequences obtained from type material stored in public collections. While the initial ITS sequence curation effort together with numerous fungal taxonomy experts attempted to cover as many orders as possible, we extended our latest focus to the family and genus ranks. We focused on Trichoderma for several reasons, mainly because the asexual and sexual synonyms were well documented, and a list of proposed names and type material were recently proposed and published. In this case study the recent taxonomic information was applied to do a complete taxonomic audit for the genus Trichoderma in the NCBI Taxonomy database. A name status report is available here: https://www.ncbi.nlm.nih.gov/Taxonomy/TaxIdentifier/tax_identifier.cgi. As a result, the ITS RefSeq Targeted Loci database at NCBI has been augmented with more sequences from type and verified material from Trichoderma species. Additionally, to aid in the cross referencing of data from single loci and genomes we have collected a list of quality records of the RPB2 gene obtained from type material in GenBank that could help validate future submissions. During the process of curation misidentified genomes were discovered, and sequence records from type material were found hidden under previous classifications. Source metadata curation, although more cumbersome, proved to be useful as confirmation of the type material designation. Database URL:http://www.ncbi.nlm.nih.gov/bioproject/PRJNA177353


Asunto(s)
Bases de Datos de Ácidos Nucleicos , Proteínas Fúngicas/genética , Trichoderma/clasificación , Trichoderma/genética
9.
G3 (Bethesda) ; 4(11): 2259-69, 2014 Sep 17.
Artículo en Inglés | MEDLINE | ID: mdl-25236733

RESUMEN

An important issue in genome evolution is the mechanism by which tandem duplications are generated from single-copy genes. In the yeast Saccharomyces cerevisiae, most strains contain tandemly duplicated copies of CUP1, a gene that encodes a copper-binding metallothionein. By screening 101 natural isolates of S. cerevisiae, we identified five different types of CUP1-containing repeats, as well as strains that only had one copy of CUP1. A comparison of the DNA sequences of these strains indicates that the CUP1 tandem arrays were generated by unequal nonhomologous recombination events from strains that had one CUP1 gene.


Asunto(s)
Duplicación de Gen , Recombinación Homóloga , Metalotioneína/genética , Saccharomyces cerevisiae/genética , Evolución Molecular
10.
Genomics ; 89(5): 602-12, 2007 May.
Artículo en Inglés | MEDLINE | ID: mdl-17336495

RESUMEN

Computational methods of predicting protein functions rely on detecting similarities among proteins. However, sufficient sequence information is not always available for some protein families. For example, proteins of interest may be new members of a divergent protein family. The performance of protein classification methods could vary in such challenging situations. Using the G-protein-coupled receptor superfamily as an example, we investigated the performance of several protein classifiers. Alignment-free classifiers based on support vector machines using simple amino acid compositions were effective in remote-similarity detection even from short fragmented sequences. Although it is computationally expensive, a support vector machine classifier using local pairwise alignment scores showed very good balanced performance. More commonly used profile hidden Markov models were generally highly specific and well suited to classifying well-established protein family members. It is suggested that different types of protein classifiers should be applied to gain the optimal mining power.


Asunto(s)
Aminoácidos/química , Clasificación/métodos , Receptores Acoplados a Proteínas G/clasificación , Algoritmos , Aminoácidos/análisis , Animales , Drosophila melanogaster/genética , Etiquetas de Secuencia Expresada/química , Cadenas de Markov , Modelos Químicos , Receptores Acoplados a Proteínas G/química , Receptores Acoplados a Proteínas G/metabolismo
11.
Genome Biol ; 7(10): R96, 2006.
Artículo en Inglés | MEDLINE | ID: mdl-17064408

RESUMEN

To identify divergent seven-transmembrane receptor (7TMR) candidates from the Arabidopsis thaliana genome, multiple protein classification methods were combined, including both alignment-based and alignment-free classifiers. This resolved problems in optimally training individual classifiers using limited and divergent samples, and increased stringency for candidate proteins. We identified 394 proteins as 7TMR candidates and highlighted 54 with corresponding expression patterns for further investigation.


Asunto(s)
Arabidopsis/genética , Variación Genética , Genoma de Planta , Receptores de Superficie Celular/genética , Proteínas de Arabidopsis/genética , Bases de Datos de Proteínas , Perfilación de la Expresión Génica , Vectores Genéticos , Cadenas de Markov
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA