1.
Cell ; 158(3): 673-88, 2014 Jul 31.
Article in English | MEDLINE | ID: mdl-25083876

ABSTRACT

Trimethylation of histone H3 at lysine 4 (H3K4me3) is a chromatin modification known to mark the transcription start sites of active genes. Here, we show that H3K4me3 domains that spread more broadly over genes in a given cell type preferentially mark genes that are essential for the identity and function of that cell type. Using the broadest H3K4me3 domains as a discovery tool in neural progenitor cells, we identify novel regulators of these cells. Machine learning models reveal that the broadest H3K4me3 domains represent a distinct entity, characterized by increased marks of elongation. The broadest H3K4me3 domains also have more paused polymerase at their promoters, suggesting a unique transcriptional output. Indeed, genes marked by the broadest H3K4me3 domains exhibit enhanced transcriptional consistency and [corrected] increased transcriptional levels, and perturbation of H3K4me3 breadth leads to changes in transcriptional consistency. Thus, H3K4me3 breadth contains information that could ensure transcriptional precision at key cell identity/function genes.
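
The idea of ranking genes by H3K4me3 breadth can be illustrated with a minimal Python sketch. The abstract does not give the authors' procedure, so everything here is an assumption for illustration: the `(gene, start, end)` tuple layout, the function name `broadest_domains`, and the `top_fraction` cutoff are all hypothetical.

```python
def broadest_domains(domains, top_fraction=0.05):
    """Return the broadest H3K4me3 domains from a list of (gene, start, end).

    Breadth is taken as end - start in base pairs. The cutoff
    `top_fraction` is an illustrative assumption, not a value
    stated in the abstract.
    """
    # Rank domains from broadest to narrowest.
    ranked = sorted(domains, key=lambda d: d[2] - d[1], reverse=True)
    # Keep at least one domain, even for very small inputs.
    keep = max(1, int(len(ranked) * top_fraction))
    return ranked[:keep]


# Hypothetical example domains (coordinates are made up).
example = [
    ("GeneA", 100, 600),    # breadth 500
    ("GeneB", 1000, 5000),  # breadth 4000
    ("GeneC", 200, 1200),   # breadth 1000
    ("GeneD", 50, 350),     # breadth 300
]
top = broadest_domains(example, top_fraction=0.5)
```

With `top_fraction=0.5` the sketch keeps the two broadest of the four example domains, GeneB and GeneC.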


Subject(s)
Cells/metabolism , Histone Code , Histones/metabolism , Genetic Transcription , Animals , Artificial Intelligence , Genomics , Humans , Lysine/metabolism , Methylation , Inbred C57BL Mice , Neural Stem Cells/metabolism , RNA Polymerase II/metabolism
3.
Nucleic Acids Res ; 48(D1): D743-D748, 2020 01 08.
Article in English | MEDLINE | ID: mdl-31612944

ABSTRACT

The Saccharomyces Genome Database (SGD; www.yeastgenome.org) maintains the official annotation of all genes in the Saccharomyces cerevisiae reference genome and aims to elucidate the function of these genes and their products by integrating manually curated experimental data. Technological advances have allowed researchers to profile RNA expression and identify transcripts at high resolution. These data can be configured in web-based genome browser applications for display to the general public. Accordingly, SGD has incorporated published transcript isoform data in our instance of JBrowse, a genome visualization platform. This resource will help clarify S. cerevisiae biological processes by furthering studies of transcriptional regulation, untranslated regions, genome engineering, and expression quantification in S. cerevisiae.


Subject(s)
Fungal Genome , Saccharomyces cerevisiae Proteins/genetics , Saccharomyces cerevisiae/genetics , Transcriptome , Computational Biology/methods , Genetic Databases , Genomics , Molecular Sequence Annotation , Open Reading Frames , Protein Isoforms , RNA-Seq , Reference Values , User-Computer Interface , Web Browser
4.
Nucleic Acids Res ; 46(D1): D736-D742, 2018 01 04.
Article in English | MEDLINE | ID: mdl-29140510

ABSTRACT

The Saccharomyces Genome Database (SGD; http://www.yeastgenome.org) is an expertly curated database of literature-derived functional information for the model organism budding yeast, Saccharomyces cerevisiae. SGD constantly strives to synergize new types of experimental data and bioinformatics predictions with existing data, and to organize them into a comprehensive and up-to-date information resource. The primary mission of SGD is to facilitate research into the biology of yeast and to provide this wealth of information to advance, in many ways, research on other organisms, even those as evolutionarily distant as humans. To build such a bridge between biological kingdoms, SGD is curating data regarding yeast-human complementation, in which a human gene can successfully replace the function of a yeast gene, and/or vice versa. These data are manually curated from published literature, made available for download, and incorporated into a variety of analysis tools provided by SGD.


Subject(s)
Genetic Databases , Fungal Genome , Saccharomyces cerevisiae/genetics , Forecasting , Gene Ontology , Fungal Genes , Human Genome , Humans , Mutation , Species Specificity
5.
Dev Biol ; 426(2): 155-164, 2017 06 15.
Article in English | MEDLINE | ID: mdl-27157655

ABSTRACT

The Xenopus community has embraced recent advances in sequencing technology, resulting in the accumulation of numerous RNA-Seq and ChIP-Seq datasets. However, easily accessing and comparing datasets generated by multiple laboratories is challenging. Thus, we have created a central space to view, search and analyze data, providing essential information on gene expression changes and regulatory elements present in the genome. XenMine (www.xenmine.org) is a user-friendly website containing published genomic datasets from both Xenopus tropicalis and Xenopus laevis. We have established an analysis pipeline where all published datasets are uniformly processed with the latest genome releases. Information from these datasets can be extracted and compared using an array of pre-built or custom templates. With these search tools, users can easily extract sequences for all putative regulatory domains surrounding a gene of interest, identify the expression values of a gene of interest over developmental time, and analyze lists of genes for gene ontology terms and publications. Additionally, XenMine hosts an in-house genome browser that allows users to visualize all available ChIP-Seq data, extract specifically marked sequences, and aid in identifying important regulatory elements within the genome. Altogether, XenMine is an excellent tool for visualizing, accessing and querying analyzed datasets rapidly and efficiently.


Subject(s)
Data Mining , Genetic Databases , Genome , Genomics/methods , Xenopus/genetics , Animals , Base Sequence , Datasets as Topic , Gene Expression , Gene Ontology , Internet , RNA/biosynthesis , RNA/genetics , Nucleic Acid Regulatory Sequences , Software
6.
Nucleic Acids Res ; 44(D1): D698-702, 2016 Jan 04.
Article in English | MEDLINE | ID: mdl-26578556

ABSTRACT

The Saccharomyces Genome Database (SGD; http://www.yeastgenome.org) is the authoritative community resource for the Saccharomyces cerevisiae reference genome sequence and its annotation. In recent years, we have moved toward increased representation of sequence variation and allelic differences within S. cerevisiae. The publication of numerous additional genomes has motivated the creation of new tools for their annotation and analysis. Here we present the Variant Viewer: a dynamic open-source web application for the visualization of genomic and proteomic differences. Multiple sequence alignments have been constructed across high quality genome sequences from 11 different S. cerevisiae strains and stored in the SGD. The alignments and summaries are encoded in JSON and used to create a two-tiered dynamic view of the budding yeast pan-genome, available at http://www.yeastgenome.org/variant-viewer.


Subject(s)
Genetic Databases , Genetic Variation , Fungal Genome , Saccharomyces cerevisiae/genetics , Molecular Sequence Annotation , Sequence Alignment , DNA Sequence Analysis , Protein Sequence Analysis , User-Computer Interface
7.
Nucleic Acids Res ; 42(Database issue): D717-25, 2014 Jan.
Article in English | MEDLINE | ID: mdl-24265222

ABSTRACT

The Saccharomyces Genome Database (SGD; http://www.yeastgenome.org) is the community resource for genomic, gene and protein information about the budding yeast Saccharomyces cerevisiae, containing a variety of functional information about each yeast gene and gene product. We have recently added regulatory information to SGD and present it on a new tabbed section of the Locus Summary entitled 'Regulation'. We are compiling transcriptional regulator-target gene relationships, which are curated from the literature at SGD or imported, with permission, from the YEASTRACT database. For nearly every S. cerevisiae gene, the Regulation page displays a table of annotations showing the regulators of that gene, and a graphical visualization of its regulatory network. For genes whose products act as transcription factors, the Regulation page also shows a table of their target genes, accompanied by a Gene Ontology enrichment analysis of the biological processes in which those genes participate. We additionally synthesize information from the literature for each transcription factor in a free-text Regulation Summary, and provide other information relevant to its regulatory function, such as DNA binding site motifs and protein domains. All of the regulation data are available for querying, analysis and download via YeastMine, the InterMine-based data warehouse system in use at SGD.


Subject(s)
Genetic Databases , Fungal Gene Expression Regulation , Fungal Genome , Saccharomyces cerevisiae/genetics , Binding Sites , Gene Regulatory Networks , Internet , Tertiary Protein Structure , Saccharomyces cerevisiae Proteins/chemistry , Saccharomyces cerevisiae Proteins/genetics , Saccharomyces cerevisiae Proteins/metabolism , Transcription Factors/chemistry , Transcription Factors/metabolism , Genetic Transcription
8.
Genesis ; 53(8): 547-60, 2015 Aug.
Article in English | MEDLINE | ID: mdl-26097192

ABSTRACT

InterMine is a data integration warehouse and analysis software system developed for large and complex biological data sets. Designed for integrative analysis, it can be accessed through a user-friendly web interface. For bioinformaticians, extensive web services as well as programming interfaces for most common scripting languages support access to all features. The web interface includes a useful identifier look-up system, and both simple and sophisticated search options. Interactive results tables enable exploration, and data can be filtered, summarized, and browsed. A set of graphical analysis tools provide a rich environment for data exploration including statistical enrichment of sets of genes or other entities. InterMine databases have been developed for the major model organisms, budding yeast, nematode worm, fruit fly, zebrafish, mouse, and rat together with a newly developed human database. Here, we describe how this has facilitated interoperation and development of cross-organism analysis tools and reports. InterMine as a data exploration and analysis tool is also described. All the InterMine-based systems described in this article are resources freely available to the scientific community.


Subject(s)
Factual Databases , Software , Animals , Computational Biology/methods , Genetic Databases , Genomics , Humans , Internet , Systems Integration , User-Computer Interface
9.
Nucleic Acids Res ; 40(Database issue): D700-5, 2012 Jan.
Article in English | MEDLINE | ID: mdl-22110037

ABSTRACT

The Saccharomyces Genome Database (SGD, http://www.yeastgenome.org) is the community resource for the budding yeast Saccharomyces cerevisiae. The SGD project provides the highest-quality manually curated information from peer-reviewed literature. The experimental results reported in the literature are extracted and integrated within a well-developed database. These data are combined with quality high-throughput results and provided through Locus Summary pages, a powerful query engine and rich genome browser. The acquisition, integration and retrieval of these data allow SGD to facilitate experimental design and analysis by providing an encyclopedia of the yeast genome, its chromosomal features, their functions and interactions. Public access to these data is provided to researchers and educators via web pages designed for optimal ease of use.


Subject(s)
Genetic Databases , Fungal Genome , Saccharomyces cerevisiae/genetics , Fungal Genes , Genomics , High-Throughput Nucleotide Sequencing , Molecular Sequence Annotation , Phenotype , Software , Terminology as Topic
10.
Genome Res ; 20(1): 142-54, 2010 Jan.
Article in English | MEDLINE | ID: mdl-19846609

ABSTRACT

ProPhylER (Protein Phylogeny and Evolutionary Rates) is a next-generation curated proteome resource that uses comparative sequence analysis to predict constraint and mutation impact for eukaryotic proteins. Its purpose is to inform any research program for which protein function and structure are relevant, by the predictive power of evolutionary constraint analyses. ProPhylER currently has nearly 9000 clusters of related proteins, including more than 200,000 sequences. It serves data via two interfaces. The "ProPhylER Interface" displays predictive analyses in sequence space; the "CrystalPainter" maps evolutionary constraints onto solved protein structures. Here we summarize ProPhylER's data content and analysis pipeline, demonstrate the use of ProPhylER's interfaces, and evaluate ProPhylER's unique regional analysis of evolutionary constraint. The high accuracy of ProPhylER's regional analysis complements the high resolution of its single-site analysis to effectively guide and inform structure-function investigations and predict the impact of polymorphisms.


Subject(s)
Protein Databases , Eukaryota , Molecular Evolution , Internet , Phylogeny , Proteins , Eukaryota/genetics , Eukaryota/metabolism , Single Nucleotide Polymorphism , Proteins/chemistry , Proteins/genetics , Proteins/metabolism , Structure-Activity Relationship , User-Computer Interface
11.
Genome Res ; 20(3): 301-10, 2010 Mar.
Article in English | MEDLINE | ID: mdl-20067941

ABSTRACT

Here, we demonstrate how comparative sequence analysis facilitates genome-wide base-pair-level interpretation of individual genetic variation and address two questions of importance for human personal genomics: first, whether an individual's functional variation comes mostly from noncoding or coding polymorphisms; and, second, whether population-specific or globally-present polymorphisms contribute more to functional variation in any given individual. Neither has been definitively answered by analyses of existing variation data because of a focus on coding polymorphisms, ascertainment biases in favor of common variation, and a lack of base-pair-level resolution for identifying functional variants. We resequenced 575 amplicons within 432 individuals at genomic sites enriched for evolutionary constraint and also analyzed variation within three published human genomes. We find that single-site measures of evolutionary constraint derived from mammalian multiple sequence alignments are strongly predictive of reductions in modern-day genetic diversity across a range of annotation categories and across the allele frequency spectrum from rare (<1%) to high frequency (>10% minor allele frequency). Furthermore, we show that putatively functional variation in an individual genome is dominated by polymorphisms that do not change protein sequence and that originate from our shared ancestral population and commonly segregate in human populations. These observations show that common, noncoding alleles contribute substantially to human phenotypes and that constraint-based analyses will be of value to identify phenotypically relevant variants in individual genomes.


Subject(s)
Alleles , Gene Frequency , Genetic Variation , Human Genome , Sequence Alignment , Amino Acid Sequence , Animals , Base Sequence , Biological Evolution , Genetic Testing , Genome , Genomics , Humans , Mammals/genetics , Phenotype , Genetic Polymorphism , Nucleic Acid Regulatory Sequences
12.
Genetics ; 224(1), 2023 05 04.
Article in English | MEDLINE | ID: mdl-36607068

ABSTRACT

As one of the first model organism knowledgebases, Saccharomyces Genome Database (SGD) has been supporting the scientific research community since 1993. As technologies and research evolve, so does SGD: from updates in software architecture, to curation of novel data types, to incorporation of data from, and collaboration with, other knowledgebases. We are continuing to make steps toward providing the community with an S. cerevisiae pan-genome. Here, we describe software upgrades, a new nomenclature system for genes not found in the reference strain, and additions to gene pages. With these improvements, we aim to remain a leading resource for students, researchers, and the broader scientific community.


Subject(s)
Saccharomyces , Humans , Saccharomyces/genetics , Saccharomyces cerevisiae/genetics , Fungal Genome , Genetic Databases , Software
13.
Genetics ; 220(4), 2022 04 04.
Article in English | MEDLINE | ID: mdl-34897464

ABSTRACT

Saccharomyces cerevisiae is used to provide fundamental understanding of eukaryotic genetics, gene product function, and cellular biological processes. Saccharomyces Genome Database (SGD) has been supporting the yeast research community since 1993, serving as its de facto hub. Over the years, SGD has maintained the genetic nomenclature, chromosome maps, and functional annotation, and developed various tools and methods for analysis and curation of a variety of emerging data types. More recently, SGD and six other model organism focused knowledgebases have come together to create the Alliance of Genome Resources to develop sustainable genome information resources that promote and support the use of various model organisms to understand the genetic and genomic bases of human biology and disease. Here we describe recent activities at SGD, including the latest reference genome annotation update, the development of a curation system for mutant alleles, and new pages addressing homology across model organisms as well as the use of yeast to study human disease.


Subject(s)
Saccharomyces , Alleles , Genetic Databases , Fungal Genome , Humans , Saccharomyces/genetics , Saccharomyces cerevisiae/genetics
14.
Database (Oxford) ; 2020, 2020 01 01.
Article in English | MEDLINE | ID: mdl-32128557

ABSTRACT

The identification and accurate quantitation of protein abundance has been a major objective of proteomics research. Abundance studies have the potential to provide users with data that can be used to gain a deeper understanding of protein function and regulation and can also help identify cellular pathways and modules that operate under various environmental stress conditions. One of the central missions of the Saccharomyces Genome Database (SGD; https://www.yeastgenome.org) is to work with researchers to identify and incorporate datasets of interest to the wider scientific community, thereby enabling hypothesis-driven research. A large number of studies have detailed efforts to generate proteome-wide abundance data, but deeper analyses of these data have been hampered by the inability to compare results between studies. Recently, a unified protein abundance dataset was generated through the evaluation of more than 20 abundance datasets, which were normalized and converted to common measurement units, in this case molecules per cell. We have incorporated these normalized protein abundance data and associated metadata into the SGD database, as well as the SGD YeastMine data warehouse, resulting in the addition of 56 487 values for untreated cells grown in either rich or defined media and 28 335 values for cells treated with environmental stressors. Abundance data for protein-coding genes are displayed in a sortable, filterable table on Protein pages, available through Locus Summary pages. A median abundance value was incorporated, and a median absolute deviation was calculated for each protein-coding gene and incorporated into SGD. These values are displayed in the Protein section of the Locus Summary page. The inclusion of these data has enhanced the quality and quantity of protein experimental information presented at SGD and provides opportunities for researchers to access and utilize the data to further their research.
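
The two summary statistics named in the abstract, a median abundance and a median absolute deviation (MAD) per protein-coding gene, can be sketched as follows. This is a minimal illustration using Python's standard library, not SGD's implementation; the function name `median_and_mad` and the sample values are hypothetical.

```python
from statistics import median


def median_and_mad(values):
    """Return (median, median absolute deviation) for a list of numbers.

    The MAD is the median of the absolute differences between each
    value and the overall median, which makes it robust to outliers.
    """
    med = median(values)
    mad = median(abs(v - med) for v in values)
    return med, mad


# Hypothetical abundance values (molecules per cell) for one protein,
# as might be reported by five different studies.
abundances = [1000, 1200, 1100, 5000, 900]
med, mad = median_and_mad(abundances)
```

For these made-up values the median is 1100 and the MAD is 100; the single outlier of 5000 has little effect on either statistic, which is a common reason to prefer this pair over the mean and standard deviation when combining measurements from many studies.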


Subject(s)
Fungal Genome/genetics , Saccharomyces cerevisiae Proteins/genetics , Saccharomyces cerevisiae/genetics , Genetic Databases , Genomics/methods , Internet , Proteome/genetics , Proteome/metabolism , Proteomics/methods , Saccharomyces cerevisiae/metabolism , Saccharomyces cerevisiae Proteins/metabolism , User-Computer Interface
16.
Database (Oxford) ; 2017(1), 2017 01 01.
Article in English | MEDLINE | ID: mdl-28365727

ABSTRACT

Due to recent advancements in the production of experimental proteomic data, the Saccharomyces Genome Database (SGD; www.yeastgenome.org) has been expanding our protein curation activities to make new data types available to our users. Because of broad interest in post-translational modifications (PTM) and their importance to protein function and regulation, we have recently started incorporating expertly curated PTM information on individual protein pages. Here we also present the inclusion of new abundance and protein half-life data obtained from high-throughput proteome studies. These new data types have been included with the aim to facilitate cellular biology research. Database URL: www.yeastgenome.org.


Subject(s)
Protein Databases , Fungal Genome , Molecular Sequence Annotation , Proteome , Saccharomyces cerevisiae Proteins , Saccharomyces cerevisiae , Proteome/genetics , Proteome/metabolism , Saccharomyces cerevisiae/genetics , Saccharomyces cerevisiae/metabolism , Saccharomyces cerevisiae Proteins/genetics , Saccharomyces cerevisiae Proteins/metabolism
17.
Article in English | MEDLINE | ID: mdl-27252399

ABSTRACT

The Saccharomyces Genome Database (SGD; http://www.yeastgenome.org/) is the authoritative community resource for the Saccharomyces cerevisiae reference genome sequence and its annotation. To provide a wider scope of genetic and phenotypic variation in yeast, the genome sequences and their corresponding annotations from 11 alternative S. cerevisiae reference strains have been integrated into SGD. Genomic and protein sequence information for genes from these strains are now available on the Sequence and Protein tab of the corresponding Locus Summary pages. We illustrate how these genome sequences can be utilized to aid our understanding of strain-specific functional and phenotypic differences. Database URL: www.yeastgenome.org.


Subject(s)
Genetic Databases , Fungal Genome/genetics , Genomics/methods , Saccharomyces/genetics , Molecular Sequence Annotation , Reproducibility of Results , Saccharomyces cerevisiae/genetics , User-Computer Interface
18.
G3 (Bethesda) ; 4(3): 389-98, 2014 Mar 20.
Article in English | MEDLINE | ID: mdl-24374639

ABSTRACT

The genome of the budding yeast Saccharomyces cerevisiae was the first completely sequenced from a eukaryote. It was released in 1996 as the work of a worldwide effort of hundreds of researchers. In the time since, the yeast genome has been intensively studied by geneticists, molecular biologists, and computational scientists all over the world. Maintenance and annotation of the genome sequence have long been provided by the Saccharomyces Genome Database, one of the original model organism databases. To deepen our understanding of the eukaryotic genome, the S. cerevisiae strain S288C reference genome sequence was updated recently in its first major update since 1996. The new version, called "S288C 2010," was determined from a single yeast colony using modern sequencing technologies and serves as the anchor for further innovations in yeast genomic science.


Subject(s)
Fungal Genome , Saccharomyces cerevisiae/genetics , Chromosome Mapping , Factual Databases , Internet , Open Reading Frames , DNA Sequence Analysis , User-Computer Interface
19.
Database (Oxford) ; 2013: bat004, 2013.
Article in English | MEDLINE | ID: mdl-23396302

ABSTRACT

The Saccharomyces Genome Database (SGD) is a scientific database that provides researchers with high-quality curated data about the genes and gene products of Saccharomyces cerevisiae. To provide instant and easy access to this information on mobile devices, we have developed YeastGenome, a native application for the Apple iPhone and iPad. YeastGenome can be used to quickly find basic information about S. cerevisiae genes and chromosomal features regardless of internet connectivity. With or without network access, you can view basic information and Gene Ontology annotations about a gene of interest by searching gene names and gene descriptions or by browsing the database within the app to find the gene of interest. With internet access, the app provides more detailed information about the gene, including mutant phenotypes, references and protein and genetic interactions, as well as provides hyperlinks to retrieve detailed information by showing SGD pages and views of the genome browser. SGD provides online help describing basic ways to navigate the mobile version of SGD, highlights key features and answers frequently asked questions related to the app. The app is available from iTunes (http://itunes.com/apps/yeastgenome). The YeastGenome app is provided freely as a service to our community, as part of SGD's mission to provide free and open access to all its data and annotations.


Subject(s)
Cell Phone , Genetic Databases , Fungal Genome/genetics , Saccharomyces cerevisiae/genetics , Access to Information , Fungal Genes , Internet
20.
Sci Rep ; 3: 1802, 2013.
Article in English | MEDLINE | ID: mdl-23652793

ABSTRACT

Model organisms are widely used for understanding basic biology, and have significantly contributed to the study of human disease. In recent years, genomic analysis has provided extensive evidence of widespread conservation of gene sequence and function amongst eukaryotes, allowing insights from model organisms to help decipher gene function in a wider range of species. The InterMOD consortium is developing an infrastructure based around the InterMine data warehouse system to integrate genomic and functional data from a number of key model organisms, leading the way to improved cross-species research. So far including budding yeast, nematode worm, fruit fly, zebrafish, rat and mouse, the project has set up data warehouses, synchronized data models, and created analysis tools and links between data from different species. The project unites a number of major model organism databases, improving both the consistency and accessibility of comparative research, to the benefit of the wider scientific community.


Subject(s)
Genome , Genetic Models , Animals , Factual Databases , Genetic Databases , Genomics/methods