Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 46
Filtrar
1.
Nucleic Acids Res ; 47(D1): D529-D541, 2019 01 08.
Artículo en Inglés | MEDLINE | ID: mdl-30476227

RESUMEN

The Biological General Repository for Interaction Datasets (BioGRID: https://thebiogrid.org) is an open access database dedicated to the curation and archival storage of protein, genetic and chemical interactions for all major model organism species and humans. As of September 2018 (build 3.4.164), BioGRID contains records for 1 598 688 biological interactions manually annotated from 55 809 publications for 71 species, as classified by an updated set of controlled vocabularies for experimental detection methods. BioGRID also houses records for >700 000 post-translational modification sites. BioGRID now captures chemical interaction data, including chemical-protein interactions for human drug targets drawn from the DrugBank database and manually curated bioactive compounds reported in the literature. A new dedicated aspect of BioGRID annotates genome-wide CRISPR/Cas9-based screens that report gene-phenotype and gene-gene relationships. An extension of the BioGRID resource called the Open Repository for CRISPR Screens (ORCS) database (https://orcs.thebiogrid.org) currently contains over 500 genome-wide screens carried out in human or mouse cell lines. All data in BioGRID is made freely available without restriction, is directly downloadable in standard formats and can be readily incorporated into existing applications via our web service platforms. BioGRID data are also freely distributed through partner model organism databases and meta-databases.


Asunto(s)
Bases de Datos Factuales , Animales , Sistemas CRISPR-Cas , Curaduría de Datos , Descubrimiento de Drogas , Genes , Humanos , Ratones , Mapeo de Interacción de Proteínas
2.
Plant Physiol ; 179(4): 1893-1907, 2019 04.
Artículo en Inglés | MEDLINE | ID: mdl-30679268

RESUMEN

Determining the complete Arabidopsis (Arabidopsis thaliana) protein-protein interaction network is essential for understanding the functional organization of the proteome. Numerous small-scale studies and a couple of large-scale ones have elucidated a fraction of the estimated 300,000 binary protein-protein interactions in Arabidopsis. In this study, we provide evidence that a docking algorithm has the ability to identify real interactions using both experimentally determined and predicted protein structures. We ranked 0.91 million interactions generated by all possible pairwise combinations of 1,346 predicted structure models from an Arabidopsis predicted "structure-ome" and found a significant enrichment of real interactions for the top-ranking predicted interactions, as shown by cosubcellular enrichment analysis and yeast two-hybrid validation. Our success rate for computationally predicted, structure-based interactions was 63% of the success rate for published interactions naively tested using the yeast two-hybrid system and 2.7 times better than for randomly picked pairs of proteins. This study provides another perspective in interactome exploration and biological network reconstruction using protein structural information. We have made these interactions freely accessible through an improved Arabidopsis Interactions Viewer and have created community tools for accessing these and ∼2.8 million other protein-protein and protein-DNA interactions for hypothesis generation by researchers worldwide. The Arabidopsis Interactions Viewer is freely available at http://bar.utoronto.ca/interactions2/.


Asunto(s)
Proteínas de Arabidopsis/química , Arabidopsis/metabolismo , Mapas de Interacción de Proteínas , Programas Informáticos , Algoritmos , Arabidopsis/genética , Proteínas de Arabidopsis/metabolismo , Modelos Moleculares , Simulación del Acoplamiento Molecular , Proteoma , Técnicas del Sistema de Dos Híbridos
3.
Genome Res ; 26(5): 670-80, 2016 05.
Artículo en Inglés | MEDLINE | ID: mdl-26975778

RESUMEN

We can now routinely identify coding variants within individual human genomes. A pressing challenge is to determine which variants disrupt the function of disease-associated genes. Both experimental and computational methods exist to predict pathogenicity of human genetic variation. However, a systematic performance comparison between them has been lacking. Therefore, we developed and exploited a panel of 26 yeast-based functional complementation assays to measure the impact of 179 variants (101 disease- and 78 non-disease-associated variants) from 22 human disease genes. Using the resulting reference standard, we show that experimental functional assays in a 1-billion-year diverged model organism can identify pathogenic alleles with significantly higher precision and specificity than current computational methods.


Asunto(s)
Prueba de Complementación Genética/métodos , Enfermedades Genéticas Congénitas , Saccharomyces cerevisiae , Transcripción Genética , Enfermedades Genéticas Congénitas/genética , Enfermedades Genéticas Congénitas/metabolismo , Humanos , Saccharomyces cerevisiae/genética , Saccharomyces cerevisiae/metabolismo
4.
Nucleic Acids Res ; 45(D1): D369-D379, 2017 01 04.
Artículo en Inglés | MEDLINE | ID: mdl-27980099

RESUMEN

The Biological General Repository for Interaction Datasets (BioGRID: https://thebiogrid.org) is an open access database dedicated to the annotation and archival of protein, genetic and chemical interactions for all major model organism species and humans. As of September 2016 (build 3.4.140), the BioGRID contains 1 072 173 genetic and protein interactions, and 38 559 post-translational modifications, as manually annotated from 48 114 publications. This dataset represents interaction records for 66 model organisms and represents a 30% increase compared to the previous 2015 BioGRID update. BioGRID curates the biomedical literature for major model organism species, including humans, with a recent emphasis on central biological processes and specific human diseases. To facilitate network-based approaches to drug discovery, BioGRID now incorporates 27 501 chemical-protein interactions for human drug targets, as drawn from the DrugBank database. A new dynamic interaction network viewer allows the easy navigation and filtering of all genetic and protein interaction data, as well as for bioactive compounds and their established targets. BioGRID data are directly downloadable without restriction in a variety of standardized formats and are freely distributed through partner model organism databases and meta-databases.


Asunto(s)
Biología Computacional , Bases de Datos Genéticas , Proteínas , Animales , Biología Computacional/métodos , Curaduría de Datos , Minería de Datos , Humanos , Mapeo de Interacción de Proteínas , Mapas de Interacción de Proteínas , Procesamiento Proteico-Postraduccional , Proteínas/química , Proteínas/genética , Proteínas/metabolismo , Programas Informáticos
5.
Nucleic Acids Res ; 43(Database issue): D470-8, 2015 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-25428363

RESUMEN

The Biological General Repository for Interaction Datasets (BioGRID: http://thebiogrid.org) is an open access database that houses genetic and protein interactions curated from the primary biomedical literature for all major model organism species and humans. As of September 2014, the BioGRID contains 749,912 interactions as drawn from 43,149 publications that represent 30 model organisms. This interaction count represents a 50% increase compared to our previous 2013 BioGRID update. BioGRID data are freely distributed through partner model organism databases and meta-databases and are directly downloadable in a variety of formats. In addition to general curation of the published literature for the major model species, BioGRID undertakes themed curation projects in areas of particular relevance for biomedical sciences, such as the ubiquitin-proteasome system and various human disease-associated interaction networks. BioGRID curation is coordinated through an Interaction Management System (IMS) that facilitates the compilation interaction records through structured evidence codes, phenotype ontologies, and gene annotation. The BioGRID architecture has been improved in order to support a broader range of interaction and post-translational modification types, to allow the representation of more complex multi-gene/protein interactions, to account for cellular phenotypes through structured ontologies, to expedite curation through semi-automated text-mining approaches, and to enhance curation quality control.


Asunto(s)
Bases de Datos Genéticas , Redes Reguladoras de Genes , Mapeo de Interacción de Proteínas , Ácido Araquidónico/metabolismo , Enfermedad/genética , Humanos , Internet
6.
Nucleic Acids Res ; 41(Database issue): D816-23, 2013 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-23203989

RESUMEN

The Biological General Repository for Interaction Datasets (BioGRID: http//thebiogrid.org) is an open access archive of genetic and protein interactions that are curated from the primary biomedical literature for all major model organism species. As of September 2012, BioGRID houses more than 500 000 manually annotated interactions from more than 30 model organisms. BioGRID maintains complete curation coverage of the literature for the budding yeast Saccharomyces cerevisiae, the fission yeast Schizosaccharomyces pombe and the model plant Arabidopsis thaliana. A number of themed curation projects in areas of biomedical importance are also supported. BioGRID has established collaborations and/or shares data records for the annotation of interactions and phenotypes with most major model organism databases, including Saccharomyces Genome Database, PomBase, WormBase, FlyBase and The Arabidopsis Information Resource. BioGRID also actively engages with the text-mining community to benchmark and deploy automated tools to expedite curation workflows. BioGRID data are freely accessible through both a user-defined interactive interface and in batch downloads in a wide variety of formats, including PSI-MI2.5 and tab-delimited files. BioGRID records can also be interrogated and analyzed with a series of new bioinformatics tools, which include a post-translational modification viewer, a graphical viewer, a REST service and a Cytoscape plugin.


Asunto(s)
Bases de Datos Genéticas , Redes Reguladoras de Genes , Mapeo de Interacción de Proteínas , Arabidopsis/genética , Arabidopsis/metabolismo , Humanos , Internet , Saccharomyces cerevisiae/genética , Saccharomyces cerevisiae/metabolismo , Schizosaccharomyces/genética , Schizosaccharomyces/metabolismo , Interfaz Usuario-Computador
7.
Nat Rev Genet ; 9(7): 509-15, 2008 Jul.
Artículo en Inglés | MEDLINE | ID: mdl-18475267

RESUMEN

The Gene Ontology (GO) project is a collaboration among model organism databases to describe gene products from all organisms using a consistent and computable language. GO produces sets of explicitly defined, structured vocabularies that describe biological processes, molecular functions and cellular components of gene products in both a computer- and human-readable manner. Here we describe key aspects of GO, which, when overlooked, can cause erroneous results, and address how these pitfalls can be avoided.


Asunto(s)
Biología Computacional , Bases de Datos Genéticas , Proteínas/genética , Proteínas/metabolismo , Humanos , Procesamiento de Lenguaje Natural , Programas Informáticos
8.
Nucleic Acids Res ; 39(Database issue): D698-704, 2011 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-21071413

RESUMEN

The Biological General Repository for Interaction Datasets (BioGRID) is a public database that archives and disseminates genetic and protein interaction data from model organisms and humans (http://www.thebiogrid.org). BioGRID currently holds 347,966 interactions (170,162 genetic, 177,804 protein) curated from both high-throughput data sets and individual focused studies, as derived from over 23,000 publications in the primary literature. Complete coverage of the entire literature is maintained for budding yeast (Saccharomyces cerevisiae), fission yeast (Schizosaccharomyces pombe) and thale cress (Arabidopsis thaliana), and efforts to expand curation across multiple metazoan species are underway. The BioGRID houses 48,831 human protein interactions that have been curated from 10,247 publications. Current curation drives are focused on particular areas of biology to enable insights into conserved networks and pathways that are relevant to human health. The BioGRID 3.0 web interface contains new search and display features that enable rapid queries across multiple data types and sources. An automated Interaction Management System (IMS) is used to prioritize, coordinate and track curation across international sites and projects. BioGRID provides interaction data to several model organism databases, resources such as Entrez-Gene and other interaction meta-databases. The entire BioGRID 3.0 data collection may be downloaded in multiple file formats, including PSI MI XML. Source code for BioGRID 3.0 is freely available without any restrictions.


Asunto(s)
Bases de Datos Genéticas , Redes Reguladoras de Genes , Mapeo de Interacción de Proteínas , Animales , Arabidopsis/genética , Arabidopsis/metabolismo , Humanos , Saccharomyces cerevisiae/genética , Saccharomyces cerevisiae/metabolismo , Schizosaccharomyces/genética , Schizosaccharomyces/metabolismo , Interfaz Usuario-Computador
9.
Cell Rep Methods ; 3(9): 100580, 2023 09 25.
Artículo en Inglés | MEDLINE | ID: mdl-37703883

RESUMEN

Human biology is rooted in highly specialized cell types programmed by a common genome, 98% of which is outside of genes. Genetic variation in the enormous noncoding space is linked to the majority of disease risk. To address the problem of linking these variants to expression changes in primary human cells, we introduce ExPectoSC, an atlas of modular deep-learning-based models for predicting cell-type-specific gene expression directly from sequence. We provide models for 105 primary human cell types covering 7 organ systems, demonstrate their accuracy, and then apply them to prioritize relevant cell types for complex human diseases. The resulting atlas of sequence-based gene expression and variant effects is publicly available in a user-friendly interface and readily extensible to any primary cell types. We demonstrate the accuracy of our approach through systematic evaluations and apply the models to prioritize ClinVar clinical variants of uncertain significance, verifying our top predictions experimentally.


Asunto(s)
Ascomicetos , Humanos , Expresión Génica/genética
10.
Sci Adv ; 9(21): eadg5702, 2023 05 26.
Artículo en Inglés | MEDLINE | ID: mdl-37235661

RESUMEN

Genome-wide phenotypic screens in the budding yeast Saccharomyces cerevisiae, enabled by its knockout collection, have produced the largest, richest, and most systematic phenotypic description of any organism. However, integrative analyses of this rich data source have been virtually impossible because of the lack of a central data repository and consistent metadata annotations. Here, we describe the aggregation, harmonization, and analysis of ~14,500 yeast knockout screens, which we call Yeast Phenome. Using this unique dataset, we characterized two unknown genes (YHR045W and YGL117W) and showed that tryptophan starvation is a by-product of many chemical treatments. Furthermore, we uncovered an exponential relationship between phenotypic similarity and intergenic distance, which suggests that gene positions in both yeast and human genomes are optimized for function.


Asunto(s)
Saccharomyces cerevisiae , Humanos , Saccharomyces cerevisiae/genética
11.
Nucleic Acids Res ; 38(Database issue): D433-6, 2010 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-19906697

RESUMEN

The Saccharomyces Genome Database (SGD; http://www.yeastgenome.org) is a scientific database for the molecular biology and genetics of the yeast Saccharomyces cerevisiae, which is commonly known as baker's or budding yeast. The information in SGD includes functional annotations, mapping and sequence information, protein domains and structure, expression data, mutant phenotypes, physical and genetic interactions and the primary literature from which these data are derived. Here we describe how published phenotypes and genetic interaction data are annotated and displayed in SGD.


Asunto(s)
Biología Computacional/métodos , Bases de Datos de Ácidos Nucleicos , Genoma Fúngico , Mutación , Saccharomyces cerevisiae/genética , Biología Computacional/tendencias , ADN de Hongos , Bases de Datos Genéticas , Bases de Datos de Proteínas , Genes Fúngicos , Almacenamiento y Recuperación de la Información/métodos , Internet , Fenotipo , Estructura Terciaria de Proteína , Programas Informáticos
12.
Database (Oxford) ; 20222022 10 05.
Artículo en Inglés | MEDLINE | ID: mdl-36197453

RESUMEN

The coronavirus disease 2019 (COVID-19) pandemic has compelled biomedical researchers to communicate data in real time to establish more effective medical treatments and public health policies. Nontraditional sources such as preprint publications, i.e. articles not yet validated by peer review, have become crucial hubs for the dissemination of scientific results. Natural language processing (NLP) systems have been recently developed to extract and organize COVID-19 data in reasoning systems. Given this scenario, the BioCreative COVID-19 text mining tool interactive demonstration track was created to assess the landscape of the available tools and to gauge user interest, thereby providing a two-way communication channel between NLP system developers and potential end users. The goal was to inform system designers about the performance and usability of their products and to suggest new additional features. Considering the exploratory nature of this track, the call for participation solicited teams to apply for the track, based on their system's ability to perform COVID-19-related tasks and interest in receiving user feedback. We also recruited volunteer users to test systems. Seven teams registered systems for the track, and >30 individuals volunteered as test users; these volunteer users covered a broad range of specialties, including bench scientists, bioinformaticians and biocurators. The users, who had the option to participate anonymously, were provided with written and video documentation to familiarize themselves with the NLP tools and completed a survey to record their evaluation. Additional feedback was also provided by NLP system developers. The track was well received as shown by the overall positive feedback from the participating teams and the users. Database URL: https://biocreative.bioinformatics.udel.edu/tasks/biocreative-vii/track-4/.


Asunto(s)
COVID-19 , COVID-19/epidemiología , Minería de Datos/métodos , Bases de Datos Factuales , Documentación , Humanos , Procesamiento de Lenguaje Natural
13.
Protein Sci ; 30(1): 187-200, 2021 01.
Artículo en Inglés | MEDLINE | ID: mdl-33070389

RESUMEN

The BioGRID (Biological General Repository for Interaction Datasets, thebiogrid.org) is an open-access database resource that houses manually curated protein and genetic interactions from multiple species including yeast, worm, fly, mouse, and human. The ~1.93 million curated interactions in BioGRID can be used to build complex networks to facilitate biomedical discoveries, particularly as related to human health and disease. All BioGRID content is curated from primary experimental evidence in the biomedical literature, and includes both focused low-throughput studies and large high-throughput datasets. BioGRID also captures protein post-translational modifications and protein or gene interactions with bioactive small molecules including many known drugs. A built-in network visualization tool combines all annotations and allows users to generate network graphs of protein, genetic and chemical interactions. In addition to general curation across species, BioGRID undertakes themed curation projects in specific aspects of cellular regulation, for example the ubiquitin-proteasome system, as well as specific disease areas, such as for the SARS-CoV-2 virus that causes COVID-19 severe acute respiratory syndrome. A recent extension of BioGRID, named the Open Repository of CRISPR Screens (ORCS, orcs.thebiogrid.org), captures single mutant phenotypes and genetic interactions from published high throughput genome-wide CRISPR/Cas9-based genetic screens. BioGRID-ORCS contains datasets for over 1,042 CRISPR screens carried out to date in human, mouse and fly cell lines. The biomedical research community can freely access all BioGRID data through the web interface, standardized file downloads, or via model organism databases and partner meta-databases.


Asunto(s)
COVID-19/genética , Bases de Datos Factuales , Mapeo de Interacción de Proteínas , Proteínas/genética , Animales , COVID-19/virología , Humanos , Ratones , SARS-CoV-2/genética , SARS-CoV-2/patogenicidad , Interfaz Usuario-Computador
14.
Nucleic Acids Res ; 36(Database issue): D637-40, 2008 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-18000002

RESUMEN

The Biological General Repository for Interaction Datasets (BioGRID) database (http://www.thebiogrid.org) was developed to house and distribute collections of protein and genetic interactions from major model organism species. BioGRID currently contains over 198 000 interactions from six different species, as derived from both high-throughput studies and conventional focused studies. Through comprehensive curation efforts, BioGRID now includes a virtually complete set of interactions reported to date in the primary literature for both the budding yeast Saccharomyces cerevisiae and the fission yeast Schizosaccharomyces pombe. A number of new features have been added to the BioGRID including an improved user interface to display interactions based on different attributes, a mirror site and a dedicated interaction management system to coordinate curation across different locations. The BioGRID provides interaction data with monthly updates to Saccharomyces Genome Database, Flybase and Entrez Gene. Source code for the BioGRID and the linked Osprey network visualization system is now freely available without restriction.


Asunto(s)
Bases de Datos Genéticas , Redes Reguladoras de Genes , Mapeo de Interacción de Proteínas , Animales , Caenorhabditis elegans/genética , Caenorhabditis elegans/metabolismo , Sistemas de Administración de Bases de Datos , Drosophila melanogaster/genética , Drosophila melanogaster/metabolismo , Humanos , Internet , Ratones , Saccharomyces cerevisiae/genética , Proteínas de Saccharomyces cerevisiae/metabolismo , Schizosaccharomyces/genética , Proteínas de Schizosaccharomyces pombe/metabolismo , Interfaz Usuario-Computador
15.
Nucleic Acids Res ; 36(Database issue): D577-81, 2008 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-17982175

RESUMEN

The Saccharomyces Genome Database (SGD; http://www.yeastgenome.org/) collects and organizes biological information about the chromosomal features and gene products of the budding yeast Saccharomyces cerevisiae. Although published data from traditional experimental methods are the primary sources of evidence supporting Gene Ontology (GO) annotations for a gene product, high-throughput experiments and computational predictions can also provide valuable insights in the absence of an extensive body of literature. Therefore, GO annotations available at SGD now include high-throughput data as well as computational predictions provided by the GO Annotation Project (GOA UniProt; http://www.ebi.ac.uk/GOA/). Because the annotation method used to assign GO annotations varies by data source, GO resources at SGD have been modified to distinguish data sources and annotation methods. In addition to providing information for genes that have not been experimentally characterized, GO annotations from independent sources can be compared to those made by SGD to help keep the literature-based GO annotations current.


Asunto(s)
Bases de Datos Genéticas , Genes Fúngicos , Proteínas de Saccharomyces cerevisiae/genética , Saccharomyces cerevisiae/genética , Biología Computacional , Genoma Fúngico , Genómica , Internet , Proteínas de Saccharomyces cerevisiae/química , Proteínas de Saccharomyces cerevisiae/fisiología , Interfaz Usuario-Computador , Vocabulario Controlado
16.
Neuron ; 107(5): 821-835.e12, 2020 09 09.
Artículo en Inglés | MEDLINE | ID: mdl-32603655

RESUMEN

A major obstacle to treating Alzheimer's disease (AD) is our lack of understanding of the molecular mechanisms underlying selective neuronal vulnerability, a key characteristic of the disease. Here, we present a framework integrating high-quality neuron-type-specific molecular profiles across the lifetime of the healthy mouse, which we generated using bacTRAP, with postmortem human functional genomics and quantitative genetics data. We demonstrate human-mouse conservation of cellular taxonomy at the molecular level for neurons vulnerable and resistant in AD, identify specific genes and pathways associated with AD neuropathology, and pinpoint a specific functional gene module underlying selective vulnerability, enriched in processes associated with axonal remodeling, and affected by amyloid accumulation and aging. We have made all cell-type-specific profiles and functional networks available at http://alz.princeton.edu. Overall, our study provides a molecular framework for understanding the complex interplay between Aß, aging, and neurodegeneration within the most vulnerable neurons in AD.


Asunto(s)
Enfermedad de Alzheimer/patología , Perfilación de la Expresión Génica/métodos , Aprendizaje Automático , Neuronas/patología , Transcriptoma , Envejecimiento/genética , Envejecimiento/patología , Enfermedad de Alzheimer/genética , Animales , Redes Reguladoras de Genes/fisiología , Humanos , Ratones
17.
Nucleic Acids Res ; 35(Database issue): D468-71, 2007 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-17142221

RESUMEN

The recent explosion in protein data generated from both directed small-scale studies and large-scale proteomics efforts has greatly expanded the quantity of available protein information and has prompted the Saccharomyces Genome Database (SGD; http://www.yeastgenome.org/) to enhance the depth and accessibility of protein annotations. In particular, we have expanded ongoing efforts to improve the integration of experimental information and sequence-based predictions and have redesigned the protein information web pages. A key feature of this redesign is the development of a GBrowse-derived interactive Proteome Browser customized to improve the visualization of sequence-based protein information. This Proteome Browser has enabled SGD to unify the display of hidden Markov model (HMM) domains, protein family HMMs, motifs, transmembrane regions, signal peptides, hydropathy plots and profile hits using several popular prediction algorithms. In addition, a physico-chemical properties page has been introduced to provide easy access to basic protein information. Improvements to the layout of the Protein Information page and integration of the Proteome Browser will facilitate the ongoing expansion of sequence-specific experimental information captured in SGD, including post-translational modifications and other user-defined annotations. Finally, SGD continues to improve upon the availability of genetic and physical interaction data in an ongoing collaboration with BioGRID by providing direct access to more than 82,000 manually-curated interactions.


Asunto(s)
Bases de Datos de Proteínas , Proteómica , Proteínas de Saccharomyces cerevisiae/química , Gráficos por Computador , Genoma Fúngico , Internet , Saccharomyces cerevisiae/genética , Proteínas de Saccharomyces cerevisiae/genética , Análisis de Secuencia de Proteína , Interfaz Usuario-Computador
19.
Cell Syst ; 8(2): 152-162.e6, 2019 02 27.
Artículo en Inglés | MEDLINE | ID: mdl-30685436

RESUMEN

A key challenge for the diagnosis and treatment of complex human diseases is identifying their molecular basis. Here, we developed a unified computational framework, URSAHD (Unveiling RNA Sample Annotation for Human Diseases), that leverages machine learning and the hierarchy of anatomical relationships present among diseases to integrate thousands of clinical gene expression profiles and identify molecular characteristics specific to each of the hundreds of complex diseases. URSAHD can distinguish between closely related diseases more accurately than literature-validated genes or traditional differential-expression-based computational approaches and is applicable to any disease, including rare and understudied ones. We demonstrate the utility of URSAHD in classifying related nervous system cancers and experimentally verifying novel neuroblastoma-associated genes identified by URSAHD. We highlight the applications for potential targeted drug-repurposing and for quantitatively assessing the molecular response to clinical therapies. URSAHD is freely available for public use, including the use of underlying models, at ursahd.princeton.edu.


Asunto(s)
Perfilación de la Expresión Génica/métodos , Genómica/métodos , Aprendizaje Automático/normas , Transcriptoma/genética , Humanos
20.
Nucleic Acids Res ; 34(Database issue): D442-5, 2006 Jan 01.
Artículo en Inglés | MEDLINE | ID: mdl-16381907

RESUMEN

Sequencing and annotation of the entire Saccharomyces cerevisiae genome has made it possible to gain a genome-wide perspective on yeast genes and gene products. To make this information available on an ongoing basis, the Saccharomyces Genome Database (SGD) (http://www.yeastgenome.org/) has created the Genome Snapshot (http://db.yeastgenome.org/cgi-bin/genomeSnapShot.pl). The Genome Snapshot summarizes the current state of knowledge about the genes and chromosomal features of S.cerevisiae. The information is organized into two categories: (i) number of each type of chromosomal feature annotated in the genome and (ii) number and distribution of genes annotated to Gene Ontology terms. Detailed lists are accessible through SGD's Advanced Search tool (http://db.yeastgenome.org/cgi-bin/search/featureSearch), and all the data presented on this page are available from the SGD ftp site (ftp://ftp.yeastgenome.org/yeast/).


Asunto(s)
Bases de Datos Genéticas , Genoma Fúngico , Proteínas de Saccharomyces cerevisiae/genética , Saccharomyces cerevisiae/genética , Cromosomas Fúngicos , Gráficos por Computador , Genómica , Internet , Proteínas de Saccharomyces cerevisiae/clasificación , Proteínas de Saccharomyces cerevisiae/fisiología , Interfaz Usuario-Computador
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA