Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 9 de 9
Filtrar
1.
Nucleic Acids Res ; 46(W1): W60-W64, 2018 07 02.
Artículo en Inglés | MEDLINE | ID: mdl-29912392

RESUMEN

GeneMANIA (http://genemania.org) is a flexible user-friendly web site for generating hypotheses about gene function, analyzing gene lists and prioritizing genes for functional assays. Given a query gene list, GeneMANIA finds functionally similar genes using a wealth of genomics and proteomics data. In this mode, it weights each functional genomic dataset according to its predictive value for the query. Another use of GeneMANIA is gene function prediction. Given a single query gene, GeneMANIA finds genes likely to share function with it based on their interactions with it. Enriched Gene Ontology categories among this set can point to the function of the gene. Nine organisms are currently supported (Arabidopsis thaliana, Caenorhabditis elegans, Danio rerio, Drosophila melanogaster, Escherichia coli, Homo sapiens, Mus musculus, Rattus norvegicus and Saccharomyces cerevisiae). Hundreds of data sets and hundreds of millions of interactions have been collected from GEO, BioGRID, IRefIndex and I2D, as well as organism-specific functional genomics data sets. Users can customize their search by selecting specific data sets to query and by uploading their own data sets to analyze. We have recently updated the user interface to GeneMANIA to make it more intuitive and make more efficient use of visual space. GeneMANIA can now be used effectively on a variety of devices.


Asunto(s)
Bases de Datos Genéticas , Genómica , Internet , Programas Informáticos , Algoritmos , Animales , Arabidopsis/genética , Drosophila melanogaster/genética , Ontología de Genes , Redes Reguladoras de Genes/genética , Humanos , Ratones , Ratas , Saccharomyces cerevisiae/genética , Pez Cebra/genética
2.
Bioinformatics ; 34(13): i429-i437, 2018 07 01.
Artículo en Inglés | MEDLINE | ID: mdl-29949959

RESUMEN

Motivation: Alternative splice site selection is inherently competitive and the probability of a given splice site to be used also depends on the strength of neighboring sites. Here, we present a new model named the competitive splice site model (COSSMO), which explicitly accounts for these competitive effects and predicts the percent selected index (PSI) distribution over any number of putative splice sites. We model an alternative splicing event as the choice of a 3' acceptor site conditional on a fixed upstream 5' donor site or the choice of a 5' donor site conditional on a fixed 3' acceptor site. We build four different architectures that use convolutional layers, communication layers, long short-term memory and residual networks, respectively, to learn relevant motifs from sequence alone. We also construct a new dataset from genome annotations and RNA-Seq read data that we use to train our model. Results: COSSMO is able to predict the most frequently used splice site with an accuracy of 70% on unseen test data, and achieve an R2 of 0.6 in modeling the PSI distribution. We visualize the motifs that COSSMO learns from sequence and show that COSSMO recognizes the consensus splice site sequences and many known splicing factors with high specificity. Availability and implementation: Model predictions, our training dataset, and code are available from http://cossmo.genes.toronto.edu. Supplementary information: Supplementary data are available at Bioinformatics online.


Asunto(s)
Empalme Alternativo , Aprendizaje Profundo , Sitios de Empalme de ARN , Análisis de Secuencia de ARN/métodos , Biología Computacional/métodos , Humanos , Modelos Genéticos , Probabilidad , Programas Informáticos
3.
Development ; 141(1): 224-35, 2014 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-24346703

RESUMEN

Comprehensive functional annotation of vertebrate genomes is fundamental to biological discovery. Reverse genetic screening has been highly useful for determination of gene function, but is untenable as a systematic approach in vertebrate model organisms given the number of surveyable genes and observable phenotypes. Unbiased prediction of gene-phenotype relationships offers a strategy to direct finite experimental resources towards likely phenotypes, thus maximizing de novo discovery of gene functions. Here we prioritized genes for phenotypic assay in zebrafish through machine learning, predicting the effect of loss of function of each of 15,106 zebrafish genes on 338 distinct embryonic anatomical processes. Focusing on cardiovascular phenotypes, the learning procedure predicted known knockdown and mutant phenotypes with high precision. In proof-of-concept studies we validated 16 high-confidence cardiac predictions using targeted morpholino knockdown and initial blinded phenotyping in embryonic zebrafish, confirming a significant enrichment for cardiac phenotypes as compared with morpholino controls. Subsequent detailed analyses of cardiac function confirmed these results, identifying novel physiological defects for 11 tested genes. Among these we identified tmem88a, a recently described attenuator of Wnt signaling, as a discrete regulator of the patterning of intercellular coupling in the zebrafish cardiac epithelium. Thus, we show that systematic prioritization in zebrafish can accelerate the pace of developmental gene function discovery.


Asunto(s)
Regulación del Desarrollo de la Expresión Génica , Corazón/embriología , Proteínas de la Membrana/metabolismo , Miocardio/citología , Proteínas de Pez Cebra/metabolismo , Pez Cebra/embriología , Pez Cebra/genética , Animales , Embrión no Mamífero/metabolismo , Técnicas de Silenciamiento del Gen , Proteínas de la Membrana/genética , Morfolinos/genética , Fenotipo , Vía de Señalización Wnt/genética , Proteínas de Pez Cebra/genética
4.
Bioinformatics ; 31(3): 306-10, 2015 Feb 01.
Artículo en Inglés | MEDLINE | ID: mdl-25316676

RESUMEN

MOTIVATION: The model bacterium Escherichia coli is among the best studied prokaryotes, yet nearly half of its proteins are still of unknown biological function. This is despite a wealth of available large-scale physical and genetic interaction data. To address this, we extended the GeneMANIA function prediction web application developed for model eukaryotes to support E.coli. RESULTS: We integrated 48 distinct E.coli functional interaction datasets and used the GeneMANIA algorithm to produce thousands of novel functional predictions and prioritize genes for further functional assays. Our analysis achieved cross-validation performance comparable to that reported for eukaryotic model organisms, and revealed new functions for previously uncharacterized genes in specific bioprocesses, including components required for cell adhesion, iron-sulphur complex assembly and ribosome biogenesis. The GeneMANIA approach for network-based function prediction provides an innovative new tool for probing mechanisms underlying bacterial bioprocesses. CONTACT: gary.bader@utoronto.ca; mohan.babu@uregina.ca SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Algoritmos , Proteínas de Escherichia coli/metabolismo , Escherichia coli/genética , Redes Reguladoras de Genes , Programas Informáticos , Fenotipo
5.
Nucleic Acids Res ; 41(Web Server issue): W115-22, 2013 Jul.
Artículo en Inglés | MEDLINE | ID: mdl-23794635

RESUMEN

GeneMANIA (http://www.genemania.org) is a flexible user-friendly web interface for generating hypotheses about gene function, analyzing gene lists and prioritizing genes for functional assays. Given a query gene list, GeneMANIA extends the list with functionally similar genes that it identifies using available genomics and proteomics data. GeneMANIA also reports weights that indicate the predictive value of each selected data set for the query. GeneMANIA can also be used in a function prediction setting: given a query gene, GeneMANIA finds a small set of genes that are most likely to share function with that gene based on their interactions with it. Enriched Gene Ontology categories among this set can sometimes point to the function of the gene. Seven organisms are currently supported (Arabidopsis thaliana, Caenorhabditis elegans, Drosophila melanogaster, Mus musculus, Homo sapiens, Rattus norvegicus and Saccharomyces cerevisiae), and hundreds of data sets have been collected from GEO, BioGRID, IRefIndex and I2D, as well as organism-specific functional genomics data sets. Users can customize their search by selecting specific data sets to query and by uploading their own data sets to analyze.


Asunto(s)
Genes , Programas Informáticos , Algoritmos , Animales , Redes Reguladoras de Genes , Humanos , Internet , Ratones , Ratas
6.
Nucleic Acids Res ; 39(Database issue): D301-8, 2011 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-21036867

RESUMEN

The RNA-Binding Protein DataBase (RBPDB) is a collection of experimental observations of RNA-binding sites, both in vitro and in vivo, manually curated from primary literature. To build RBPDB, we performed a literature search for experimental binding data for all RNA-binding proteins (RBPs) with known RNA-binding domains in four metazoan species (human, mouse, fly and worm). In total, RPBDB contains binding data on 272 RBPs, including 71 that have motifs in position weight matrix format, and 36 sets of sequences of in vivo-bound transcripts from immunoprecipitation experiments. The database is accessible by a web interface which allows browsing by domain or by organism, searching and export of records, and bulk data downloads. Users can also use RBPDB to scan sequences for RBP-binding sites. RBPDB is freely available, without registration at http://rbpdb.ccbr.utoronto.ca/.


Asunto(s)
Bases de Datos de Proteínas , Proteínas de Unión al ARN/metabolismo , Animales , Sitios de Unión , Proteínas de Caenorhabditis elegans/química , Proteínas de Caenorhabditis elegans/metabolismo , Proteínas de Drosophila/química , Proteínas de Drosophila/metabolismo , Humanos , Ratones , Estructura Terciaria de Proteína , ARN Mensajero/química , ARN Mensajero/metabolismo , Proteínas de Unión al ARN/química , Análisis de Secuencia de ARN
7.
Nucleic Acids Res ; 38(Web Server issue): W214-20, 2010 Jul.
Artículo en Inglés | MEDLINE | ID: mdl-20576703

RESUMEN

GeneMANIA (http://www.genemania.org) is a flexible, user-friendly web interface for generating hypotheses about gene function, analyzing gene lists and prioritizing genes for functional assays. Given a query list, GeneMANIA extends the list with functionally similar genes that it identifies using available genomics and proteomics data. GeneMANIA also reports weights that indicate the predictive value of each selected data set for the query. Six organisms are currently supported (Arabidopsis thaliana, Caenorhabditis elegans, Drosophila melanogaster, Mus musculus, Homo sapiens and Saccharomyces cerevisiae) and hundreds of data sets have been collected from GEO, BioGRID, Pathway Commons and I2D, as well as organism-specific functional genomics data sets. Users can select arbitrary subsets of the data sets associated with an organism to perform their analyses and can upload their own data sets to analyze. The GeneMANIA algorithm performs as well or better than other gene function prediction methods on yeast and mouse benchmarks. The high accuracy of the GeneMANIA prediction algorithm, an intuitive user interface and large database make GeneMANIA a useful tool for any biologist.


Asunto(s)
Genes , Programas Informáticos , Algoritmos , Animales , Redes Reguladoras de Genes , Genes/fisiología , Genómica , Humanos , Internet , Ratones
8.
F1000Res ; 3: 153, 2014.
Artículo en Inglés | MEDLINE | ID: mdl-25254104

RESUMEN

The GeneMANIA Cytoscape app enables users to construct a composite gene-gene functional interaction network from a gene list. The resulting network includes the genes most related to the original list, and functional annotations from Gene Ontology. The edges are annotated with details about the publication or data source the interactions were derived from. The app leverages GeneMANIA's database of 1800+ networks, containing over 500 million interactions spanning 8 organisms: A. thaliana, C. elegans, D. melanogaster, D. rerio, H. sapiens, M. musculus, R. norvegicus, and S. cerevisiae. Users may also import their own organisms, networks, and expression profiles. The app is compatible with Cytoscape versions 2 and 3.

9.
Front Genet ; 5: 123, 2014.
Artículo en Inglés | MEDLINE | ID: mdl-24904632

RESUMEN

Significant effort has been invested in network-based gene function prediction algorithms based on the guilt by association (GBA) principle. Existing approaches for assessing prediction performance typically compute evaluation metrics, either averaged across all functions being considered, or strictly from properties of the network. Since the success of GBA algorithms depends on the specific function being predicted, evaluation metrics should instead be computed for each function. We describe a novel method for computing the usefulness of a network by measuring its impact on gene function cross validation prediction performance across all gene functions. We have implemented this in software called Network Assessor, and describe its use in the GeneMANIA (GM) quality control system. Network Assessor is part of the GM command line tools.

SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA