Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 8 de 8
Filtrar
Más filtros










Base de datos
Intervalo de año de publicación
1.
Nucleic Acids Res ; 47(W1): W183-W190, 2019 07 02.
Artículo en Inglés | MEDLINE | ID: mdl-31069376

RESUMEN

High-throughput experiments produce increasingly large datasets that are difficult to analyze and integrate. While most data integration approaches focus on aligning metadata, data integration can be achieved by abstracting experimental results into gene sets. Such gene sets can be made available for reuse through gene set enrichment analysis tools such as Enrichr. Enrichr currently only supports gene sets compiled from human and mouse, limiting accessibility for investigators that study other model organisms. modEnrichr is an expansion of Enrichr for four model organisms: fish, fly, worm and yeast. The gene set libraries within FishEnrichr, FlyEnrichr, WormEnrichr and YeastEnrichr are created from the Gene Ontology, mRNA expression profiles, GeneRIF, pathway databases, protein domain databases and other organism-specific resources. Additionally, libraries were created by predicting gene function from RNA-seq co-expression data processed uniformly from the gene expression omnibus for each organism. The modEnrichr suite of tools provides the ability to convert gene lists across species using an ortholog conversion tool that automatically detects the species. For complex analyses, modEnrichr provides API access that enables submitting batch queries. In summary, modEnrichr leverages existing model organism databases and other resources to facilitate comprehensive hypothesis generation through data integration.


Asunto(s)
Bases de Datos Genéticas , Expresión Génica/genética , Biblioteca de Genes , Biblioteca Genómica , Programas Informáticos , Animales , Biología Computacional , Ontología de Genes , Humanos , Metadatos
2.
Nucleic Acids Res ; 47(W1): W571-W577, 2019 07 02.
Artículo en Inglés | MEDLINE | ID: mdl-31114885

RESUMEN

The frequency by which genes are studied correlates with the prior knowledge accumulated about them. This leads to an imbalance in research attention where some genes are highly investigated while others are ignored. Geneshot is a search engine developed to illuminate this gap and to promote attention to the under-studied genome. Through a simple web interface, Geneshot enables researchers to enter arbitrary search terms, to receive ranked lists of genes relevant to the search terms. Returned ranked gene lists contain genes that were previously published in association with the search terms, as well as genes predicted to be associated with the terms based on data integration from multiple sources. The search results are presented with interactive visualizations. To predict gene function, Geneshot utilizes gene-gene similarity matrices from processed RNA-seq data, or from gene-gene co-occurrence data obtained from multiple sources. In addition, Geneshot can be used to analyze the novelty of gene sets and augment gene sets with additional relevant genes. The Geneshot web-server and API are freely and openly available from https://amp.pharm.mssm.edu/geneshot.


Asunto(s)
Genes , Programas Informáticos , Minería de Datos , Expresión Génica , Internet , Publicaciones , RNA-Seq , Investigadores , Interfaz Usuario-Computador
3.
Nucleic Acids Res ; 47(W1): W212-W224, 2019 07 02.
Artículo en Inglés | MEDLINE | ID: mdl-31114921

RESUMEN

Identifying the transcription factors (TFs) responsible for observed changes in gene expression is an important step in understanding gene regulatory networks. ChIP-X Enrichment Analysis 3 (ChEA3) is a transcription factor enrichment analysis tool that ranks TFs associated with user-submitted gene sets. The ChEA3 background database contains a collection of gene set libraries generated from multiple sources including TF-gene co-expression from RNA-seq studies, TF-target associations from ChIP-seq experiments, and TF-gene co-occurrence computed from crowd-submitted gene lists. Enrichment results from these distinct sources are integrated to generate a composite rank that improves the prediction of the correct upstream TF compared to ranks produced by individual libraries. We compare ChEA3 with existing TF prediction tools and show that ChEA3 performs better. By integrating the ChEA3 libraries, we illuminate general transcription factor properties such as whether the TF behaves as an activator or a repressor. The ChEA3 web-server is available from https://amp.pharm.mssm.edu/ChEA3.


Asunto(s)
Biología Computacional/métodos , Bases de Datos Genéticas , Biblioteca de Genes , Factores de Transcripción/genética , Secuenciación de Inmunoprecipitación de Cromatina/métodos , Conjuntos de Datos como Asunto , Regulación de la Expresión Génica/genética , Redes Reguladoras de Genes/genética , Humanos
4.
Nucleic Acids Res ; 46(W1): W171-W179, 2018 07 02.
Artículo en Inglés | MEDLINE | ID: mdl-29800326

RESUMEN

While gene expression data at the mRNA level can be globally and accurately measured, profiling the activity of cell signaling pathways is currently much more difficult. eXpression2Kinases (X2K) computationally predicts involvement of upstream cell signaling pathways, given a signature of differentially expressed genes. X2K first computes enrichment for transcription factors likely to regulate the expression of the differentially expressed genes. The next step of X2K connects these enriched transcription factors through known protein-protein interactions (PPIs) to construct a subnetwork. The final step performs kinase enrichment analysis on the members of the subnetwork. X2K Web is a new implementation of the original eXpression2Kinases algorithm with important enhancements. X2K Web includes many new transcription factor and kinase libraries, and PPI networks. For demonstration, thousands of gene expression signatures induced by kinase inhibitors, applied to six breast cancer cell lines, are provided for fetching directly into X2K Web. The results are displayed as interactive downloadable vector graphic network images and bar graphs. Benchmarking various settings via random permutations enabled the identification of an optimal set of parameters to be used as the default settings in X2K Web. X2K Web is freely available from http://X2K.cloud.


Asunto(s)
Expresión Génica , Proteínas Quinasas/metabolismo , Transducción de Señal , Programas Informáticos , Animales , Línea Celular Tumoral , Expresión Génica/efectos de los fármacos , Humanos , Internet , Ratones , Mapeo de Interacción de Proteínas , Inhibidores de Proteínas Quinasas/farmacología , Transducción de Señal/genética , Factores de Transcripción/metabolismo
5.
Nat Commun ; 9(1): 1366, 2018 04 10.
Artículo en Inglés | MEDLINE | ID: mdl-29636450

RESUMEN

RNA sequencing (RNA-seq) is the leading technology for genome-wide transcript quantification. However, publicly available RNA-seq data is currently provided mostly in raw form, a significant barrier for global and integrative retrospective analyses. ARCHS4 is a web resource that makes the majority of published RNA-seq data from human and mouse available at the gene and transcript levels. For developing ARCHS4, available FASTQ files from RNA-seq experiments from the Gene Expression Omnibus (GEO) were aligned using a cloud-based infrastructure. In total 187,946 samples are accessible through ARCHS4 with 103,083 mouse and 84,863 human. Additionally, the ARCHS4 web interface provides intuitive exploration of the processed data through querying tools, interactive visualization, and gene pages that provide average expression across cell lines and tissues, top co-expressed genes for each gene, and predicted biological functions and protein-protein interactions for each gene based on prior knowledge combined with co-expression.


Asunto(s)
Minería de Datos/métodos , Genoma , Programas Informáticos , Transcriptoma , Animales , Bases de Datos Genéticas , Ontología de Genes , Redes Reguladoras de Genes , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Internet , Ratones , Anotación de Secuencia Molecular , Mapeo de Interacción de Proteínas
6.
Bioinformatics ; 34(12): 2150-2152, 2018 06 15.
Artículo en Inglés | MEDLINE | ID: mdl-29420694

RESUMEN

Motivation: As part of the NIH Library of Integrated Network-based Cellular Signatures program, hundreds of thousands of transcriptomic signatures were generated with the L1000 technology, profiling the response of human cell lines to over 20 000 small molecule compounds. This effort is a promising approach toward revealing the mechanisms-of-action (MOA) for marketed drugs and other less studied potential therapeutic compounds. Results: L1000 fireworks display (L1000FWD) is a web application that provides interactive visualization of over 16 000 drug and small-molecule induced gene expression signatures. L1000FWD enables coloring of signatures by different attributes such as cell type, time point, concentration, as well as drug attributes such as MOA and clinical phase. Signature similarity search is implemented to enable the search for mimicking or opposing signatures given as input of up and down gene sets. Each point on the L1000FWD interactive map is linked to a signature landing page, which provides multifaceted knowledge from various sources about the signature and the drug. Notably such information includes most frequent diagnoses, co-prescribed drugs and age distribution of prescriptions as extracted from the Mount Sinai Health System electronic medical records. Overall, L1000FWD serves as a platform for identifying functions for novel small molecules using unsupervised clustering, as well as for exploring drug MOA. Availability and implementation: L1000FWD is freely accessible at: http://amp.pharm.mssm.edu/L1000FWD. Supplementary information: Supplementary data are available at Bioinformatics online.


Asunto(s)
Farmacogenética/métodos , Programas Informáticos , Transcriptoma/efectos de los fármacos , Aprendizaje Automático no Supervisado , Línea Celular , Análisis por Conglomerados , Visualización de Datos , Regulación de la Expresión Génica , Humanos
7.
Cell Syst ; 6(1): 13-24, 2018 01 24.
Artículo en Inglés | MEDLINE | ID: mdl-29199020

RESUMEN

The Library of Integrated Network-Based Cellular Signatures (LINCS) is an NIH Common Fund program that catalogs how human cells globally respond to chemical, genetic, and disease perturbations. Resources generated by LINCS include experimental and computational methods, visualization tools, molecular and imaging data, and signatures. By assembling an integrated picture of the range of responses of human cells exposed to many perturbations, the LINCS program aims to better understand human disease and to advance the development of new therapies. Perturbations under study include drugs, genetic perturbations, tissue micro-environments, antibodies, and disease-causing mutations. Responses to perturbations are measured by transcript profiling, mass spectrometry, cell imaging, and biochemical methods, among other assays. The LINCS program focuses on cellular physiology shared among tissues and cell types relevant to an array of diseases, including cancer, heart disease, and neurodegenerative disorders. This Perspective describes LINCS technologies, datasets, tools, and approaches to data accessibility and reusability.


Asunto(s)
Catalogación/métodos , Biología de Sistemas/métodos , Biología Computacional/métodos , Bases de Datos de Compuestos Químicos/normas , Perfilación de la Expresión Génica/métodos , Biblioteca de Genes , Humanos , Almacenamiento y Recuperación de la Información/métodos , Programas Nacionales de Salud , National Institutes of Health (U.S.)/normas , Transcriptoma , Estados Unidos
8.
Exp Parasitol ; 126(4): 582-91, 2010 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-20542033

RESUMEN

The parasitic protozoan, Leishmania, survives in harsh environments within its mammalian and sand fly hosts. Secreted proteins likely play critical roles in the parasite's interactions with its environment. As a preliminary identification of the spectrum of potential excreted/secreted (ES) proteins of Leishmania infantum chagasi (Lic), a causative agent of visceral leishmaniasis, we used standard algorithms to screen the annotated L. infantum genome for genes whose predicted protein products have an N-terminal signal peptide and lack transmembrane domains and membrane anchors. A suite of 181 candidate ES proteins were identified. These included several that were documented in the literature to be released by other Leishmania spp. Six candidate ES proteins were selected for further validation of their expression and release by different parasite stages. We found both amastigote-specific and promastigote-specific released proteins. The ES proteins of Lic are candidates for future studies of parasite virulence determinants and host protective immunity.


Asunto(s)
Genoma de Protozoos , Leishmania infantum/metabolismo , Leishmaniasis Visceral/parasitología , Proteínas Protozoarias/metabolismo , Algoritmos , Animales , Clonación Molecular , Cricetinae , Humanos , Sueros Inmunes/inmunología , Immunoblotting , Leishmania infantum/genética , Leishmania infantum/inmunología , Masculino , Mesocricetus , Microscopía Confocal , Proteínas Protozoarias/genética , Proteínas Protozoarias/aislamiento & purificación , Proteínas Recombinantes/biosíntesis
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...