Pesquisa | Biblioteca Virtual em Saúde

Geneshot: search engine for ranking genes from arbitrary text queries.

Lachmann, Alexander; Schilder, Brian M; Wojciechowicz, Megan L; Torre, Denis; Kuleshov, Maxim V; Keenan, Alexandra B; Ma'ayan, Avi.

Nucleic Acids Res ; 47(W1): W571-W577, 2019 07 02.

Artigo em Inglês | MEDLINE | ID: mdl-31114885

RESUMO

The frequency by which genes are studied correlates with the prior knowledge accumulated about them. This leads to an imbalance in research attention where some genes are highly investigated while others are ignored. Geneshot is a search engine developed to illuminate this gap and to promote attention to the under-studied genome. Through a simple web interface, Geneshot enables researchers to enter arbitrary search terms, to receive ranked lists of genes relevant to the search terms. Returned ranked gene lists contain genes that were previously published in association with the search terms, as well as genes predicted to be associated with the terms based on data integration from multiple sources. The search results are presented with interactive visualizations. To predict gene function, Geneshot utilizes gene-gene similarity matrices from processed RNA-seq data, or from gene-gene co-occurrence data obtained from multiple sources. In addition, Geneshot can be used to analyze the novelty of gene sets and augment gene sets with additional relevant genes. The Geneshot web-server and API are freely and openly available from https://amp.pharm.mssm.edu/geneshot.

Assuntos

Genes , Software , Mineração de Dados , Expressão Gênica , Internet , Publicações , RNA-Seq , Pesquisadores , Interface Usuário-Computador

modEnrichr: a suite of gene set enrichment analysis tools for model organisms.

Kuleshov, Maxim V; Diaz, Jennifer E L; Flamholz, Zachary N; Keenan, Alexandra B; Lachmann, Alexander; Wojciechowicz, Megan L; Cagan, Ross L; Ma'ayan, Avi.

Nucleic Acids Res ; 47(W1): W183-W190, 2019 07 02.

Artigo em Inglês | MEDLINE | ID: mdl-31069376

RESUMO

High-throughput experiments produce increasingly large datasets that are difficult to analyze and integrate. While most data integration approaches focus on aligning metadata, data integration can be achieved by abstracting experimental results into gene sets. Such gene sets can be made available for reuse through gene set enrichment analysis tools such as Enrichr. Enrichr currently only supports gene sets compiled from human and mouse, limiting accessibility for investigators that study other model organisms. modEnrichr is an expansion of Enrichr for four model organisms: fish, fly, worm and yeast. The gene set libraries within FishEnrichr, FlyEnrichr, WormEnrichr and YeastEnrichr are created from the Gene Ontology, mRNA expression profiles, GeneRIF, pathway databases, protein domain databases and other organism-specific resources. Additionally, libraries were created by predicting gene function from RNA-seq co-expression data processed uniformly from the gene expression omnibus for each organism. The modEnrichr suite of tools provides the ability to convert gene lists across species using an ortholog conversion tool that automatically detects the species. For complex analyses, modEnrichr provides API access that enables submitting batch queries. In summary, modEnrichr leverages existing model organism databases and other resources to facilitate comprehensive hypothesis generation through data integration.

Assuntos

Bases de Dados Genéticas , Expressão Gênica/genética , Biblioteca Gênica , Biblioteca Genômica , Software , Animais , Biologia Computacional , Ontologia Genética , Humanos , Metadados

ChEA3: transcription factor enrichment analysis by orthogonal omics integration.

Keenan, Alexandra B; Torre, Denis; Lachmann, Alexander; Leong, Ariel K; Wojciechowicz, Megan L; Utti, Vivian; Jagodnik, Kathleen M; Kropiwnicki, Eryk; Wang, Zichen; Ma'ayan, Avi.

Nucleic Acids Res ; 47(W1): W212-W224, 2019 07 02.

Artigo em Inglês | MEDLINE | ID: mdl-31114921

RESUMO

Identifying the transcription factors (TFs) responsible for observed changes in gene expression is an important step in understanding gene regulatory networks. ChIP-X Enrichment Analysis 3 (ChEA3) is a transcription factor enrichment analysis tool that ranks TFs associated with user-submitted gene sets. The ChEA3 background database contains a collection of gene set libraries generated from multiple sources including TF-gene co-expression from RNA-seq studies, TF-target associations from ChIP-seq experiments, and TF-gene co-occurrence computed from crowd-submitted gene lists. Enrichment results from these distinct sources are integrated to generate a composite rank that improves the prediction of the correct upstream TF compared to ranks produced by individual libraries. We compare ChEA3 with existing TF prediction tools and show that ChEA3 performs better. By integrating the ChEA3 libraries, we illuminate general transcription factor properties such as whether the TF behaves as an activator or a repressor. The ChEA3 web-server is available from https://amp.pharm.mssm.edu/ChEA3.

Assuntos

Biologia Computacional/métodos , Bases de Dados Genéticas , Biblioteca Gênica , Fatores de Transcrição/genética , Sequenciamento de Cromatina por Imunoprecipitação/métodos , Conjuntos de Dados como Assunto , Regulação da Expressão Gênica/genética , Redes Reguladoras de Genes/genética , Humanos

eXpression2Kinases (X2K) Web: linking expression signatures to upstream cell signaling networks.

Clarke, Daniel J B; Kuleshov, Maxim V; Schilder, Brian M; Torre, Denis; Duffy, Mary E; Keenan, Alexandra B; Lachmann, Alexander; Feldmann, Axel S; Gundersen, Gregory W; Silverstein, Moshe C; Wang, Zichen; Ma'ayan, Avi.

Nucleic Acids Res ; 46(W1): W171-W179, 2018 07 02.

Artigo em Inglês | MEDLINE | ID: mdl-29800326

RESUMO

While gene expression data at the mRNA level can be globally and accurately measured, profiling the activity of cell signaling pathways is currently much more difficult. eXpression2Kinases (X2K) computationally predicts involvement of upstream cell signaling pathways, given a signature of differentially expressed genes. X2K first computes enrichment for transcription factors likely to regulate the expression of the differentially expressed genes. The next step of X2K connects these enriched transcription factors through known protein-protein interactions (PPIs) to construct a subnetwork. The final step performs kinase enrichment analysis on the members of the subnetwork. X2K Web is a new implementation of the original eXpression2Kinases algorithm with important enhancements. X2K Web includes many new transcription factor and kinase libraries, and PPI networks. For demonstration, thousands of gene expression signatures induced by kinase inhibitors, applied to six breast cancer cell lines, are provided for fetching directly into X2K Web. The results are displayed as interactive downloadable vector graphic network images and bar graphs. Benchmarking various settings via random permutations enabled the identification of an optimal set of parameters to be used as the default settings in X2K Web. X2K Web is freely available from http://X2K.cloud.

Assuntos

Expressão Gênica , Proteínas Quinases/metabolismo , Transdução de Sinais , Software , Animais , Linhagem Celular Tumoral , Expressão Gênica/efeitos dos fármacos , Humanos , Internet , Camundongos , Mapeamento de Interação de Proteínas , Inibidores de Proteínas Quinases/farmacologia , Transdução de Sinais/genética , Fatores de Transcrição/metabolismo

L1000FWD: fireworks visualization of drug-induced transcriptomic signatures.

Wang, Zichen; Lachmann, Alexander; Keenan, Alexandra B; Ma'ayan, Avi.

Bioinformatics ; 34(12): 2150-2152, 2018 06 15.

Artigo em Inglês | MEDLINE | ID: mdl-29420694

RESUMO

Motivation: As part of the NIH Library of Integrated Network-based Cellular Signatures program, hundreds of thousands of transcriptomic signatures were generated with the L1000 technology, profiling the response of human cell lines to over 20 000 small molecule compounds. This effort is a promising approach toward revealing the mechanisms-of-action (MOA) for marketed drugs and other less studied potential therapeutic compounds. Results: L1000 fireworks display (L1000FWD) is a web application that provides interactive visualization of over 16 000 drug and small-molecule induced gene expression signatures. L1000FWD enables coloring of signatures by different attributes such as cell type, time point, concentration, as well as drug attributes such as MOA and clinical phase. Signature similarity search is implemented to enable the search for mimicking or opposing signatures given as input of up and down gene sets. Each point on the L1000FWD interactive map is linked to a signature landing page, which provides multifaceted knowledge from various sources about the signature and the drug. Notably such information includes most frequent diagnoses, co-prescribed drugs and age distribution of prescriptions as extracted from the Mount Sinai Health System electronic medical records. Overall, L1000FWD serves as a platform for identifying functions for novel small molecules using unsupervised clustering, as well as for exploring drug MOA. Availability and implementation: L1000FWD is freely accessible at: http://amp.pharm.mssm.edu/L1000FWD. Supplementary information: Supplementary data are available at Bioinformatics online.

Assuntos

Farmacogenética/métodos , Software , Transcriptoma/efeitos dos fármacos , Aprendizado de Máquina não Supervisionado , Linhagem Celular , Análise por Conglomerados , Visualização de Dados , Regulação da Expressão Gênica , Humanos

Leishmania infantum chagasi: a genome-based approach to identification of excreted/secreted proteins.

DebRoy, Sruti; Keenan, Alexandra B; Ueno, Norikiyo; Jeronimo, Selma M B; Donelson, John E; Wilson, Mary E.

Exp Parasitol ; 126(4): 582-91, 2010 Dec.

Artigo em Inglês | MEDLINE | ID: mdl-20542033

RESUMO

The parasitic protozoan, Leishmania, survives in harsh environments within its mammalian and sand fly hosts. Secreted proteins likely play critical roles in the parasite's interactions with its environment. As a preliminary identification of the spectrum of potential excreted/secreted (ES) proteins of Leishmania infantum chagasi (Lic), a causative agent of visceral leishmaniasis, we used standard algorithms to screen the annotated L. infantum genome for genes whose predicted protein products have an N-terminal signal peptide and lack transmembrane domains and membrane anchors. A suite of 181 candidate ES proteins were identified. These included several that were documented in the literature to be released by other Leishmania spp. Six candidate ES proteins were selected for further validation of their expression and release by different parasite stages. We found both amastigote-specific and promastigote-specific released proteins. The ES proteins of Lic are candidates for future studies of parasite virulence determinants and host protective immunity.

Assuntos

Genoma de Protozoário , Leishmania infantum/metabolismo , Leishmaniose Visceral/parasitologia , Proteínas de Protozoários/metabolismo , Algoritmos , Animais , Clonagem Molecular , Cricetinae , Humanos , Soros Imunes/imunologia , Immunoblotting , Leishmania infantum/genética , Leishmania infantum/imunologia , Masculino , Mesocricetus , Microscopia Confocal , Proteínas de Protozoários/genética , Proteínas de Protozoários/isolamento & purificação , Proteínas Recombinantes/biossíntese

Massive mining of publicly available RNA-seq data from human and mouse.

Lachmann, Alexander; Torre, Denis; Keenan, Alexandra B; Jagodnik, Kathleen M; Lee, Hoyjin J; Wang, Lily; Silverstein, Moshe C; Ma'ayan, Avi.

Nat Commun ; 9(1): 1366, 2018 04 10.

Artigo em Inglês | MEDLINE | ID: mdl-29636450

RESUMO

RNA sequencing (RNA-seq) is the leading technology for genome-wide transcript quantification. However, publicly available RNA-seq data is currently provided mostly in raw form, a significant barrier for global and integrative retrospective analyses. ARCHS4 is a web resource that makes the majority of published RNA-seq data from human and mouse available at the gene and transcript levels. For developing ARCHS4, available FASTQ files from RNA-seq experiments from the Gene Expression Omnibus (GEO) were aligned using a cloud-based infrastructure. In total 187,946 samples are accessible through ARCHS4 with 103,083 mouse and 84,863 human. Additionally, the ARCHS4 web interface provides intuitive exploration of the processed data through querying tools, interactive visualization, and gene pages that provide average expression across cell lines and tissues, top co-expressed genes for each gene, and predicted biological functions and protein-protein interactions for each gene based on prior knowledge combined with co-expression.

Assuntos

Mineração de Dados/métodos , Genoma , Software , Transcriptoma , Animais , Bases de Dados Genéticas , Ontologia Genética , Redes Reguladoras de Genes , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Internet , Camundongos , Anotação de Sequência Molecular , Mapeamento de Interação de Proteínas

The Library of Integrated Network-Based Cellular Signatures NIH Program: System-Level Cataloging of Human Cells Response to Perturbations.

Keenan, Alexandra B; Jenkins, Sherry L; Jagodnik, Kathleen M; Koplev, Simon; He, Edward; Torre, Denis; Wang, Zichen; Dohlman, Anders B; Silverstein, Moshe C; Lachmann, Alexander; Kuleshov, Maxim V; Ma'ayan, Avi; Stathias, Vasileios; Terryn, Raymond; Cooper, Daniel; Forlin, Michele; Koleti, Amar; Vidovic, Dusica; Chung, Caty; Schürer, Stephan C; Vasiliauskas, Jouzas; Pilarczyk, Marcin; Shamsaei, Behrouz; Fazel, Mehdi; Ren, Yan; Niu, Wen; Clark, Nicholas A; White, Shana; Mahi, Naim; Zhang, Lixia; Kouril, Michal; Reichard, John F; Sivaganesan, Siva; Medvedovic, Mario; Meller, Jaroslaw; Koch, Rick J; Birtwistle, Marc R; Iyengar, Ravi; Sobie, Eric A; Azeloglu, Evren U; Kaye, Julia; Osterloh, Jeannette; Haston, Kelly; Kalra, Jaslin; Finkbiener, Steve; Li, Jonathan; Milani, Pamela; Adam, Miriam; Escalante-Chong, Renan; Sachs, Karen.

Cell Syst ; 6(1): 13-24, 2018 01 24.

Artigo em Inglês | MEDLINE | ID: mdl-29199020

RESUMO

The Library of Integrated Network-Based Cellular Signatures (LINCS) is an NIH Common Fund program that catalogs how human cells globally respond to chemical, genetic, and disease perturbations. Resources generated by LINCS include experimental and computational methods, visualization tools, molecular and imaging data, and signatures. By assembling an integrated picture of the range of responses of human cells exposed to many perturbations, the LINCS program aims to better understand human disease and to advance the development of new therapies. Perturbations under study include drugs, genetic perturbations, tissue micro-environments, antibodies, and disease-causing mutations. Responses to perturbations are measured by transcript profiling, mass spectrometry, cell imaging, and biochemical methods, among other assays. The LINCS program focuses on cellular physiology shared among tissues and cell types relevant to an array of diseases, including cancer, heart disease, and neurodegenerative disorders. This Perspective describes LINCS technologies, datasets, tools, and approaches to data accessibility and reusability.

Assuntos

Catalogação/métodos , Biologia de Sistemas/métodos , Biologia Computacional/métodos , Bases de Dados de Compostos Químicos/normas , Perfilação da Expressão Gênica/métodos , Biblioteca Gênica , Humanos , Armazenamento e Recuperação da Informação/métodos , Programas Nacionais de Saúde , National Institutes of Health (U.S.)/normas , Transcriptoma , Estados Unidos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA