Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 15 de 15
Filtrar
1.
Bioinformatics ; 28(3): 373-80, 2012 Feb 01.
Artigo em Inglês | MEDLINE | ID: mdl-22135418

RESUMO

MOTIVATION: Metabolomics is a rapidly evolving field that holds promise to provide insights into genotype-phenotype relationships in cancers, diabetes and other complex diseases. One of the major informatics challenges is providing tools that link metabolite data with other types of high-throughput molecular data (e.g. transcriptomics, proteomics), and incorporate prior knowledge of pathways and molecular interactions. RESULTS: We describe a new, substantially redesigned version of our tool Metscape that allows users to enter experimental data for metabolites, genes and pathways and display them in the context of relevant metabolic networks. Metscape 2 uses an internal relational database that integrates data from KEGG and EHMN databases. The new version of the tool allows users to identify enriched pathways from expression profiling data, build and analyze the networks of genes and metabolites, and visualize changes in the gene/metabolite data. We demonstrate the applications of Metscape to annotate molecular pathways for human and mouse metabolites implicated in the pathogenesis of sepsis-induced acute lung injury, for the analysis of gene expression and metabolite data from pancreatic ductal adenocarcinoma, and for identification of the candidate metabolites involved in cancer and inflammation. AVAILABILITY: Metscape is part of the National Institutes of Health-supported National Center for Integrative Biomedical Informatics (NCIBI) suite of tools, freely available at http://metscape.ncibi.org. It can be downloaded from http://cytoscape.org or installed via Cytoscape plugin manager. CONTACT: metscape-help@umich.edu; akarnovs@umich.edu SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Perfilação da Expressão Gênica , Metabolômica , Software , Adenocarcinoma/genética , Adenocarcinoma/metabolismo , Animais , Humanos , Inflamação/metabolismo , Redes e Vias Metabólicas , Camundongos , Neoplasias Pancreáticas/genética , Neoplasias Pancreáticas/metabolismo , Proteômica , Sepse/metabolismo
2.
BMC Bioinformatics ; 12: 81, 2011 Mar 20.
Artigo em Inglês | MEDLINE | ID: mdl-21418606

RESUMO

BACKGROUND: Gene set enrichment testing has helped bridge the gap from an individual gene to a systems biology interpretation of microarray data. Although gene sets are defined a priori based on biological knowledge, current methods for gene set enrichment testing treat all genes equal. It is well-known that some genes, such as those responsible for housekeeping functions, appear in many pathways, whereas other genes are more specialized and play a unique role in a single pathway. Drawing inspiration from the field of information retrieval, we have developed and present here an approach to incorporate gene appearance frequency (in KEGG pathways) into two current methods, Gene Set Enrichment Analysis (GSEA) and logistic regression-based LRpath framework, to generate more reproducible and biologically meaningful results. RESULTS: Two breast cancer microarray datasets were analyzed to identify gene sets differentially expressed between histological grade 1 and 3 breast cancer. The correlation of Normalized Enrichment Scores (NES) between gene sets, generated by the original GSEA and GSEA with the appearance frequency of genes incorporated (GSEA-AF), was compared. GSEA-AF resulted in higher correlation between experiments and more overlapping top gene sets. Several cancer related gene sets achieved higher NES in GSEA-AF as well. The same datasets were also analyzed by LRpath and LRpath with the appearance frequency of genes incorporated (LRpath-AF). Two well-studied lung cancer datasets were also analyzed in the same manner to demonstrate the validity of the method, and similar results were obtained. CONCLUSIONS: We introduce an alternative way to integrate KEGG PATHWAY information into gene set enrichment testing. The performance of GSEA and LRpath can be enhanced with the integration of appearance frequency of genes. We conclude that, generally, gene set analysis methods with the integration of information from KEGG PATHWAY performs better both statistically and biologically.


Assuntos
Perfilação da Expressão Gênica/métodos , Regulação da Expressão Gênica , Frequência do Gene , Humanos , Armazenamento e Recuperação da Informação , Análise de Sequência com Séries de Oligonucleotídeos , Reprodutibilidade dos Testes
3.
Bioinformatics ; 26(7): 971-3, 2010 Apr 01.
Artigo em Inglês | MEDLINE | ID: mdl-20139469

RESUMO

SUMMARY: Metscape is a plug-in for Cytoscape, used to visualize and interpret metabolomic data in the context of human metabolic networks. We have developed a metabolite database by extracting and integrating information from several public sources. By querying this database, Metscape allows users to trace the connections between metabolites and genes, visualize compound networks and display compound structures as well as information for reactions, enzymes, genes and pathways. Applying the pathway filter, users can create subnetworks that consist of compounds and reactions from a given pathway. Metscape allows users to upload experimental data, and visualize and explore compound networks over time, or experimental conditions. Color and size of the nodes are used to visualize these dynamic changes. Metscape can display the entire metabolic network or any of the pathway-specific networks that exist in the database. AVAILABILITY: Metscape can be installed from within Cytoscape 2.6.x under 'Network and Attribute I/O' category. For more information, please visit http://metscape.ncibi.org/tryplugin.html.


Assuntos
Redes e Vias Metabólicas , Metabolômica/métodos , Software , Bases de Dados Factuais , Humanos
4.
Bioinformatics ; 26(4): 456-63, 2010 Feb 15.
Artigo em Inglês | MEDLINE | ID: mdl-20007254

RESUMO

MOTIVATION: The elucidation of biological concepts enriched with differentially expressed genes has become an integral part of the analysis and interpretation of genomic data. Of additional importance is the ability to explore networks of relationships among previously defined biological concepts from diverse information sources, and to explore results visually from multiple perspectives. Accomplishing these tasks requires a unified framework for agglomeration of data from various genomic resources, novel visualizations, and user functionality. RESULTS: We have developed ConceptGen, a web-based gene set enrichment and gene set relation mapping tool that is streamlined and simple to use. ConceptGen offers over 20,000 concepts comprising 14 different types of biological knowledge, including data not currently available in any other gene set enrichment or gene set relation mapping tool. We demonstrate the functionalities of ConceptGen using gene expression data modeling TGF-beta-induced epithelial-mesenchymal transition and metabolomics data comparing metastatic versus localized prostate cancers.


Assuntos
Perfilação da Expressão Gênica/métodos , Reconhecimento Automatizado de Padrão/métodos , Software , Animais , Biologia Computacional , Bases de Dados Genéticas , Redes Reguladoras de Genes , Humanos , Masculino , Metástase Neoplásica/genética , Análise de Sequência com Séries de Oligonucleotídeos , Neoplasias Pancreáticas/genética , Fator de Crescimento Transformador beta/genética , Fator de Crescimento Transformador beta/metabolismo
5.
Nucleic Acids Res ; 37(Database issue): D642-6, 2009 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-18978014

RESUMO

Molecular interaction data exists in a number of repositories, each with its own data format, molecule identifier and information coverage. Michigan molecular interactions (MiMI) assists scientists searching through this profusion of molecular interaction data. The original release of MiMI gathered data from well-known protein interaction databases, and deep merged this information while keeping track of provenance. Based on the feedback received from users, MiMI has been completely redesigned. This article describes the resulting MiMI Release 2 (MiMIr2). New functionality includes extension from proteins to genes and to pathways; identification of highlighted sentences in source publications; seamless two-way linkage with Cytoscape; query facilities based on MeSH/GO terms and other concepts; approximate graph matching to find relevant pathways; support for querying in bulk; and a user focus-group driven interface design. MiMI is part of the NIH's; National Center for Integrative Biomedical Informatics (NCIBI) and is publicly available at: http://mimi.ncibi.org.


Assuntos
Bases de Dados de Proteínas , Mapeamento de Interação de Proteínas , Proteínas/metabolismo , Gráficos por Computador , Proteínas/genética , Interface Usuário-Computador
6.
Bioinformatics ; 25(1): 137-8, 2009 Jan 01.
Artigo em Inglês | MEDLINE | ID: mdl-18812364

RESUMO

UNLABELLED: The MiMI molecular interaction repository integrates data from multiple sources, resolves interactions to standard gene names and symbols, links to annotation data from GO, MeSH and PubMed and normalizes the descriptions of interaction type. Here, we describe a Cytoscape plugin that retrieves interaction and annotation data from MiMI and links out to multiple data sources and tools. Community annotation of the interactome is supported. AVAILABILITY: MiMI plugin v3.0.1 can be installed from within Cytoscape 2.6 using the Cytoscape plugin manager in 'Network and Attribute I/0' category. The plugin is also preloaded when Cytoscape is launched using Java WebStart at http://mimi.ncibi.org by querying a gene and clicking 'View in MiMI Plugin for Cytoscape' link.


Assuntos
Biologia Computacional/métodos , Software , Bases de Dados Genéticas , Interface Usuário-Computador
7.
BMC Bioinformatics ; 10 Suppl 9: S3, 2009 Sep 17.
Artigo em Inglês | MEDLINE | ID: mdl-19761573

RESUMO

BACKGROUND: Chronic renal diseases are currently classified based on morphological similarities such as whether they produce predominantly inflammatory or non-inflammatory responses. However, such classifications do not reliably predict the course of the disease and its response to therapy. In contrast, recent studies in diseases such as breast cancer suggest that a classification which includes molecular information could lead to more accurate diagnoses and prediction of treatment response. This article describes how we extracted gene expression profiles from biopsies of patients with chronic renal diseases, and used network visualizations and associated quantitative measures to rapidly analyze similarities and differences between the diseases. RESULTS: The analysis revealed three main regularities: (1) Many genes associated with a single disease, and fewer genes associated with many diseases. (2) Unexpected combinations of renal diseases that share relatively large numbers of genes. (3) Uniform concordance in the regulation of all genes in the network. CONCLUSION: The overall results suggest the need to define a molecular-based classification of renal diseases, in addition to hypotheses for the unexpected patterns of shared genes and the uniformity in gene concordance. Furthermore, the results demonstrate the utility of network analyses to rapidly understand complex relationships between diseases and regulated genes.


Assuntos
Biologia Computacional/métodos , Redes Reguladoras de Genes , Nefropatias/classificação , Nefropatias/genética , Perfilação da Expressão Gênica/métodos , Humanos , Análise de Sequência com Séries de Oligonucleotídeos
8.
Nucleic Acids Res ; 35(Database issue): D566-71, 2007 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-17130145

RESUMO

Protein interaction data exists in a number of repositories. Each repository has its own data format, molecule identifier and supplementary information. Michigan Molecular Interactions (MiMI) assists scientists searching through this overwhelming amount of protein interaction data. MiMI gathers data from well-known protein interaction databases and deep-merges the information. Utilizing an identity function, molecules that may have different identifiers but represent the same real-world object are merged. Thus, MiMI allows the users to retrieve information from many different databases at once, highlighting complementary and contradictory information. To help scientists judge the usefulness of a piece of data, MiMI tracks the provenance of all data. Finally, a simple yet powerful user interface aids users in their queries, and frees them from the onerous task of knowing the data format or learning a query language. MiMI allows scientists to query all data, whether corroborative or contradictory, and specify which sources to utilize. MiMI is part of the National Center for Integrative Biomedical Informatics (NCIBI) and is publicly available at: http://mimi.ncibi.org.


Assuntos
Bases de Dados de Proteínas , Mapeamento de Interação de Proteínas , Internet , Interface Usuário-Computador
9.
Big Data ; 5(2): 73-84, 2017 06.
Artigo em Inglês | MEDLINE | ID: mdl-28632443

RESUMO

Big data technology offers unprecedented opportunities to society as a whole and also to its individual members. At the same time, this technology poses significant risks to those it overlooks. In this article, we give an overview of recent technical work on diversity, particularly in selection tasks, discuss connections between diversity and fairness, and identify promising directions for future work that will position diversity as an important component of a data-responsible society. We argue that diversity should come to the forefront of our discourse, for reasons that are both ethical-to mitigate the risks of exclusion-and utilitarian, to enable more powerful, accurate, and engaging data analysis and use.


Assuntos
Interpretação Estatística de Dados , Algoritmos , Crowdsourcing , Pesquisa Empírica , Modelos Estatísticos , Seleção de Pessoal
10.
Big Data ; 5(3): 177-188, 2017 09.
Artigo em Inglês | MEDLINE | ID: mdl-28816500

RESUMO

Significant research challenges must be addressed in the cleaning, transformation, integration, modeling, and analytics of Big Data sources for finance. This article surveys the progress made so far in this direction and obstacles yet to be overcome. These are issues that are of interest to data-driven financial institutions in both corporate finance and consumer finance. These challenges are also of interest to the legal profession as well as to regulators. The discussion is relevant to technology firms that support the growing field of FinTech.


Assuntos
Administração Financeira , Modelos Estatísticos , Tomada de Decisões
11.
OMICS ; 7(1): 101-2, 2003.
Artigo em Inglês | MEDLINE | ID: mdl-12831566

RESUMO

Protein data, from sequence and structure to interaction, is being generated through many diverse methodologies; it is stored and reported in numerous forms and multiple places. The magnitude of the data limits researchers abilities to utilize all information generated. Effective integration of protein data can be accomplished through better data modeling. We demonstrate this through the MIPD project.


Assuntos
Proteínas/química , Integração de Sistemas , Modelos Moleculares
12.
J Am Med Inform Assoc ; 19(2): 166-70, 2012.
Artigo em Inglês | MEDLINE | ID: mdl-22101971

RESUMO

The National Center for Integrative and Biomedical Informatics (NCIBI) is one of the eight NCBCs. NCIBI supports information access and data analysis for biomedical researchers, enabling them to build computational and knowledge models of biological systems to address the Driving Biological Problems (DBPs). The NCIBI DBPs have included prostate cancer progression, organ-specific complications of type 1 and 2 diabetes, bipolar disorder, and metabolic analysis of obesity syndrome. Collaborating with these and other partners, NCIBI has developed a series of software tools for exploratory analysis, concept visualization, and literature searches, as well as core database and web services resources. Many of our training and outreach initiatives have been in collaboration with the Research Centers at Minority Institutions (RCMI), integrating NCIBI and RCMI faculty and students, culminating each year in an annual workshop. Our future directions include focusing on the TranSMART data sharing and analysis initiative.


Assuntos
Pesquisa Biomédica , Disseminação de Informação , Medicina Integrativa , Informática Médica , Bases de Dados como Assunto , Previsões , Objetivos , National Institutes of Health (U.S.) , Estados Unidos
13.
BMC Med Genomics ; 3: 49, 2010 Oct 27.
Artigo em Inglês | MEDLINE | ID: mdl-20979611

RESUMO

BACKGROUND: Reactive oxygen species (ROS) are known mediators of cellular damage in multiple diseases including diabetic complications. Despite its importance, no comprehensive database is currently available for the genes associated with ROS. METHODS: We present ROS- and diabetes-related targets (genes/proteins) collected from the biomedical literature through a text mining technology. A web-based literature mining tool, SciMiner, was applied to 1,154 biomedical papers indexed with diabetes and ROS by PubMed to identify relevant targets. Over-represented targets in the ROS-diabetes literature were obtained through comparisons against randomly selected literature. The expression levels of nine genes, selected from the top ranked ROS-diabetes set, were measured in the dorsal root ganglia (DRG) of diabetic and non-diabetic DBA/2J mice in order to evaluate the biological relevance of literature-derived targets in the pathogenesis of diabetic neuropathy. RESULTS: SciMiner identified 1,026 ROS- and diabetes-related targets from the 1,154 biomedical papers (http://jdrf.neurology.med.umich.edu/ROSDiabetes/). Fifty-three targets were significantly over-represented in the ROS-diabetes literature compared to randomly selected literature. These over-represented targets included well-known members of the oxidative stress response including catalase, the NADPH oxidase family, and the superoxide dismutase family of proteins. Eight of the nine selected genes exhibited significant differential expression between diabetic and non-diabetic mice. For six genes, the direction of expression change in diabetes paralleled enhanced oxidative stress in the DRG. CONCLUSIONS: Literature mining compiled ROS-diabetes related targets from the biomedical literature and led us to evaluate the biological relevance of selected targets in the pathogenesis of diabetic neuropathy.


Assuntos
Mineração de Dados/métodos , Diabetes Mellitus/genética , Diabetes Mellitus/metabolismo , Publicações Periódicas como Assunto , Espécies Reativas de Oxigênio/metabolismo , Pesquisa , Animais , Nefropatias Diabéticas/genética , Nefropatias Diabéticas/metabolismo , Gânglios Espinais/metabolismo , Perfilação da Expressão Gênica , Humanos , Camundongos , Proteínas/metabolismo , Células Receptoras Sensoriais/metabolismo
15.
PLoS One ; 3(5): e2265, 2008 May 28.
Artigo em Inglês | MEDLINE | ID: mdl-18509477

RESUMO

The advancement of the computational biology field hinges on progress in three fundamental directions--the development of new computational algorithms, the availability of informatics resource management infrastructures and the capability of tools to interoperate and synergize. There is an explosion in algorithms and tools for computational biology, which makes it difficult for biologists to find, compare and integrate such resources. We describe a new infrastructure, iTools, for managing the query, traversal and comparison of diverse computational biology resources. Specifically, iTools stores information about three types of resources--data, software tools and web-services. The iTools design, implementation and resource meta-data content reflect the broad research, computational, applied and scientific expertise available at the seven National Centers for Biomedical Computing. iTools provides a system for classification, categorization and integration of different computational biology resources across space-and-time scales, biomedical problems, computational infrastructures and mathematical foundations. A large number of resources are already iTools-accessible to the community and this infrastructure is rapidly growing. iTools includes human and machine interfaces to its resource meta-data repository. Investigators or computer programs may utilize these interfaces to search, compare, expand, revise and mine meta-data descriptions of existent computational biology resources. We propose two ways to browse and display the iTools dynamic collection of resources. The first one is based on an ontology of computational biology resources, and the second one is derived from hyperbolic projections of manifolds or complex structures onto planar discs. iTools is an open source project both in terms of the source code development as well as its meta-data content. iTools employs a decentralized, portable, scalable and lightweight framework for long-term resource management. We demonstrate several applications of iTools as a framework for integrated bioinformatics. iTools and the complete details about its specifications, usage and interfaces are available at the iTools web page http://iTools.ccb.ucla.edu.


Assuntos
Biologia Computacional , Internet , Bases de Dados Factuais , Integração de Sistemas
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA