Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 15 de 15
Filtrar
1.
Bioinformatics ; 28(3): 373-80, 2012 Feb 01.
Artículo en Inglés | MEDLINE | ID: mdl-22135418

RESUMEN

MOTIVATION: Metabolomics is a rapidly evolving field that holds promise to provide insights into genotype-phenotype relationships in cancers, diabetes and other complex diseases. One of the major informatics challenges is providing tools that link metabolite data with other types of high-throughput molecular data (e.g. transcriptomics, proteomics), and incorporate prior knowledge of pathways and molecular interactions. RESULTS: We describe a new, substantially redesigned version of our tool Metscape that allows users to enter experimental data for metabolites, genes and pathways and display them in the context of relevant metabolic networks. Metscape 2 uses an internal relational database that integrates data from KEGG and EHMN databases. The new version of the tool allows users to identify enriched pathways from expression profiling data, build and analyze the networks of genes and metabolites, and visualize changes in the gene/metabolite data. We demonstrate the applications of Metscape to annotate molecular pathways for human and mouse metabolites implicated in the pathogenesis of sepsis-induced acute lung injury, for the analysis of gene expression and metabolite data from pancreatic ductal adenocarcinoma, and for identification of the candidate metabolites involved in cancer and inflammation. AVAILABILITY: Metscape is part of the National Institutes of Health-supported National Center for Integrative Biomedical Informatics (NCIBI) suite of tools, freely available at http://metscape.ncibi.org. It can be downloaded from http://cytoscape.org or installed via Cytoscape plugin manager. CONTACT: metscape-help@umich.edu; akarnovs@umich.edu SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Perfilación de la Expresión Génica , Metabolómica , Programas Informáticos , Adenocarcinoma/genética , Adenocarcinoma/metabolismo , Animales , Humanos , Inflamación/metabolismo , Redes y Vías Metabólicas , Ratones , Neoplasias Pancreáticas/genética , Neoplasias Pancreáticas/metabolismo , Proteómica , Sepsis/metabolismo
2.
BMC Bioinformatics ; 12: 81, 2011 Mar 20.
Artículo en Inglés | MEDLINE | ID: mdl-21418606

RESUMEN

BACKGROUND: Gene set enrichment testing has helped bridge the gap from an individual gene to a systems biology interpretation of microarray data. Although gene sets are defined a priori based on biological knowledge, current methods for gene set enrichment testing treat all genes equal. It is well-known that some genes, such as those responsible for housekeeping functions, appear in many pathways, whereas other genes are more specialized and play a unique role in a single pathway. Drawing inspiration from the field of information retrieval, we have developed and present here an approach to incorporate gene appearance frequency (in KEGG pathways) into two current methods, Gene Set Enrichment Analysis (GSEA) and logistic regression-based LRpath framework, to generate more reproducible and biologically meaningful results. RESULTS: Two breast cancer microarray datasets were analyzed to identify gene sets differentially expressed between histological grade 1 and 3 breast cancer. The correlation of Normalized Enrichment Scores (NES) between gene sets, generated by the original GSEA and GSEA with the appearance frequency of genes incorporated (GSEA-AF), was compared. GSEA-AF resulted in higher correlation between experiments and more overlapping top gene sets. Several cancer related gene sets achieved higher NES in GSEA-AF as well. The same datasets were also analyzed by LRpath and LRpath with the appearance frequency of genes incorporated (LRpath-AF). Two well-studied lung cancer datasets were also analyzed in the same manner to demonstrate the validity of the method, and similar results were obtained. CONCLUSIONS: We introduce an alternative way to integrate KEGG PATHWAY information into gene set enrichment testing. The performance of GSEA and LRpath can be enhanced with the integration of appearance frequency of genes. We conclude that, generally, gene set analysis methods with the integration of information from KEGG PATHWAY performs better both statistically and biologically.


Asunto(s)
Perfilación de la Expresión Génica/métodos , Regulación de la Expresión Génica , Frecuencia de los Genes , Humanos , Almacenamiento y Recuperación de la Información , Análisis de Secuencia por Matrices de Oligonucleótidos , Reproducibilidad de los Resultados
3.
Bioinformatics ; 26(7): 971-3, 2010 Apr 01.
Artículo en Inglés | MEDLINE | ID: mdl-20139469

RESUMEN

SUMMARY: Metscape is a plug-in for Cytoscape, used to visualize and interpret metabolomic data in the context of human metabolic networks. We have developed a metabolite database by extracting and integrating information from several public sources. By querying this database, Metscape allows users to trace the connections between metabolites and genes, visualize compound networks and display compound structures as well as information for reactions, enzymes, genes and pathways. Applying the pathway filter, users can create subnetworks that consist of compounds and reactions from a given pathway. Metscape allows users to upload experimental data, and visualize and explore compound networks over time, or experimental conditions. Color and size of the nodes are used to visualize these dynamic changes. Metscape can display the entire metabolic network or any of the pathway-specific networks that exist in the database. AVAILABILITY: Metscape can be installed from within Cytoscape 2.6.x under 'Network and Attribute I/O' category. For more information, please visit http://metscape.ncibi.org/tryplugin.html.


Asunto(s)
Redes y Vías Metabólicas , Metabolómica/métodos , Programas Informáticos , Bases de Datos Factuales , Humanos
4.
Bioinformatics ; 26(4): 456-63, 2010 Feb 15.
Artículo en Inglés | MEDLINE | ID: mdl-20007254

RESUMEN

MOTIVATION: The elucidation of biological concepts enriched with differentially expressed genes has become an integral part of the analysis and interpretation of genomic data. Of additional importance is the ability to explore networks of relationships among previously defined biological concepts from diverse information sources, and to explore results visually from multiple perspectives. Accomplishing these tasks requires a unified framework for agglomeration of data from various genomic resources, novel visualizations, and user functionality. RESULTS: We have developed ConceptGen, a web-based gene set enrichment and gene set relation mapping tool that is streamlined and simple to use. ConceptGen offers over 20,000 concepts comprising 14 different types of biological knowledge, including data not currently available in any other gene set enrichment or gene set relation mapping tool. We demonstrate the functionalities of ConceptGen using gene expression data modeling TGF-beta-induced epithelial-mesenchymal transition and metabolomics data comparing metastatic versus localized prostate cancers.


Asunto(s)
Perfilación de la Expresión Génica/métodos , Reconocimiento de Normas Patrones Automatizadas/métodos , Programas Informáticos , Animales , Biología Computacional , Bases de Datos Genéticas , Redes Reguladoras de Genes , Humanos , Masculino , Metástasis de la Neoplasia/genética , Análisis de Secuencia por Matrices de Oligonucleótidos , Neoplasias Pancreáticas/genética , Factor de Crecimiento Transformador beta/genética , Factor de Crecimiento Transformador beta/metabolismo
5.
Nucleic Acids Res ; 37(Database issue): D642-6, 2009 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-18978014

RESUMEN

Molecular interaction data exists in a number of repositories, each with its own data format, molecule identifier and information coverage. Michigan molecular interactions (MiMI) assists scientists searching through this profusion of molecular interaction data. The original release of MiMI gathered data from well-known protein interaction databases, and deep merged this information while keeping track of provenance. Based on the feedback received from users, MiMI has been completely redesigned. This article describes the resulting MiMI Release 2 (MiMIr2). New functionality includes extension from proteins to genes and to pathways; identification of highlighted sentences in source publications; seamless two-way linkage with Cytoscape; query facilities based on MeSH/GO terms and other concepts; approximate graph matching to find relevant pathways; support for querying in bulk; and a user focus-group driven interface design. MiMI is part of the NIH's; National Center for Integrative Biomedical Informatics (NCIBI) and is publicly available at: http://mimi.ncibi.org.


Asunto(s)
Bases de Datos de Proteínas , Mapeo de Interacción de Proteínas , Proteínas/metabolismo , Gráficos por Computador , Proteínas/genética , Interfaz Usuario-Computador
6.
Bioinformatics ; 25(1): 137-8, 2009 Jan 01.
Artículo en Inglés | MEDLINE | ID: mdl-18812364

RESUMEN

UNLABELLED: The MiMI molecular interaction repository integrates data from multiple sources, resolves interactions to standard gene names and symbols, links to annotation data from GO, MeSH and PubMed and normalizes the descriptions of interaction type. Here, we describe a Cytoscape plugin that retrieves interaction and annotation data from MiMI and links out to multiple data sources and tools. Community annotation of the interactome is supported. AVAILABILITY: MiMI plugin v3.0.1 can be installed from within Cytoscape 2.6 using the Cytoscape plugin manager in 'Network and Attribute I/0' category. The plugin is also preloaded when Cytoscape is launched using Java WebStart at http://mimi.ncibi.org by querying a gene and clicking 'View in MiMI Plugin for Cytoscape' link.


Asunto(s)
Biología Computacional/métodos , Programas Informáticos , Bases de Datos Genéticas , Interfaz Usuario-Computador
7.
BMC Bioinformatics ; 10 Suppl 9: S3, 2009 Sep 17.
Artículo en Inglés | MEDLINE | ID: mdl-19761573

RESUMEN

BACKGROUND: Chronic renal diseases are currently classified based on morphological similarities such as whether they produce predominantly inflammatory or non-inflammatory responses. However, such classifications do not reliably predict the course of the disease and its response to therapy. In contrast, recent studies in diseases such as breast cancer suggest that a classification which includes molecular information could lead to more accurate diagnoses and prediction of treatment response. This article describes how we extracted gene expression profiles from biopsies of patients with chronic renal diseases, and used network visualizations and associated quantitative measures to rapidly analyze similarities and differences between the diseases. RESULTS: The analysis revealed three main regularities: (1) Many genes associated with a single disease, and fewer genes associated with many diseases. (2) Unexpected combinations of renal diseases that share relatively large numbers of genes. (3) Uniform concordance in the regulation of all genes in the network. CONCLUSION: The overall results suggest the need to define a molecular-based classification of renal diseases, in addition to hypotheses for the unexpected patterns of shared genes and the uniformity in gene concordance. Furthermore, the results demonstrate the utility of network analyses to rapidly understand complex relationships between diseases and regulated genes.


Asunto(s)
Biología Computacional/métodos , Redes Reguladoras de Genes , Enfermedades Renales/clasificación , Enfermedades Renales/genética , Perfilación de la Expresión Génica/métodos , Humanos , Análisis de Secuencia por Matrices de Oligonucleótidos
8.
Nucleic Acids Res ; 35(Database issue): D566-71, 2007 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-17130145

RESUMEN

Protein interaction data exists in a number of repositories. Each repository has its own data format, molecule identifier and supplementary information. Michigan Molecular Interactions (MiMI) assists scientists searching through this overwhelming amount of protein interaction data. MiMI gathers data from well-known protein interaction databases and deep-merges the information. Utilizing an identity function, molecules that may have different identifiers but represent the same real-world object are merged. Thus, MiMI allows the users to retrieve information from many different databases at once, highlighting complementary and contradictory information. To help scientists judge the usefulness of a piece of data, MiMI tracks the provenance of all data. Finally, a simple yet powerful user interface aids users in their queries, and frees them from the onerous task of knowing the data format or learning a query language. MiMI allows scientists to query all data, whether corroborative or contradictory, and specify which sources to utilize. MiMI is part of the National Center for Integrative Biomedical Informatics (NCIBI) and is publicly available at: http://mimi.ncibi.org.


Asunto(s)
Bases de Datos de Proteínas , Mapeo de Interacción de Proteínas , Internet , Interfaz Usuario-Computador
9.
Big Data ; 5(2): 73-84, 2017 06.
Artículo en Inglés | MEDLINE | ID: mdl-28632443

RESUMEN

Big data technology offers unprecedented opportunities to society as a whole and also to its individual members. At the same time, this technology poses significant risks to those it overlooks. In this article, we give an overview of recent technical work on diversity, particularly in selection tasks, discuss connections between diversity and fairness, and identify promising directions for future work that will position diversity as an important component of a data-responsible society. We argue that diversity should come to the forefront of our discourse, for reasons that are both ethical-to mitigate the risks of exclusion-and utilitarian, to enable more powerful, accurate, and engaging data analysis and use.


Asunto(s)
Interpretación Estadística de Datos , Algoritmos , Colaboración de las Masas , Investigación Empírica , Modelos Estadísticos , Selección de Personal
10.
Big Data ; 5(3): 177-188, 2017 09.
Artículo en Inglés | MEDLINE | ID: mdl-28816500

RESUMEN

Significant research challenges must be addressed in the cleaning, transformation, integration, modeling, and analytics of Big Data sources for finance. This article surveys the progress made so far in this direction and obstacles yet to be overcome. These are issues that are of interest to data-driven financial institutions in both corporate finance and consumer finance. These challenges are also of interest to the legal profession as well as to regulators. The discussion is relevant to technology firms that support the growing field of FinTech.


Asunto(s)
Administración Financiera , Modelos Estadísticos , Toma de Decisiones
11.
OMICS ; 7(1): 101-2, 2003.
Artículo en Inglés | MEDLINE | ID: mdl-12831566

RESUMEN

Protein data, from sequence and structure to interaction, is being generated through many diverse methodologies; it is stored and reported in numerous forms and multiple places. The magnitude of the data limits researchers abilities to utilize all information generated. Effective integration of protein data can be accomplished through better data modeling. We demonstrate this through the MIPD project.


Asunto(s)
Proteínas/química , Integración de Sistemas , Modelos Moleculares
12.
J Am Med Inform Assoc ; 19(2): 166-70, 2012.
Artículo en Inglés | MEDLINE | ID: mdl-22101971

RESUMEN

The National Center for Integrative and Biomedical Informatics (NCIBI) is one of the eight NCBCs. NCIBI supports information access and data analysis for biomedical researchers, enabling them to build computational and knowledge models of biological systems to address the Driving Biological Problems (DBPs). The NCIBI DBPs have included prostate cancer progression, organ-specific complications of type 1 and 2 diabetes, bipolar disorder, and metabolic analysis of obesity syndrome. Collaborating with these and other partners, NCIBI has developed a series of software tools for exploratory analysis, concept visualization, and literature searches, as well as core database and web services resources. Many of our training and outreach initiatives have been in collaboration with the Research Centers at Minority Institutions (RCMI), integrating NCIBI and RCMI faculty and students, culminating each year in an annual workshop. Our future directions include focusing on the TranSMART data sharing and analysis initiative.


Asunto(s)
Investigación Biomédica , Difusión de la Información , Medicina Integrativa , Informática Médica , Bases de Datos como Asunto , Predicción , Objetivos , National Institutes of Health (U.S.) , Estados Unidos
13.
BMC Med Genomics ; 3: 49, 2010 Oct 27.
Artículo en Inglés | MEDLINE | ID: mdl-20979611

RESUMEN

BACKGROUND: Reactive oxygen species (ROS) are known mediators of cellular damage in multiple diseases including diabetic complications. Despite its importance, no comprehensive database is currently available for the genes associated with ROS. METHODS: We present ROS- and diabetes-related targets (genes/proteins) collected from the biomedical literature through a text mining technology. A web-based literature mining tool, SciMiner, was applied to 1,154 biomedical papers indexed with diabetes and ROS by PubMed to identify relevant targets. Over-represented targets in the ROS-diabetes literature were obtained through comparisons against randomly selected literature. The expression levels of nine genes, selected from the top ranked ROS-diabetes set, were measured in the dorsal root ganglia (DRG) of diabetic and non-diabetic DBA/2J mice in order to evaluate the biological relevance of literature-derived targets in the pathogenesis of diabetic neuropathy. RESULTS: SciMiner identified 1,026 ROS- and diabetes-related targets from the 1,154 biomedical papers (http://jdrf.neurology.med.umich.edu/ROSDiabetes/). Fifty-three targets were significantly over-represented in the ROS-diabetes literature compared to randomly selected literature. These over-represented targets included well-known members of the oxidative stress response including catalase, the NADPH oxidase family, and the superoxide dismutase family of proteins. Eight of the nine selected genes exhibited significant differential expression between diabetic and non-diabetic mice. For six genes, the direction of expression change in diabetes paralleled enhanced oxidative stress in the DRG. CONCLUSIONS: Literature mining compiled ROS-diabetes related targets from the biomedical literature and led us to evaluate the biological relevance of selected targets in the pathogenesis of diabetic neuropathy.


Asunto(s)
Minería de Datos/métodos , Diabetes Mellitus/genética , Diabetes Mellitus/metabolismo , Publicaciones Periódicas como Asunto , Especies Reactivas de Oxígeno/metabolismo , Investigación , Animales , Nefropatías Diabéticas/genética , Nefropatías Diabéticas/metabolismo , Ganglios Espinales/metabolismo , Perfilación de la Expresión Génica , Humanos , Ratones , Proteínas/metabolismo , Células Receptoras Sensoriales/metabolismo
15.
PLoS One ; 3(5): e2265, 2008 May 28.
Artículo en Inglés | MEDLINE | ID: mdl-18509477

RESUMEN

The advancement of the computational biology field hinges on progress in three fundamental directions--the development of new computational algorithms, the availability of informatics resource management infrastructures and the capability of tools to interoperate and synergize. There is an explosion in algorithms and tools for computational biology, which makes it difficult for biologists to find, compare and integrate such resources. We describe a new infrastructure, iTools, for managing the query, traversal and comparison of diverse computational biology resources. Specifically, iTools stores information about three types of resources--data, software tools and web-services. The iTools design, implementation and resource meta-data content reflect the broad research, computational, applied and scientific expertise available at the seven National Centers for Biomedical Computing. iTools provides a system for classification, categorization and integration of different computational biology resources across space-and-time scales, biomedical problems, computational infrastructures and mathematical foundations. A large number of resources are already iTools-accessible to the community and this infrastructure is rapidly growing. iTools includes human and machine interfaces to its resource meta-data repository. Investigators or computer programs may utilize these interfaces to search, compare, expand, revise and mine meta-data descriptions of existent computational biology resources. We propose two ways to browse and display the iTools dynamic collection of resources. The first one is based on an ontology of computational biology resources, and the second one is derived from hyperbolic projections of manifolds or complex structures onto planar discs. iTools is an open source project both in terms of the source code development as well as its meta-data content. iTools employs a decentralized, portable, scalable and lightweight framework for long-term resource management. We demonstrate several applications of iTools as a framework for integrated bioinformatics. iTools and the complete details about its specifications, usage and interfaces are available at the iTools web page http://iTools.ccb.ucla.edu.


Asunto(s)
Biología Computacional , Internet , Bases de Datos Factuales , Integración de Sistemas
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA