Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 14 de 14
Filtrar
1.
Curr Pharm Des ; 20(38): 5945-56, 2014.
Artículo en Inglés | MEDLINE | ID: mdl-24641232

RESUMEN

Non-Communicable Diseases (NCDs) are among the most pressing global health problems of the twenty-first century. Their rising incidence and prevalence is linked to severe morbidity and mortality, and they are putting economic and managerial pressure on healthcare systems around the world. Moreover, NCDs are impeding healthy aging by negatively affecting the quality of life of a growing number of the global population. NCDs result from the interaction of various genetic, environmental and habitual factors, and cluster in complex ways, making the complex identification of resulting phenotypes not only difficult, but also a top research priority. The degree of complexity required to interpret large patient datasets generated by advanced high-throughput functional genomics assays has now increased to the point that novel computational biology approaches are essential to extract information that is relevant to the clinical decision-making process. Consequently, system-level models that interpret the interactions between extensive tissues, cellular and molecular measurements and clinical features are also being created to identify new disease phenotypes, so that disease definition and treatment are optimized, and novel therapeutic targets discovered. Likewise, Systems Medicine (SM) platforms applied to extensively-characterized patients provide a basis for more targeted clinical trials, and represent a promising tool to achieve better prevention and patient care, thereby promoting healthy aging globally. The present paper: (1) reviews the novel systems approaches to NCDs; (2) discusses how to move efficiently from Systems Biology to Systems Medicine; and (3) presents the scientific and clinical background of the San Raffaele Systems Medicine Platform.


Asunto(s)
Medicina Clínica/métodos , Comprensión , Manejo de la Enfermedad , Biología de Sistemas/métodos , Medicina Clínica/tendencias , Humanos , Biología de Sistemas/tendencias
2.
PLoS Comput Biol ; 8(2): e1002365, 2012 Feb.
Artículo en Inglés | MEDLINE | ID: mdl-22319435

RESUMEN

Elucidation of new biomarkers and potential drug targets from high-throughput profiling data is a challenging task due to a limited number of available biological samples and questionable reproducibility of differential changes in cross-dataset comparisons. In this paper we propose a novel computational approach for drug and biomarkers discovery using comprehensive analysis of multiple expression profiling datasets.The new method relies on aggregation of individual profiling experiments combined with leave-one-dataset-out validation approach. Aggregated datasets were studied using Sub-Network Enrichment Analysis algorithm (SNEA) to find consistent statistically significant key regulators within the global literature-extracted expression regulation network. These regulators were linked to the consistent differentially expressed genes.We have applied our approach to several publicly available human muscle gene expression profiling datasets related to Duchenne muscular dystrophy (DMD). In order to detect both enhanced and repressed processes we considered up- and down-regulated genes separately. Applying the proposed approach to the regulators search we discovered the disturbance in the activity of several muscle-related transcription factors (e.g. MYOG and MYOD1), regulators of inflammation, regeneration, and fibrosis. Almost all SNEA-derived regulators of down-regulated genes (e.g. AMPK, TORC2, PPARGC1A) correspond to a single common pathway important for fast-to-slow twitch fiber type transition. We hypothesize that this process can affect the severity of DMD symptoms, making corresponding regulators and downstream genes valuable candidates for being potential drug targets and exploratory biomarkers.


Asunto(s)
Biología Computacional/métodos , Descubrimiento de Drogas/métodos , Perfilación de la Expresión Génica/métodos , Distrofia Muscular de Duchenne/tratamiento farmacológico , Algoritmos , Biomarcadores/análisis , Bases de Datos Genéticas , Humanos , Masculino , Metaanálisis como Asunto , Distrofia Muscular de Duchenne/genética , Distrofia Muscular de Duchenne/metabolismo , Análisis de Secuencia por Matrices de Oligonucleótidos
3.
Am J Cancer Res ; 2(1): 93-103, 2012.
Artículo en Inglés | MEDLINE | ID: mdl-22206048

RESUMEN

Non-small cell lung cancer (NSCLC) is the most common type of lung cancer, with a poor response to chemotherapy and low survival rate. This unfavorable treatment response is likely to derive from both late diagnosis and from complex, incompletely understood biology, and heterogeneity among NSCLC subtypes. To define the relative contributions of major cellular pathways to the biogenesis of NSCLC and highlight major differences between NSCLC subtypes, we studied the molecular signatures of lung adenocarcinoma (ADC) and squamous cell carcinoma (SCC), based on analysis of gene expression and comparison of tumor samples with normal lung tissue. Our results suggest the existence of specific molecular networks and subtype-specific differences between lung ADC and SCC subtypes, mostly found in cell cycle, DNA repair, and metabolic pathways. However, we also observed similarities across major gene interaction networks and pathways in ADC and SCC. These data provide a new insight into the biology of ADC and SCC and can be used to explore novel therapeutic interventions in lung cancer chemoprevention and treatment.

4.
Genes Cancer ; 2(9): 870-9, 2011 Sep.
Artículo en Inglés | MEDLINE | ID: mdl-22593799

RESUMEN

Triple-negative breast cancer (TNBC) is an aggressive breast cancer subtype with a high rate of proliferation and metastasis, as well as poor prognosis for advanced-stage disease. Although TNBC was previously classified together with basal-like and BRCA1/2-related breast cancers, genomic profiling now shows that there is incomplete overlap, with important distinctions associated with each subtype. The biology of TNBC is still poorly understood; therefore, to define the relative contributions of major cellular pathways in TNBC, we have studied its molecular signature based on analysis of gene expression. Comparisons were then made with normal breast tissue. Our results suggest the existence of molecular networks in TNBC, characterized by explicit alterations in the cell cycle, DNA repair, nucleotide synthesis, metabolic pathways, NF-κB signaling, inflammatory response, and angiogenesis. Moreover, we also characterized TNBC as a cancer of mixed phenotypes, suggesting that TNBC extends beyond the basal-like molecular signature and may constitute an independent subtype of breast cancer. The data provide a new insight into the biology of TNBC.

5.
J Bioinform Comput Biol ; 8(3): 593-606, 2010 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-20556864

RESUMEN

Heterogeneous high-throughput biological data become readily available for various diseases. The amount of data points generated by such experiments does not allow manual integration of the information to design the most optimal therapy for a disease. We describe a novel computational workflow for designing therapy using Ariadne Genomics Pathway Studio software. We use publically available microarray experiments for glioblastoma and automatically constructed ResNet and ChemEffect databases to exemplify how to find potentially effective chemicals for glioblastoma--the disease yet without effective treatment. Our first approach involved construction of signaling pathway affected in glioblastoma using scientific literature and data available in ResNet database. Compounds known to affect multiple proteins in this pathway were found in ChemEffect database. Another approach involved analysis of differential expression in glioblastoma patients using Sub-Network Enrichment Analysis (SNEA). SNEA identified angiogenesis-related protein Cyr61 as the major positive regulator upstream of genes differentially expressed in glioblastoma. Using our findings, we then identified breast cancer drug Fulvestrant as a major inhibitor of glioblastoma pathway as well as Cyr61. This suggested Fulvestrant as a potential treatment against glioblastoma. We further show how to increase efficacy of glioblastoma treatment by finding optimal combinations of Fulvestrant with other drugs.


Asunto(s)
Antineoplásicos/administración & dosificación , Técnicas Químicas Combinatorias/métodos , Glioblastoma/tratamiento farmacológico , Glioblastoma/metabolismo , Modelos Biológicos , Proteínas de Neoplasias/metabolismo , Transducción de Señal/efectos de los fármacos , Animales , Simulación por Computador , Diseño de Fármacos , Humanos
6.
PLoS One ; 5(2): e9256, 2010 Feb 17.
Artículo en Inglés | MEDLINE | ID: mdl-20174649

RESUMEN

Microarray-based expression profiling of living systems is a quick and inexpensive method to obtain insights into the nature of various diseases and phenotypes. A typical microarray profile can yield hundreds or even thousands of differentially expressed genes and finding biologically plausible themes or regulatory mechanisms underlying these changes is a non-trivial and daunting task. We describe a novel approach for systems-level interpretation of microarray expression data using a manually constructed "overview" pathway depicting the main cellular signaling channels (Atlas of Signaling). Currently, the developed pathway focuses on signal transduction from surface receptors to transcription factors and further transcriptional regulation of cellular "workhorse" proteins. We show how the constructed Atlas of Signaling in combination with an enrichment analysis algorithm allows quick identification and visualization of the main signaling cascades and cellular processes affected in a gene expression profiling experiment. We validate our approach using several publicly available gene expression datasets.


Asunto(s)
Perfilación de la Expresión Génica/métodos , Redes Reguladoras de Genes , Análisis de Secuencia por Matrices de Oligonucleótidos/métodos , Transducción de Señal/genética , Algoritmos , Regulación de la Expresión Génica , Modelos Genéticos , Proteoma/genética , Programas Informáticos
7.
Expert Opin Drug Discov ; 4(12): 1307-18, 2009 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-23480468

RESUMEN

IMPORTANCE OF THE FIELD: Drug discovery and development is a very complex and costly process. Understanding the detailed molecular mechanisms of a disease and drug actions can make it more efficient not only for new target discovery but also for lead prioritization, drug repositioning and development of biomarkers for drug efficacy and safety. Access to formalized knowledge about functions of proteins and small molecules is crucial for rationalization of the drug development process, and scientific publications are the main source of this knowledge. Protein knowledge networks capturing protein functions, protein-protein relations and organization of proteins in complex cellular sub-systems are making their way into modern drug discovery. Chemical networks representing multiple aspects of chemical functional information integrated into a protein systems biology network is even more advanced and promising paradigm. AREAS COVERED IN THIS REVIEW: This review describes utilization of literature-derived protein and chemical functional knowledge bases in drug development. WHAT THE READER WILL GAIN: Readers will gain an understanding of how integrated protein and chemical knowledge networks can be used for understanding and building the models of cellular events, disease mechanisms, and drug actions, finding biomarkers of drug efficacy and safety, as well as interpretation of high-throughput gene expression, proteomic and metabolomic experiments. TAKE HOME MESSAGE: Integrated literature-derived protein and chemical knowledge bases can rationalize many aspects of drug development process including drug repositioning and biomarker design.

8.
BMC Bioinformatics ; 8: 243, 2007 Jul 10.
Artículo en Inglés | MEDLINE | ID: mdl-17620146

RESUMEN

BACKGROUND: Uncovering cellular roles of a protein is a task of tremendous importance and complexity that requires dedicated experimental work as well as often sophisticated data mining and processing tools. Protein functions, often referred to as its annotations, are believed to manifest themselves through topology of the networks of inter-proteins interactions. In particular, there is a growing body of evidence that proteins performing the same function are more likely to interact with each other than with proteins with other functions. However, since functional annotation and protein network topology are often studied separately, the direct relationship between them has not been comprehensively demonstrated. In addition to having the general biological significance, such demonstration would further validate the data extraction and processing methods used to compose protein annotation and protein-protein interactions datasets. RESULTS: We developed a method for automatic extraction of protein functional annotation from scientific text based on the Natural Language Processing (NLP) technology. For the protein annotation extracted from the entire PubMed, we evaluated the precision and recall rates, and compared the performance of the automatic extraction technology to that of manual curation used in public Gene Ontology (GO) annotation. In the second part of our presentation, we reported a large-scale investigation into the correspondence between communities in the literature-based protein networks and GO annotation groups of functionally related proteins. We found a comprehensive two-way match: proteins within biological annotation groups form significantly denser linked network clusters than expected by chance and, conversely, densely linked network communities exhibit a pronounced non-random overlap with GO groups. We also expanded the publicly available GO biological process annotation using the relations extracted by our NLP technology. An increase in the number and size of GO groups without any noticeable decrease of the link density within the groups indicated that this expansion significantly broadens the public GO annotation without diluting its quality. We revealed that functional GO annotation correlates mostly with clustering in a physical interaction protein network, while its overlap with indirect regulatory network communities is two to three times smaller. CONCLUSION: Protein functional annotations extracted by the NLP technology expand and enrich the existing GO annotation system. The GO functional modularity correlates mostly with the clustering in the physical interaction network, suggesting that the essential role of structural organization maintained by these interactions. Reciprocally, clustering of proteins in physical interaction networks can serve as an evidence for their functional similarity.


Asunto(s)
Biología Computacional/métodos , Bases de Datos Genéticas/clasificación , Genes , Reconocimiento de Normas Patrones Automatizadas/métodos , Proteínas/fisiología , Análisis por Conglomerados , Biología Computacional/normas , Bases de Datos Genéticas/normas , Bases de Datos de Proteínas , Almacenamiento y Recuperación de la Información , Procesamiento de Lenguaje Natural , Reconocimiento de Normas Patrones Automatizadas/normas , Mapeo de Interacción de Proteínas , PubMed , Reproducibilidad de los Resultados , Terminología como Asunto
9.
J Bioinform Comput Biol ; 5(2B): 429-56, 2007 Apr.
Artículo en Inglés | MEDLINE | ID: mdl-17636854

RESUMEN

Microarray-based characterization of tissues, cellular and disease states, and environmental condition and treatment responses provides genome-wide snapshots containing large amounts of invaluable information. However, the lack of inherent structure within the data and strong noise make extracting and interpreting this information and formulating and prioritizing domain relevant hypotheses difficult tasks. Integration with different types of biological data is required to place the expression measurements into a biologically meaningful context. A few approaches in microarray data interpretation are discussed with the emphasis on the use of molecular network information. Statistical procedures are demonstrated that superimpose expression data onto the transcription regulation network mined from scientific literature and aim at selecting transcription regulators with significant patterns of expression changes downstream. Tests are suggested that take into account network topology and signs of transcription regulation effects. The approaches are illustrated using two different expression datasets, the performance is compared, and biological relevance of the predictions is discussed.


Asunto(s)
Algoritmos , Perfilación de la Expresión Génica/métodos , Modelos Biológicos , Análisis de Secuencia por Matrices de Oligonucleótidos/métodos , Proteoma/metabolismo , Transducción de Señal/fisiología , Transcripción Genética/fisiología , Simulación por Computador
10.
BMC Bioinformatics ; 7: 171, 2006 Mar 24.
Artículo en Inglés | MEDLINE | ID: mdl-16563163

RESUMEN

BACKGROUND: Scientific literature is a source of the most reliable and comprehensive knowledge about molecular interaction networks. Formalization of this knowledge is necessary for computational analysis and is achieved by automatic fact extraction using various text-mining algorithms. Most of these techniques suffer from high false positive rates and redundancy of the extracted information. The extracted facts form a large network with no pathways defined. RESULTS: We describe the methodology for automatic curation of Biological Association Networks (BANs) derived by a natural language processing technology called Medscan. The curated data is used for automatic pathway reconstruction. The algorithm for the reconstruction of signaling pathways is also described and validated by comparison with manually curated pathways and tissue-specific gene expression profiles. CONCLUSION: Biological Association Networks extracted by MedScan technology contain sufficient information for constructing thousands of mammalian signaling pathways for multiple tissues. The automatically curated MedScan data is adequate for automatic generation of good quality signaling networks. The automatically generated Regulome pathways and manually curated pathways used for their validation are available free in the ResNetCore database from Ariadne Genomics, Inc. 1. The pathways can be viewed and analyzed through the use of a free demo version of PathwayStudio software. The Medscan technology is also available for evaluation using the free demo version of PathwayStudio software.


Asunto(s)
Bases de Datos Bibliográficas , Procesamiento de Lenguaje Natural , Publicaciones Periódicas como Asunto , Mapeo de Interacción de Proteínas/métodos , Proteínas/clasificación , Proteínas/metabolismo , Transducción de Señal/fisiología , Almacenamiento y Recuperación de la Información/métodos , Programas Informáticos
11.
Bioinformatics ; 20(5): 604-11, 2004 Mar 22.
Artículo en Inglés | MEDLINE | ID: mdl-15033866

RESUMEN

MOTIVATION: The living cell is a complex machine that depends on the proper functioning of its numerous parts, including proteins. Understanding protein functions and how they modify and regulate each other is the next great challenge for life-sciences researchers. The collective knowledge about protein functions and pathways is scattered throughout numerous publications in scientific journals. Bringing the relevant information together becomes a bottleneck in a research and discovery process. The volume of such information grows exponentially, which renders manual curation impractical. As a viable alternative, automated literature processing tools could be employed to extract and organize biological data into a knowledge base, making it amenable to computational analysis and data mining. RESULTS: We present MedScan, a completely automated natural language processing-based information extraction system. We have used MedScan to extract 2976 interactions between human proteins from MEDLINE abstracts dated after 1988. The precision of the extracted information was found to be 91%. Comparison with the existing protein interaction databases BIND and DIP revealed that 96% of extracted information is novel. The recall rate of MedScan was found to be 21%. Additional experiments with MedScan suggest that MEDLINE is a unique source of diverse protein function information, which can be extracted in a completely automated way with a reasonably high precision. Further directions of the MedScan technology improvement are discussed. AVAILABILITY: MedScan is available for commercial licensing from Ariadne Genomics, Inc.


Asunto(s)
Inteligencia Artificial , Bases de Datos de Proteínas , Almacenamiento y Recuperación de la Información/métodos , MEDLINE , Procesamiento de Lenguaje Natural , Mapeo de Interacción de Proteínas/métodos , Proteínas/metabolismo , Indización y Redacción de Resúmenes , Sistemas de Administración de Bases de Datos , Humanos , Reconocimiento de Normas Patrones Automatizadas , Publicaciones Periódicas como Asunto , Reproducibilidad de los Resultados , Semántica , Sensibilidad y Especificidad , Vocabulario Controlado
12.
J Am Med Inform Assoc ; 11(3): 174-8, 2004.
Artículo en Inglés | MEDLINE | ID: mdl-14764613

RESUMEN

OBJECTIVE: The aim of this study was to develop a practical and efficient protein identification system for biomedical corpora. DESIGN: The developed system, called ProtScan, utilizes a carefully constructed dictionary of mammalian proteins in conjunction with a specialized tokenization algorithm to identify and tag protein name occurrences in biomedical texts and also takes advantage of Medline "Name-of-Substance" (NOS) annotation. The dictionaries for ProtScan were constructed in a semi-automatic way from various public-domain sequence databases followed by an intensive expert curation step. MEASUREMENTS: The recall and precision of the system have been determined using 1000 randomly selected and hand-tagged Medline abstracts. RESULTS: The developed system is capable of identifying protein occurrences in Medline abstracts with a 98% precision and 88% recall. It was also found to be capable of processing approximately 300 abstracts per second. Without utilization of NOS annotation, precision and recall were found to be 98.5% and 84%, respectively. CONCLUSION: The developed system appears to be well suited for protein-based Medline indexing and can help to improve biomedical information retrieval. Further approaches to ProtScan's recall improvement also are discussed.


Asunto(s)
Almacenamiento y Recuperación de la Información/métodos , MEDLINE , Proteínas , Terminología como Asunto , Indización y Redacción de Resúmenes , Algoritmos , Animales , Diccionarios como Asunto , Mamíferos
13.
Bioinformatics ; 19(16): 2155-7, 2003 Nov 01.
Artículo en Inglés | MEDLINE | ID: mdl-14594725

RESUMEN

SUMMARY: PathwayAssist is a software application developed for navigation and analysis of biological pathways, gene regulation networks and protein interaction maps. It comes with the built-in natural language processing module MedScan and the comprehensive database describing more than 100 000 events of regulation, interaction and modification between proteins, cell processes and small molecules. AVAILABILITY: PathwayAssist is available for commercial licensing from Ariadne Genomics, Inc. The light version with limited functionality will be available for free for academic users at www.ariadnegenomics.com/downloads/.


Asunto(s)
Sistemas de Administración de Bases de Datos , Bases de Datos de Proteínas , Regulación de la Expresión Génica/fisiología , Almacenamiento y Recuperación de la Información/métodos , Metabolismo/fisiología , Mapeo de Interacción de Proteínas/métodos , Programas Informáticos , Interfaz Usuario-Computador , Modelos Biológicos
14.
Bioinformatics ; 19(13): 1699-706, 2003 Sep 01.
Artículo en Inglés | MEDLINE | ID: mdl-12967967

RESUMEN

MOTIVATION: The importance of extracting biomedical information from scientific publications is well recognized. A number of information extraction systems for the biomedical domain have been reported, but none of them have become widely used in practical applications. Most proposals to date make rather simplistic assumptions about the syntactic aspect of natural language. There is an urgent need for a system that has broad coverage and performs well in real-text applications. RESULTS: We present a general biomedical domain-oriented NLP engine called MedScan that efficiently processes sentences from MEDLINE abstracts and produces a set of regularized logical structures representing the meaning of each sentence. The engine utilizes a specially developed context-free grammar and lexicon. Preliminary evaluation of the system's performance, accuracy, and coverage exhibited encouraging results. Further approaches for increasing the coverage and reducing parsing ambiguity of the engine, as well as its application for information extraction are discussed.


Asunto(s)
Indización y Redacción de Resúmenes/métodos , Inteligencia Artificial , Sistemas de Administración de Bases de Datos , Almacenamiento y Recuperación de la Información/métodos , MEDLINE , Procesamiento de Lenguaje Natural , Reconocimiento de Normas Patrones Automatizadas , Publicaciones Periódicas como Asunto , Descriptores , Vocabulario Controlado
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...