RESUMO
BACKGROUND: Despite the unprecedented and increasing amount of data, relatively little progress has been made in molecular characterization of mechanisms underlying Parkinson's disease. In the area of Parkinson's research, there is a pressing need to integrate various pieces of information into a meaningful context of presumed disease mechanism(s). Disease ontologies provide a novel means for organizing, integrating, and standardizing the knowledge domains specific to disease in a compact, formalized and computer-readable form and serve as a reference for knowledge exchange or systems modeling of disease mechanism. METHODS: The Parkinson's disease ontology was built according to the life cycle of ontology building. Structural, functional, and expert evaluation of the ontology was performed to ensure the quality and usability of the ontology. A novelty metric has been introduced to measure the gain of new knowledge using the ontology. Finally, a cause-and-effect model was built around PINK1 and two gene expression studies from the Gene Expression Omnibus database were re-annotated to demonstrate the usability of the ontology. RESULTS: The Parkinson's disease ontology with a subclass-based taxonomic hierarchy covers the broad spectrum of major biomedical concepts from molecular to clinical features of the disease, and also reflects different views on disease features held by molecular biologists, clinicians and drug developers. The current version of the ontology contains 632 concepts, which are organized under nine views. The structural evaluation showed the balanced dispersion of concept classes throughout the ontology. The functional evaluation demonstrated that the ontology-driven literature search could gain novel knowledge not present in the reference Parkinson's knowledge map. The ontology was able to answer specific questions related to Parkinson's when evaluated by experts. Finally, the added value of the Parkinson's disease ontology is demonstrated by ontology-driven modeling of PINK1 and re-annotation of gene expression datasets relevant to Parkinson's disease. CONCLUSIONS: Parkinson's disease ontology delivers the knowledge domain of Parkinson's disease in a compact, computer-readable form, which can be further edited and enriched by the scientific community and also to be used to construct, represent and automatically extend Parkinson's-related computable models. A practical version of the Parkinson's disease ontology for browsing and editing can be publicly accessed at http://bioportal.bioontology.org/ontologies/PDON .
Assuntos
Ontologia Genética , Conhecimento , Doença de Parkinson/genética , Software , Animais , Bases de Dados Genéticas , Modelos Animais de Doenças , Regulação da Expressão Gênica , Redes Reguladoras de Genes , Humanos , Anotação de Sequência Molecular , Doença de Parkinson/etiologiaRESUMO
INTRODUCTION: The discovery and development of new treatments for Alzheimer's disease (AD) requires a profound mechanistic understanding of the disease. Here, we propose a model-driven approach supporting the systematic identification of putative disease mechanisms. METHODS: We have created a model for AD and a corresponding model for the normal physiology of neurons using biological expression language to systematically model causal and correlative relationships between biomolecules, pathways, and clinical readouts. Through model-model comparison we identify "chains of causal relationships" that lead to new insights into putative disease mechanisms. RESULTS: Using differential analysis of our models we identified a new mechanism explaining the effect of amyloid-beta on apoptosis via both the neurotrophic tyrosine kinase receptor, type 2 and nerve growth factor receptor branches of the neurotrophin signaling pathway. We also provide the example of a model-guided interpretation of genetic variation data for a comorbidity analysis between AD and type 2 diabetes mellitus. DISCUSSION: The two computable, literature-based models introduced here provide a powerful framework for the generation and validation of rational, testable hypotheses across disease areas.
Assuntos
Doença de Alzheimer/fisiopatologia , Modelos Neurológicos , Neurônios/fisiologia , Doença de Alzheimer/complicações , Doença de Alzheimer/epidemiologia , Doença de Alzheimer/genética , Precursor de Proteína beta-Amiloide/metabolismo , Animais , Encéfalo/fisiologia , Encéfalo/fisiopatologia , Comorbidade , Humanos , Polimorfismo de Nucleotídeo ÚnicoRESUMO
Since the decoding of the Human Genome, techniques from bioinformatics, statistics, and machine learning have been instrumental in uncovering patterns in increasing amounts and types of different data produced by technical profiling technologies applied to clinical samples, animal models, and cellular systems. Yet, progress on unravelling biological mechanisms, causally driving diseases, has been limited, in part due to the inherent complexity of biological systems. Whereas we have witnessed progress in the areas of cancer, cardiovascular and metabolic diseases, the area of neurodegenerative diseases has proved to be very challenging. This is in part because the aetiology of neurodegenerative diseases such as Alzheimer´s disease or Parkinson´s disease is unknown, rendering it very difficult to discern early causal events. Here we describe a panel of bioinformatics and modeling approaches that have recently been developed to identify candidate mechanisms of neurodegenerative diseases based on publicly available data and knowledge. We identify two complementary strategies-data mining techniques using genetic data as a starting point to be further enriched using other data-types, or alternatively to encode prior knowledge about disease mechanisms in a model based framework supporting reasoning and enrichment analysis. Our review illustrates the challenges entailed in integrating heterogeneous, multiscale and multimodal information in the area of neurology in general and neurodegeneration in particular. We conclude, that progress would be accelerated by increasing efforts on performing systematic collection of multiple data-types over time from each individual suffering from neurodegenerative disease. The work presented here has been driven by project AETIONOMY; a project funded in the course of the Innovative Medicines Initiative (IMI); which is a public-private partnership of the European Federation of Pharmaceutical Industry Associations (EFPIA) and the European Commission (EC).
Assuntos
Mineração de Dados , Doenças Neurodegenerativas/genética , Animais , Biologia Computacional , Predisposição Genética para Doença , Estudo de Associação Genômica Ampla , Humanos , Bases de Conhecimento , Polimorfismo de Nucleotídeo ÚnicoRESUMO
Speculative statements communicating experimental findings are frequently found in scientific articles, and their purpose is to provide an impetus for further investigations into the given topic. Automated recognition of speculative statements in scientific text has gained interest in recent years as systematic analysis of such statements could transform speculative thoughts into testable hypotheses. We describe here a pattern matching approach for the detection of speculative statements in scientific text that uses a dictionary of speculative patterns to classify sentences as hypothetical. To demonstrate the practical utility of our approach, we applied it to the domain of Alzheimer's disease and showed that our automated approach captures a wide spectrum of scientific speculations on Alzheimer's disease. Subsequent exploration of derived hypothetical knowledge leads to generation of a coherent overview on emerging knowledge niches, and can thus provide added value to ongoing research activities.
Assuntos
Modelos Teóricos , Doença de Alzheimer , Automação , Humanos , Processamento de Linguagem NaturalRESUMO
BACKGROUND: Biomedical ontologies offer the capability to structure and represent domain-specific knowledge semantically. Disease-specific ontologies can facilitate knowledge exchange across multiple disciplines, and ontology-driven mining approaches can generate great value for modeling disease mechanisms. However, in the case of neurodegenerative diseases such as Alzheimer's disease, there is a lack of formal representation of the relevant knowledge domain. METHODS: Alzheimer's disease ontology (ADO) is constructed in accordance to the ontology building life cycle. The Protégé OWL editor was used as a tool for building ADO in Ontology Web Language format. RESULTS: ADO was developed with the purpose of containing information relevant to four main biological views-preclinical, clinical, etiological, and molecular/cellular mechanisms-and was enriched by adding synonyms and references. Validation of the lexicalized ontology by means of named entity recognition-based methods showed a satisfactory performance (F score = 72%). In addition to structural and functional evaluation, a clinical expert in the field performed a manual evaluation and curation of ADO. Through integration of ADO into an information retrieval environment, we show that the ontology supports semantic search in scientific text. The usefulness of ADO is authenticated by dedicated use case scenarios. CONCLUSIONS: Development of ADO as an open ADO is a first attempt to organize information related to Alzheimer's disease in a formalized, structured manner. We demonstrate that ADO is able to capture both established and scattered knowledge existing in scientific text.
Assuntos
Doença de Alzheimer , Armazenamento e Recuperação da Informação , Modelos Biológicos , Biologia Computacional , HumanosRESUMO
MOTIVATION: There have been many successful experimental and bioinformatics efforts to elucidate transcription factor (TF)-target networks in several organisms. For many organisms, these annotations are complemented by miRNA-target networks of good quality. Attempts that use these networks in combination with gene expression data to draw conclusions on TF or miRNA activity are, however, still relatively sparse. RESULTS: In this study, we propose Bayesian inference of regulation of transcriptional activity (BIRTA) as a novel approach to infer both, TF and miRNA activities, from combined miRNA and mRNA expression data in a condition specific way. That means our model explains mRNA and miRNA expression for a specific experimental condition by the activities of certain miRNAs and TFs, hence allowing for differentiating between switches from active to inactive (negative switch) and inactive to active (positive switch) forms. Extensive simulations of our model reveal its good prediction performance in comparison to other approaches. Furthermore, the utility of BIRTA is demonstrated at the example of Escherichia coli data comparing aerobic and anaerobic growth conditions, and by human expression data from pancreas and ovarian cancer. AVAILABILITY AND IMPLEMENTATION: The method is implemented in the R package birta, which is freely available for Bio-conductor (>=2.10) on http://www.bioconductor.org/packages/release/bioc/html/birta.html.
Assuntos
Perfilação da Expressão Gênica , Redes Reguladoras de Genes , MicroRNAs/metabolismo , Fatores de Transcrição/metabolismo , Teorema de Bayes , Biologia Computacional/métodos , Escherichia coli/genética , Escherichia coli/metabolismo , Feminino , Regulação da Expressão Gênica , Humanos , Neoplasias Ovarianas/genética , Neoplasias Ovarianas/metabolismo , Neoplasias Pancreáticas/genética , Neoplasias Pancreáticas/metabolismo , RNA Mensageiro/metabolismoRESUMO
BACKGROUND: While the majority of studies have focused on the association between sex hormones and dementia, emerging evidence supports the role of other hormone signals in increasing dementia risk. However, due to the lack of an integrated view on mechanistic interactions of hormone signaling pathways associated with dementia, molecular mechanisms through which hormones contribute to the increased risk of dementia has remained unclear and capacity of translating hormone signals to potential therapeutic and diagnostic applications in relation to dementia has been undervalued. METHODS: Using an integrative knowledge- and data-driven approach, a global hormone interaction network in the context of dementia was constructed, which was further filtered down to a model of convergent hormone signaling pathways. This model was evaluated for its biological and clinical relevance through pathway recovery test, evidence-based analysis, and biomarker-guided analysis. Translational validation of the model was performed using the proposed novel mechanism discovery approach based on 'serendipitous off-target effects'. RESULTS: Our results reveal the existence of a well-connected hormone interaction network underlying dementia. Seven hormone signaling pathways converge at the core of the hormone interaction network, which are shown to be mechanistically linked to the risk of dementia. Amongst these pathways, estrogen signaling pathway takes the major part in the model and insulin signaling pathway is analyzed for its association to learning and memory functions. Validation of the model through serendipitous off-target effects suggests that hormone signaling pathways substantially contribute to the pathogenesis of dementia. CONCLUSIONS: The integrated network model of hormone interactions underlying dementia may serve as an initial translational platform for identifying potential therapeutic targets and candidate biomarkers for dementia-spectrum disorders such as Alzheimer's disease.
Assuntos
Demência/genética , Demência/metabolismo , Predisposição Genética para Doença , Hormônios/metabolismo , Pesquisa Translacional Biomédica/métodos , Algoritmos , Doença de Alzheimer/metabolismo , Biomarcadores/metabolismo , Análise por Conglomerados , Redes Reguladoras de Genes , Genômica , Humanos , Fenótipo , Transdução de SinaisRESUMO
MOTIVATION: Biomedical ontologies have proved to be valuable tools for data analysis and data interoperability. Protein-ligand interactions are key players in drug discovery and development; however, existing public ontologies that describe the knowledge space of biomolecular interactions do not cover all aspects relevant to pharmaceutical modelling and simulation. RESULTS: The protein--ligand interaction ontology (PLIO) was developed around three main concepts, namely target, ligand and interaction, and was enriched by adding synonyms, useful annotations and references. The quality of the ontology was assessed based on structural, functional and usability features. Validation of the lexicalized ontology by means of natural language processing (NLP)-based methods showed a satisfactory performance (F-score = 81%). Through integration into our information retrieval environment we can demonstrate that PLIO supports lexical search in PubMed abstracts. The usefulness of PLIO is demonstrated by two use-case scenarios and it is shown that PLIO is able to capture both confirmatory and new knowledge from simulation and empirical studies. AVAILABILITY: The PLIO ontology is made freely available to the public at http://www.scai.fraunhofer.de/bioinformatics/downloads.html.
Assuntos
Ligantes , Proteínas/química , Vocabulário Controlado , Mineração de Dados , Protease de HIV/química , Processamento de Linguagem Natural , Ligação Proteica , PubMed , Antagonistas de Receptores Purinérgicos P1/químicaRESUMO
BACKGROUND: For selection and evaluation of potential biomarkers, inclusion of already published information is of utmost importance. In spite of significant advancements in text- and data-mining techniques, the vast knowledge space of biomarkers in biomedical text has remained unexplored. Existing named entity recognition approaches are not sufficiently selective for the retrieval of biomarker information from the literature. The purpose of this study was to identify textual features that enhance the effectiveness of biomarker information retrieval for different indication areas and diverse end user perspectives. METHODS: A biomarker terminology was created and further organized into six concept classes. Performance of this terminology was optimized towards balanced selectivity and specificity. The information retrieval performance using the biomarker terminology was evaluated based on various combinations of the terminology's six classes. Further validation of these results was performed on two independent corpora representing two different neurodegenerative diseases. RESULTS: The current state of the biomarker terminology contains 119 entity classes supported by 1890 different synonyms. The result of information retrieval shows improved retrieval rate of informative abstracts, which is achieved by including clinical management terms and evidence of gene/protein alterations (e.g. gene/protein expression status or certain polymorphisms) in combination with disease and gene name recognition. When additional filtering through other classes (e.g. diagnostic or prognostic methods) is applied, the typical high number of unspecific search results is significantly reduced. The evaluation results suggest that this approach enables the automated identification of biomarker information in the literature. A demo version of the search engine SCAIView, including the biomarker retrieval, is made available to the public through http://www.scaiview.com/scaiview-academia.html. CONCLUSIONS: The approach presented in this paper demonstrates that using a dedicated biomarker terminology for automated analysis of the scientific literature maybe helpful as an aid to finding biomarker information in text. Successful extraction of candidate biomarkers information from published resources can be considered as the first step towards developing novel hypotheses. These hypotheses will be valuable for the early decision-making in the drug discovery and development process.
Assuntos
Biomarcadores , Mineração de Dados , Terminologia como Assunto , Algoritmos , Humanos , Ferramenta de BuscaRESUMO
INTRODUCTION: Ontology-based annotation of evidence, using disease-specific ontologies, can accelerate analysis and interpretation of the knowledge domain of diseases. Although many domain-specific disease ontologies have been developed so far, in the area of cardiovascular diseases, there is a lack of ontological representation of the disease knowledge domain of stroke. METHODS: The stroke ontology (STO) was created on the basis of the ontology development life cycle and was built using Protégé ontology editor in the ontology web language format. The ontology was evaluated in terms of structural and functional features, expert evaluation, and competency questions. RESULTS: The stroke ontology covers a broad range of major biomedical and risk factor concepts. The majority of concepts are enriched by synonyms, definitions, and references. The ontology attempts to incorporate different users' views on the stroke domain such as neuroscientists, molecular biologists, and clinicians. Evaluation of the ontology based on natural language processing showed a high precision (0.94), recall (0.80), and F-score (0.78) values, indicating that STO has an acceptable coverage of the stroke knowledge domain. Performance evaluation using competency questions designed by a clinician showed that the ontology can be used to answer expert questions in light of published evidence. CONCLUSIONS: The stroke ontology is the first, multiple-view ontology in the domain of brain stroke that can be used as a tool for representation, formalization, and standardization of the heterogeneous data related to the stroke domain. Since this is a draft version of the ontology, the contribution of the stroke scientific community can help to improve the usability of the current version.
RESUMO
BACKGROUND: Stroke is a leading cause of death and disability worldwide. According to the Iranian Ministry of Medical Health and Education, out of 100,000 stroke incidents in the country, 25,000 lead to death. Thus, identifying risk factors of stroke can help healthcare providers to establish prevention strategies. This study was conducted to investigate the prevalence of stroke risk factors and their distribution based on stroke subtypes in Sayad Shirazi Hospital, Gorgan, Northeastern Iran. MATERIAL AND METHODS: A retrospective hospital-based study was conducted at Sayad Shirazi Hospital in Gorgan, the only referral university hospital for stroke patients in Gorgan city. All medical records with a diagnosis of stroke were identified based on the International Classification of Diseases, Revision 10, from August 23, 2015, to August 22, 2016. A valid and reliable data gathering form was used to capture data about demographics, diagnostics, lifestyle, risk factors, and medical history. RESULTS: Out of 375 cases, two-thirds were marked with ischemic stroke with mean ages (standard deviation) of 66.4 (14.2) for men and 64.6 (14.2) for women. The relationship between stroke subtypes and age groups (P=0.008) and hospital outcome (P=0.0001) was significant. Multiple regression analysis showed that hypertension (Exp. (B) =1.755, P=0.037), diabetes mellitus (Exp. (B) =0.532, P=0.021), and dyslipidemia (Exp. (B) =2.325, P=0.004) significantly increased the risk of ischemic stroke. CONCLUSION: Overall, hypertension, diabetes mellitus, and dyslipidemia were the major risk factors of stroke in Gorgan. Establishment of stroke registry (population- or hospital-based) for the province is recommended.
RESUMO
The Precision Neurology development process implements systems theory with system biology and neurophysiology in a parallel, bidirectional research path: a combined hypothesis-driven investigation of systems dysfunction within distinct molecular, cellular, and large-scale neural network systems in both animal models as well as through tests for the usefulness of these candidate dynamic systems biomarkers in different diseases and subgroups at different stages of pathophysiological progression. This translational research path is paralleled by an "omics"-based, hypothesis-free, exploratory research pathway, which will collect multimodal data from progressing asymptomatic, preclinical, and clinical neurodegenerative disease (ND) populations, within the wide continuous biological and clinical spectrum of ND, applying high-throughput and high-content technologies combined with powerful computational and statistical modeling tools, aimed at identifying novel dysfunctional systems and predictive marker signatures associated with ND. The goals are to identify common biological denominators or differentiating classifiers across the continuum of ND during detectable stages of pathophysiological progression, characterize systems-based intermediate endophenotypes, validate multi-modal novel diagnostic systems biomarkers, and advance clinical intervention trial designs by utilizing systems-based intermediate endophenotypes and candidate surrogate markers. Achieving these goals is key to the ultimate development of early and effective individualized treatment of ND, such as Alzheimer's disease. The Alzheimer Precision Medicine Initiative (APMI) and cohort program (APMI-CP), as well as the Paris based core of the Sorbonne University Clinical Research Group "Alzheimer Precision Medicine" (GRC-APM) were recently launched to facilitate the passageway from conventional clinical diagnostic and drug development toward breakthrough innovation based on the investigation of the comprehensive biological nature of aging individuals. The APMI movement is gaining momentum to systematically apply both systems neurophysiology and systems biology in exploratory translational neuroscience research on ND.
Assuntos
Doença de Alzheimer/diagnóstico , Doença de Alzheimer/terapia , Medicina de Precisão , Animais , Encéfalo/diagnóstico por imagem , Humanos , Neurologia , Neurofisiologia , Biologia de Sistemas , Pesquisa Translacional BiomédicaRESUMO
Ontologies and terminologies are used for interoperability of knowledge and data in a standard manner among interdisciplinary research groups. Existing imaging ontologies capture general aspects of the imaging domain as a whole such as methodological concepts or calibrations of imaging instruments. However, none of the existing ontologies covers the diagnostic features measured by imaging technologies in the context of neurodegenerative diseases. Therefore, the Neuro-Imaging Feature Terminology (NIFT) was developed to organize the knowledge domain of measured brain features in association with neurodegenerative diseases by imaging technologies. The purpose is to identify quantitative imaging biomarkers that can be extracted from multi-modal brain imaging data. This terminology attempts to cover measured features and parameters in brain scans relevant to disease progression. In this paper, we demonstrate the systematic retrieval of measured indices from literature and how the extracted knowledge can be further used for disease modeling that integrates neuroimaging features with molecular processes.
Assuntos
Encéfalo/diagnóstico por imagem , Neuroimagem/métodos , Algoritmos , Doença de Alzheimer/diagnóstico por imagem , Disfunção Cognitiva/diagnóstico por imagem , Biologia Computacional , Bases de Dados Bibliográficas , Humanos , Processamento de Imagem Assistida por Computador , Processamento de Linguagem Natural , Terminologia como AssuntoRESUMO
Perturbance in inflammatory pathways have been identified as one of the major factors which leads to neurodegenerative diseases (NDD). Owing to the limited access of human brain tissues and the immense complexity of the brain, animal models, specifically mouse models, play a key role in advancing the NDD field. However, many of these mouse models fail to reproduce the clinical manifestations and end points of the disease. NDD drugs, which passed the efficacy test in mice, were repeatedly not successful in clinical trials. There are numerous studies which are supporting and opposing the applicability of mouse models in neuroinflammation and NDD. In this paper, we assessed to what extend a mouse can mimic the cellular and molecular interactions in humans at a mechanism level. Based on our mechanistic modeling approach, we investigate the failure of a neuroinflammation targeted drug in the late phases of clinical trials based on the comparative analyses between the two species.
Assuntos
Citocinas/metabolismo , Modelos Animais de Doenças , Encefalite/etiologia , Doenças Neurodegenerativas/complicações , Transdução de Sinais/fisiologia , Animais , Encéfalo/metabolismo , Encéfalo/patologia , Encefalite/genética , Humanos , Camundongos , Doenças Neurodegenerativas/genética , Transdução de Sinais/genética , Especificidade da EspécieRESUMO
Neurodegenerative diseases including Alzheimer's disease are complex to tackle because of the complexity of the brain, both in structure and function. Such complexity is reflected by the involvement of various brain regions and multiple pathways in the etiology of neurodegenerative diseases that render single drug target approaches ineffective. Particularly in the area of neurodegeneration, attention has been drawn to repurposing existing drugs with proven efficacy and safety profiles. However, there is a lack of systematic analysis of the brain chemical space to predict the feasibility of repurposing strategies. Using a mechanism-based, drug-target interaction modeling approach, we have identified promising drug candidates for repositioning. Mechanistic cause-and-effect models consolidate relevant prior knowledge on drugs, targets, and pathways from the scientific literature and integrate insights derived from experimental data. We demonstrate the power of this approach by predicting two repositioning candidates for Alzheimer's disease and one for amyotrophic lateral sclerosis.
Assuntos
Doença de Alzheimer/tratamento farmacológico , Esclerose Lateral Amiotrófica/tratamento farmacológico , Reposicionamento de Medicamentos , Fármacos Neuroprotetores/farmacologia , Fármacos Neuroprotetores/uso terapêutico , Doença de Alzheimer/metabolismo , Esclerose Lateral Amiotrófica/metabolismo , Biologia Computacional , Simulação por Computador , Ciclosporina/farmacologia , Ciclosporina/uso terapêutico , Donepezila , Reposicionamento de Medicamentos/métodos , Humanos , Indanos/farmacologia , Indanos/uso terapêutico , Sondas Moleculares , Piperidinas/farmacologia , Piperidinas/uso terapêutico , Riluzol/farmacologia , Riluzol/uso terapêutico , Relação Estrutura-AtividadeRESUMO
Molecular signaling pathways have been long used to demonstrate interactions among upstream causal molecules and downstream biological effects. They show the signal flow between cell compartments, the majority of which are represented as cartoons. These are often drawn manually by scanning through the literature, which is time-consuming, static, and non-interoperable. Moreover, these pathways are often devoid of context (condition and tissue) and biased toward certain disease conditions. Mining the scientific literature creates new possibilities to retrieve pathway information at higher contextual resolution and specificity. To address this challenge, we have created a pathway terminology system by combining signaling pathways and biological events to ensure a broad coverage of the entire pathway knowledge domain. This terminology was applied to mining biomedical papers and patents about neurodegenerative diseases with focus on Alzheimer's disease. We demonstrate the power of our approach by mapping literature-derived signaling pathways onto their corresponding anatomical regions in the human brain under healthy and Alzheimer's disease states. We demonstrate how this knowledge resource can be used to identify a putative mechanism explaining the mode-of-action of the approved drug Rasagiline, and show how this resource can be used for fingerprinting patents to support the discovery of pathway knowledge for Alzheimer's disease. Finally, we propose that based on next-generation cause-and-effect pathway models, a dedicated inventory of computer-processable pathway models specific to neurodegenerative diseases can be established, which hopefully accelerates context-specific enrichment analysis of experimental data with higher resolution and richer annotations.
Assuntos
Encéfalo/metabolismo , Modelos Neurológicos , Doenças Neurodegenerativas/metabolismo , Transdução de Sinais/fisiologia , Encéfalo/efeitos dos fármacos , Encéfalo/fisiopatologia , Bases de Dados Factuais , Humanos , Redes e Vias Metabólicas/fisiologia , Doenças Neurodegenerativas/fisiopatologia , Terminologia como AssuntoRESUMO
Protein interaction networks are widely used in computational biology as a graphical means of representing higher-level systemic functions in a computable form. Although, many algorithms exist that seamlessly collect and measure protein interaction information in network models, they often do not provide novel mechanistic insights using quantitative criteria. Measuring information content and knowledge representation in network models about disease mechanisms becomes crucial particularly when exploring new target candidates in a well-defined functional context of a potential disease mechanism. To this end, we have developed a knowledge-based scoring approach that uses literature-derived protein interaction features to quantify protein interaction confidence. Thereby, we introduce the novel concept of knowledge cliffs, regions of the interaction network where a significant gap between high scoring and low scoring interactions is observed, representing a divide between established and emerging knowledge on disease mechanism. To show the application of this approach, we constructed and assessed reliability of a protein-protein interaction model specific to Alzheimer's disease, which led to screening, and prioritization of four novel protein candidates. Evaluation of the identified candidates showed that two of them are already followed in clinical trials for testing potential AD drugs.
Assuntos
Doença de Alzheimer/metabolismo , Mapeamento de Interação de Proteínas , Mapas de Interação de Proteínas , Biologia Computacional/métodos , Bases de Dados Genéticas , Humanos , Modelos Biológicos , Reprodutibilidade dos TestesRESUMO
BACKGROUND: A number of compelling candidate Alzheimer's biomarkers remain buried within the literature. Indeed, there should be a systematic effort towards gathering this information through approaches that mine publicly available data and substantiate supporting evidence through disease modeling methods. In the presented work, we demonstrate that an integrative gray zone mining approach can be used as a way to tackle this challenge successfully. METHODS: The methodology presented in this work combines semantic information retrieval and experimental data through context-specific modeling of molecular interactions underlying stages in Alzheimer's disease (AD). Information about putative, highly speculative AD biomarkers was harvested from the literature using a semantic framework and was put into a functional context through disease- and stage-specific models. Staging models of AD were further validated for their functional relevance and novel biomarker candidates were predicted at the mechanistic level. RESULTS: Three interaction models were built representing three stages of AD, namely mild, moderate, and severe stages. Integrated analysis of these models using various arrays of evidence gathered from experimental data and published knowledge resources led to identification of four candidate biomarkers in the mild stage. Mode of action of these candidates was further reasoned in the mechanistic context of models by chains of arguments. Accordingly, we propose that some of these 'emerging' potential biomarker candidates have a reasonable mechanistic explanation and deserve to be investigated in more detail. CONCLUSIONS: Systematic exploration of derived hypothetical knowledge leads to generation of a coherent overview on emerging knowledge niches. Integrative analysis of this knowledge in the context of disease mechanism is a promising approach towards identification of candidate biomarkers taking into consideration the complex etiology of disease. The added value of this strategy becomes apparent particularly in the area of biomarker discovery for neurodegenerative diseases where predictive biomarkers are desperately needed.
RESUMO
BACKGROUND: In the past years, significant progress has been made to develop and use experimental settings for extensive data collection on tobacco smoke exposure and tobacco smoke exposure-associated diseases. Due to the growing number of such data, there is a need for domain-specific standard ontologies to facilitate the integration of tobacco exposure data. RESULTS: The CSEO (version 1.0) is composed of 20091 concepts. The ontology in its current form is able to capture a wide range of cigarette smoke exposure concepts within the knowledge domain of exposure science with a reasonable sensitivity and specificity. Moreover, it showed a promising performance when used to answer domain expert questions. The CSEO complies with standard upper-level ontologies and is freely accessible to the scientific community through a dedicated wiki at https://publicwiki-01.fraunhofer.de/CSEO-Wiki/index.php/Main_Page. CONCLUSIONS: The CSEO has potential to become a widely used standard within the academic and industrial community. Mainly because of the emerging need of systems toxicology to controlled vocabularies and also the lack of suitable ontologies for this domain, the CSEO prepares the ground for integrative systems-based research in the exposure science.
RESUMO
The advent of state-of-the-art brain imaging technologies in recent years and the ability of such technologies to provide high-resolution information at both structural and functional levels has spawned large efforts to introduce novel non-invasive imaging biomarkers for early prediction and diagnosis of brain disorders; however, their utility in both clinic and drug development at their best resolution remains limited to visualizing and monitoring disease progression. Given the fact that efficient translation of valuable information embedded in brain scans into clinical application is of paramount scientific and public health importance, a strategy is needed to bridge the current gap between imaging and molecular biology, particularly in neurodegenerative diseases. As an attempt to address this issue, we present a novel computational method to link readouts of imaging biomarkers to their underlying molecular pathways with the aim of guiding clinical diagnosis, prognosis and even target identification in drug discovery for Alzheimer's disease.