Your browser doesn't support javascript.
loading
Identifying and ranking potential driver genes of Alzheimer's disease using multiview evidence aggregation.
Mukherjee, Sumit; Perumal, Thanneer M; Daily, Kenneth; Sieberts, Solveig K; Omberg, Larsson; Preuss, Christoph; Carter, Gregory W; Mangravite, Lara M; Logsdon, Benjamin A.
Afiliação
  • Mukherjee S; Sage Bionetworks, Seattle, WA, USA.
  • Perumal TM; Sage Bionetworks, Seattle, WA, USA.
  • Daily K; Sage Bionetworks, Seattle, WA, USA.
  • Sieberts SK; Sage Bionetworks, Seattle, WA, USA.
  • Omberg L; Sage Bionetworks, Seattle, WA, USA.
  • Preuss C; The Jackson Laboratory for Mammalian Genetics, Bar Harbor, ME, USA.
  • Carter GW; The Jackson Laboratory for Mammalian Genetics, Bar Harbor, ME, USA.
  • Mangravite LM; Sage Bionetworks, Seattle, WA, USA.
  • Logsdon BA; Sage Bionetworks, Seattle, WA, USA.
Bioinformatics ; 35(14): i568-i576, 2019 07 15.
Article em En | MEDLINE | ID: mdl-31510680
ABSTRACT
MOTIVATION Late onset Alzheimer's disease is currently a disease with no known effective treatment options. To better understand disease, new multi-omic data-sets have recently been generated with the goal of identifying molecular causes of disease. However, most analytic studies using these datasets focus on uni-modal analysis of the data. Here, we propose a data driven approach to integrate multiple data types and analytic outcomes to aggregate evidences to support the hypothesis that a gene is a genetic driver of the disease. The main algorithmic contributions of our article are (i) a general machine learning framework to learn the key characteristics of a few known driver genes from multiple feature sets and identifying other potential driver genes which have similar feature representations, and (ii) A flexible ranking scheme with the ability to integrate external validation in the form of Genome Wide Association Study summary statistics. While we currently focus on demonstrating the effectiveness of the approach using different analytic outcomes from RNA-Seq studies, this method is easily generalizable to other data modalities and analysis types.

RESULTS:

We demonstrate the utility of our machine learning algorithm on two benchmark multiview datasets by significantly outperforming the baseline approaches in predicting missing labels. We then use the algorithm to predict and rank potential drivers of Alzheimer's. We show that our ranked genes show a significant enrichment for single nucleotide polymorphisms associated with Alzheimer's and are enriched in pathways that have been previously associated with the disease. AVAILABILITY AND IMPLEMENTATION Source code and link to all feature sets is available at https//github.com/Sage-Bionetworks/EvidenceAggregatedDriverRanking.
Assuntos

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Algoritmos / Estudo de Associação Genômica Ampla / Doença de Alzheimer Tipo de estudo: Prognostic_studies Limite: Humans Idioma: En Revista: Bioinformatics Assunto da revista: INFORMATICA MEDICA Ano de publicação: 2019 Tipo de documento: Article País de afiliação: Estados Unidos

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Algoritmos / Estudo de Associação Genômica Ampla / Doença de Alzheimer Tipo de estudo: Prognostic_studies Limite: Humans Idioma: En Revista: Bioinformatics Assunto da revista: INFORMATICA MEDICA Ano de publicação: 2019 Tipo de documento: Article País de afiliação: Estados Unidos