Your browser doesn't support javascript.
loading
From heterogeneous healthcare data to disease-specific biomarker networks: A hierarchical Bayesian network approach.
Becker, Ann-Kristin; Dörr, Marcus; Felix, Stephan B; Frost, Fabian; Grabe, Hans J; Lerch, Markus M; Nauck, Matthias; Völker, Uwe; Völzke, Henry; Kaderali, Lars.
Afiliación
  • Becker AK; Institute of Bioinformatics, University Medicine Greifswald, Greifswald, Germany.
  • Dörr M; Department of Internal Medicine B, University Medicine Greifswald, Greifswald, Germany.
  • Felix SB; German Centre for Cardiovascular Research (DZHK), partner site Greifswald, Greifswald, Germany.
  • Frost F; Department of Internal Medicine B, University Medicine Greifswald, Greifswald, Germany.
  • Grabe HJ; German Centre for Cardiovascular Research (DZHK), partner site Greifswald, Greifswald, Germany.
  • Lerch MM; Department of Internal Medicine A, University Medicine Greifswald, Greifswald, Germany.
  • Nauck M; Department of Psychiatry, University Medicine Greifswald, Greifswald, Germany.
  • Völker U; Department of Internal Medicine A, University Medicine Greifswald, Greifswald, Germany.
  • Völzke H; Institute of Clinical Chemistry and Laboratory Medicine, University Medicine Greifswald, Greifswald, Germany.
  • Kaderali L; Interfaculty Institute of Genetics and Functional Genomics, Department of Functional Genomics, University Medicine Greifswald, Greifswald, Germany.
PLoS Comput Biol ; 17(2): e1008735, 2021 02.
Article en En | MEDLINE | ID: mdl-33577591
ABSTRACT
In this work, we introduce an entirely data-driven and automated approach to reveal disease-associated biomarker and risk factor networks from heterogeneous and high-dimensional healthcare data. Our workflow is based on Bayesian networks, which are a popular tool for analyzing the interplay of biomarkers. Usually, data require extensive manual preprocessing and dimension reduction to allow for effective learning of Bayesian networks. For heterogeneous data, this preprocessing is hard to automatize and typically requires domain-specific prior knowledge. We here combine Bayesian network learning with hierarchical variable clustering in order to detect groups of similar features and learn interactions between them entirely automated. We present an optimization algorithm for the adaptive refinement of such group Bayesian networks to account for a specific target variable, like a disease. The combination of Bayesian networks, clustering, and refinement yields low-dimensional but disease-specific interaction networks. These networks provide easily interpretable, yet accurate models of biomarker interdependencies. We test our method extensively on simulated data, as well as on data from the Study of Health in Pomerania (SHIP-TREND), and demonstrate its effectiveness using non-alcoholic fatty liver disease and hypertension as examples. We show that the group network models outperform available biomarker scores, while at the same time, they provide an easily interpretable interaction network.
Asunto(s)

Texto completo: 1 Banco de datos: MEDLINE Asunto principal: Informática Médica / Biomarcadores / Enfermedad Tipo de estudio: Prognostic_studies / Risk_factors_studies Límite: Humans Idioma: En Revista: PLoS Comput Biol Asunto de la revista: BIOLOGIA / INFORMATICA MEDICA Año: 2021 Tipo del documento: Article País de afiliación: Alemania

Texto completo: 1 Banco de datos: MEDLINE Asunto principal: Informática Médica / Biomarcadores / Enfermedad Tipo de estudio: Prognostic_studies / Risk_factors_studies Límite: Humans Idioma: En Revista: PLoS Comput Biol Asunto de la revista: BIOLOGIA / INFORMATICA MEDICA Año: 2021 Tipo del documento: Article País de afiliación: Alemania