Your browser doesn't support javascript.
loading
BioM2: biologically informed multi-stage machine learning for phenotype prediction using omics data.
Zhang, Shunjie; Li, Pan; Wang, Shenghan; Zhu, Jijun; Huang, Zhongting; Cai, Fuqiang; Freidel, Sebastian; Ling, Fei; Schwarz, Emanuel; Chen, Junfang.
Afiliação
  • Zhang S; School of Biology and Biological Engineering, South China University of Technology, Guangzhou, China.
  • Li P; Center for Intelligent Medicine, Greater Bay Area Institute of Precision Medicine (Guangzhou), School of Life Sciences, Fudan University, No. 6, 2nd Nanjiang Road, Nansha District, 511462 Guangzhou, China.
  • Wang S; Center for Intelligent Medicine, Greater Bay Area Institute of Precision Medicine (Guangzhou), School of Life Sciences, Fudan University, No. 6, 2nd Nanjiang Road, Nansha District, 511462 Guangzhou, China.
  • Zhu J; Center for Intelligent Medicine, Greater Bay Area Institute of Precision Medicine (Guangzhou), School of Life Sciences, Fudan University, No. 6, 2nd Nanjiang Road, Nansha District, 511462 Guangzhou, China.
  • Huang Z; Center for Intelligent Medicine, Greater Bay Area Institute of Precision Medicine (Guangzhou), School of Life Sciences, Fudan University, No. 6, 2nd Nanjiang Road, Nansha District, 511462 Guangzhou, China.
  • Cai F; School of Biology and Biological Engineering, South China University of Technology, Guangzhou, China.
  • Freidel S; Hector Institute for Artificial Intelligence in Psychiatry, Central Institute of Mental Health, Medical Faculty Mannheim, Heidelberg University, M7, Mannheim 68161, Germany.
  • Ling F; Department of Psychiatry and Psychotherapy, Central Institute of Mental Health, Medical Faculty Mannheim, Heidelberg University, J5, Mannheim 68159, Germany.
  • Schwarz E; School of Biology and Biological Engineering, South China University of Technology, Guangzhou, China.
  • Chen J; Hector Institute for Artificial Intelligence in Psychiatry, Central Institute of Mental Health, Medical Faculty Mannheim, Heidelberg University, M7, Mannheim 68161, Germany.
Brief Bioinform ; 25(5)2024 Jul 25.
Article em En | MEDLINE | ID: mdl-39126426
ABSTRACT
Navigating the complex landscape of high-dimensional omics data with machine learning models presents a significant challenge. The integration of biological domain knowledge into these models has shown promise in creating more meaningful stratifications of predictor variables, leading to algorithms that are both more accurate and generalizable. However, the wider availability of machine learning tools capable of incorporating such biological knowledge remains limited. Addressing this gap, we introduce BioM2, a novel R package designed for biologically informed multistage machine learning. BioM2 uniquely leverages biological information to effectively stratify and aggregate high-dimensional biological data in the context of machine learning. Demonstrating its utility with genome-wide DNA methylation and transcriptome-wide gene expression data, BioM2 has shown to enhance predictive performance, surpassing traditional machine learning models that operate without the integration of biological knowledge. A key feature of BioM2 is its ability to rank predictor variables within biological categories, specifically Gene Ontology pathways. This functionality not only aids in the interpretability of the results but also enables a subsequent modular network analysis of these variables, shedding light on the intricate systems-level biology underpinning the predictive outcome. We have proposed a biologically informed multistage machine learning framework termed BioM2 for phenotype prediction based on omics data. BioM2 has been incorporated into the BioM2 CRAN package (https//cran.r-project.org/web/packages/BioM2/index.html).
Assuntos
Palavras-chave

Texto completo: 1 Bases de dados: MEDLINE Assunto principal: Fenótipo / Aprendizado de Máquina Limite: Humans Idioma: En Revista: Brief Bioinform Assunto da revista: BIOLOGIA / INFORMATICA MEDICA Ano de publicação: 2024 Tipo de documento: Article País de afiliação: China

Texto completo: 1 Bases de dados: MEDLINE Assunto principal: Fenótipo / Aprendizado de Máquina Limite: Humans Idioma: En Revista: Brief Bioinform Assunto da revista: BIOLOGIA / INFORMATICA MEDICA Ano de publicação: 2024 Tipo de documento: Article País de afiliação: China