Your browser doesn't support javascript.
loading
Pilot-Study to Explore Metabolic Signature of Type 2 Diabetes: A Pipeline of Tree-Based Machine Learning and Bioinformatics Techniques for Biomarkers Discovery.
Yagin, Fatma Hilal; Al-Hashem, Fahaid; Ahmad, Irshad; Ahmad, Fuzail; Alkhateeb, Abedalrhman.
Affiliation
  • Yagin FH; Department of Biostatistics and Medical Informatics, Faculty of Medicine, Inonu University, Malatya 44280, Turkey.
  • Al-Hashem F; Department of Physiology, College of Medicine, King Khalid University, Abha 61421, Saudi Arabia.
  • Ahmad I; Department of Medical Rehabilitation Sciences, College of Applied Medical Sciences, King Khalid University, Abha 61421, Saudi Arabia.
  • Ahmad F; Department of Respiratory Care, College of Applied Sciences, Almaarefa University, Diriya, Riyadh 13713, Saudi Arabia.
  • Alkhateeb A; Department of Computer Science, Lakehead University, Thunder Bay, ON P7B 5E1, Canada.
Nutrients ; 16(10)2024 May 20.
Article de En | MEDLINE | ID: mdl-38794775
ABSTRACT

BACKGROUND:

This study aims to identify unique metabolomics biomarkers associated with Type 2 Diabetes (T2D) and develop an accurate diagnostics model using tree-based machine learning (ML) algorithms integrated with bioinformatics techniques.

METHODS:

Univariate and multivariate analyses such as fold change, a receiver operating characteristic curve (ROC), and Partial Least-Squares Discriminant Analysis (PLS-DA) were used to identify biomarker metabolites that showed significant concentration in T2D patients. Three tree-based algorithms [eXtreme Gradient Boosting (XGBoost), Light Gradient Boosting Machine (LightGBM), and Adaptive Boosting (AdaBoost)] that demonstrated robustness in high-dimensional data analysis were used to create a diagnostic model for T2D.

RESULTS:

As a result of the biomarker discovery process validated with three different approaches, Pyruvate, D-Rhamnose, AMP, pipecolate, Tetradecenoic acid, Tetradecanoic acid, Dodecanediothioic acid, Prostaglandin E3/D3 (isobars), ADP and Hexadecenoic acid were determined as potential biomarkers for T2D. Our results showed that the XGBoost model [accuracy = 0.831, F1-score = 0.845, sensitivity = 0.882, specificity = 0.774, positive predictive value (PPV) = 0.811, negative-PV (NPV) = 0.857 and Area under the ROC curve (AUC) = 0.887] had the slight highest performance measures.

CONCLUSIONS:

ML integrated with bioinformatics techniques offers accurate and positive T2D candidate biomarker discovery. The XGBoost model can successfully distinguish T2D based on metabolites.
Sujet(s)
Mots clés

Texte intégral: 1 Collection: 01-internacional Base de données: MEDLINE Sujet principal: Marqueurs biologiques / Biologie informatique / Diabète de type 2 / Métabolomique / Apprentissage machine Limites: Adult / Aged / Female / Humans / Male / Middle aged Langue: En Journal: Nutrients Année: 2024 Type de document: Article Pays d'affiliation: Turquie

Texte intégral: 1 Collection: 01-internacional Base de données: MEDLINE Sujet principal: Marqueurs biologiques / Biologie informatique / Diabète de type 2 / Métabolomique / Apprentissage machine Limites: Adult / Aged / Female / Humans / Male / Middle aged Langue: En Journal: Nutrients Année: 2024 Type de document: Article Pays d'affiliation: Turquie