Your browser doesn't support javascript.
loading
Optimizing cardiovascular disease mortality prediction: a super learner approach in the tehran lipid and glucose study.
Darabi, Parvaneh; Gharibzadeh, Safoora; Khalili, Davood; Bagherpour-Kalo, Mehrdad; Janani, Leila.
Affiliation
  • Darabi P; Department of Biostatistics, School of Public Health, Iran University of Medical Sciences, Tehran, Iran.
  • Gharibzadeh S; Department of Epidemiology and Biostatistics, Pasteur Institute of Iran, Tehran, Iran. sgh18@leicester.ac.uk.
  • Khalili D; Prevention of Metabolic Disorders Research Center, Research Institute for Endocrine Sciences, Shahid Beheshti University of Medical Sciences, Tehran, Iran.
  • Bagherpour-Kalo M; Department of Epidemiology and Biostatistics, School of Public health, Tehran University of Medical Sciences, Tehran, Iran.
  • Janani L; Department of Biostatistics, School of Public Health, Iran University of Medical Sciences, Tehran, Iran. l.janani@imperial.ac.uk.
BMC Med Inform Decis Mak ; 24(1): 97, 2024 Apr 16.
Article in En | MEDLINE | ID: mdl-38627734
ABSTRACT
BACKGROUND &

AIM:

Cardiovascular disease (CVD) is the most important cause of death in the world and has a potential impact on health care costs, this study aimed to evaluate the performance of machine learning survival models and determine the optimum model for predicting CVD-related mortality.

METHOD:

In this study, the research population was all participants in Tehran Lipid and Glucose Study (TLGS) aged over 30 years. We used the Gradient Boosting model (GBM), Support Vector Machine (SVM), Super Learner (SL), and Cox proportional hazard (Cox-PH) models to predict the CVD-related mortality using 26 features. The dataset was randomly divided into training (80%) and testing (20%). To evaluate the performance of the methods, we used the Brier Score (BS), Prediction Error (PE), Concordance Index (C-index), and time-dependent Area Under the Curve (TD-AUC) criteria. Four different clinical models were also performed to improve the performance of the methods.

RESULTS:

Out of 9258 participants with a mean age of (SD; range) 43.74 (15.51; 20-91), 56.60% were female. The CVD death proportion was 2.5% (228 participants). The death proportion was significantly higher in men (67.98% M, 32.02% F). Based on predefined selection criteria, the SL method has the best performance in predicting CVD-related mortality (TD-AUC > 93.50%). Among the machine learning (ML) methods, The SVM has the worst performance (TD-AUC = 90.13%). According to the relative effect, age, fasting blood sugar, systolic blood pressure, smoking, taking aspirin, diastolic blood pressure, Type 2 diabetes mellitus, hip circumference, body mss index (BMI), and triglyceride were identified as the most influential variables in predicting CVD-related mortality.

CONCLUSION:

According to the results of our study, compared to the Cox-PH model, Machine Learning models showed promising and sometimes better performance in predicting CVD-related mortality. This finding is based on the analysis of a large and diverse urban population from Tehran, Iran.
Subject(s)
Key words

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Cardiovascular Diseases / Diabetes Mellitus, Type 2 Limits: Adult / Female / Humans / Male Country/Region as subject: Asia Language: En Journal: BMC Med Inform Decis Mak Journal subject: INFORMATICA MEDICA Year: 2024 Document type: Article Affiliation country:

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Cardiovascular Diseases / Diabetes Mellitus, Type 2 Limits: Adult / Female / Humans / Male Country/Region as subject: Asia Language: En Journal: BMC Med Inform Decis Mak Journal subject: INFORMATICA MEDICA Year: 2024 Document type: Article Affiliation country: