Your browser doesn't support javascript.
loading
Improving cardiovascular risk prediction through machine learning modelling of irregularly repeated electronic health records.
Li, Chaiquan; Liu, Xiaofei; Shen, Peng; Sun, Yexiang; Zhou, Tianjing; Chen, Weiye; Chen, Qi; Lin, Hongbo; Tang, Xun; Gao, Pei.
Afiliação
  • Li C; Department of Epidemiology and Biostatistics, School of Public Health, Peking University Health Science Center, No. 38 Xueyuan Road, Haidian District, 100191 Beijing, China.
  • Liu X; Department of Epidemiology and Biostatistics, School of Public Health, Peking University Health Science Center, No. 38 Xueyuan Road, Haidian District, 100191 Beijing, China.
  • Shen P; Yinzhou District Center for Disease Control and Prevention, No. 1221 Xueshi Road, Yinzhou District, 315199 Ningbo, China.
  • Sun Y; Yinzhou District Center for Disease Control and Prevention, No. 1221 Xueshi Road, Yinzhou District, 315199 Ningbo, China.
  • Zhou T; Department of Epidemiology and Biostatistics, School of Public Health, Peking University Health Science Center, No. 38 Xueyuan Road, Haidian District, 100191 Beijing, China.
  • Chen W; Department of Epidemiology and Biostatistics, School of Public Health, Peking University Health Science Center, No. 38 Xueyuan Road, Haidian District, 100191 Beijing, China.
  • Chen Q; Yinzhou District Center for Disease Control and Prevention, No. 1221 Xueshi Road, Yinzhou District, 315199 Ningbo, China.
  • Lin H; Yinzhou District Center for Disease Control and Prevention, No. 1221 Xueshi Road, Yinzhou District, 315199 Ningbo, China.
  • Tang X; Department of Epidemiology and Biostatistics, School of Public Health, Peking University Health Science Center, No. 38 Xueyuan Road, Haidian District, 100191 Beijing, China.
  • Gao P; Key Laboratory of Epidemiology of Major Diseases, Peking University, Ministry of Education, No. 38 Xueyuan Road, Haidian District, 100191 Beijing, China.
Eur Heart J Digit Health ; 5(1): 30-40, 2024 Jan.
Article em En | MEDLINE | ID: mdl-38264696
ABSTRACT

Aims:

Existing electronic health records (EHRs) often consist of abundant but irregular longitudinal measurements of risk factors. In this study, we aim to leverage such data to improve the risk prediction of atherosclerotic cardiovascular disease (ASCVD) by applying machine learning (ML) algorithms, which can allow automatic screening of the population. Methods and

results:

A total of 215 744 Chinese adults aged between 40 and 79 without a history of cardiovascular disease were included (6081 cases) from an EHR-based longitudinal cohort study. To allow interpretability of the model, the predictors of demographic characteristics, medication treatment, and repeatedly measured records of lipids, glycaemia, obesity, blood pressure, and renal function were used. The primary outcome was ASCVD, defined as non-fatal acute myocardial infarction, coronary heart disease death, or fatal and non-fatal stroke. The eXtreme Gradient boosting (XGBoost) algorithm and Least Absolute Shrinkage and Selection Operator (LASSO) regression models were derived to predict the 5-year ASCVD risk. In the validation set, compared with the refitted Chinese guideline-recommended Cox model (i.e. the China-PAR), the XGBoost model had a significantly higher C-statistic of 0.792, (the differences in the C-statistics 0.011, 0.006-0.017, P < 0.001), with similar results reported for LASSO regression (the differences in the C-statistics 0.008, 0.005-0.011, P < 0.001). The XGBoost model demonstrated the best calibration performance (men Dx = 0.598, P = 0.75; women Dx = 1.867, P = 0.08). Moreover, the risk distribution of the ML algorithms differed from that of the conventional model. The net reclassification improvement rates of XGBoost and LASSO over the Cox model were 3.9% (1.4-6.4%) and 2.8% (0.7-4.9%), respectively.

Conclusion:

Machine learning algorithms with irregular, repeated real-world data could improve cardiovascular risk prediction. They demonstrated significantly better performance for reclassification to identify the high-risk population correctly.
Palavras-chave

Texto completo: 1 Base de dados: MEDLINE Tipo de estudo: Etiology_studies / Guideline / Observational_studies / Prognostic_studies / Risk_factors_studies Idioma: En Ano de publicação: 2024 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Tipo de estudo: Etiology_studies / Guideline / Observational_studies / Prognostic_studies / Risk_factors_studies Idioma: En Ano de publicação: 2024 Tipo de documento: Article