Your browser doesn't support javascript.
loading
Development of machine learning model for diagnostic disease prediction based on laboratory tests.
Park, Dong Jin; Park, Min Woo; Lee, Homin; Kim, Young-Jin; Kim, Yeongsic; Park, Young Hoon.
Afiliação
  • Park DJ; Department of Laboratory Medicine, College of Medicine, Ewha Womans University of Korea, Seoul, South Korea.
  • Park MW; Department of Laboratory Medicine, St. Vincent's Hospital, The Catholic University of Korea, Seoul, South Korea.
  • Lee H; Department of Research, Future Lab, Seoul, South Korea.
  • Kim YJ; Finance, Fishery, Manufacture Industrial Mathematics Center on Big Data, Pusan National University, Pusan, South Korea.
  • Kim Y; Department of Laboratory Medicine, College of Medicine, The Catholic University of Korea, Seoul, South Korea.
  • Park YH; Division of Hematology, Department of Internal Medicine, College of Medicine, The Catholic University of Korea, Seoul, South Korea. carrox2yh@gmail.com.
Sci Rep ; 11(1): 7567, 2021 04 07.
Article em En | MEDLINE | ID: mdl-33828178
ABSTRACT
The use of deep learning and machine learning (ML) in medical science is increasing, particularly in the visual, audio, and language data fields. We aimed to build a new optimized ensemble model by blending a DNN (deep neural network) model with two ML models for disease prediction using laboratory test results. 86 attributes (laboratory tests) were selected from datasets based on value counts, clinical importance-related features, and missing values. We collected sample datasets on 5145 cases, including 326,686 laboratory test results. We investigated a total of 39 specific diseases based on the International Classification of Diseases, 10th revision (ICD-10) codes. These datasets were used to construct light gradient boosting machine (LightGBM) and extreme gradient boosting (XGBoost) ML models and a DNN model using TensorFlow. The optimized ensemble model achieved an F1-score of 81% and prediction accuracy of 92% for the five most common diseases. The deep learning and ML models showed differences in predictive power and disease classification patterns. We used a confusion matrix and analyzed feature importance using the SHAP value method. Our new ML model achieved high efficiency of disease prediction through classification of diseases. This study will be useful in the prediction and diagnosis of diseases.
Assuntos

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Diagnóstico por Computador / Técnicas de Laboratório Clínico / Aprendizado de Máquina Tipo de estudo: Diagnostic_studies / Prognostic_studies / Risk_factors_studies Limite: Humans Idioma: En Ano de publicação: 2021 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Diagnóstico por Computador / Técnicas de Laboratório Clínico / Aprendizado de Máquina Tipo de estudo: Diagnostic_studies / Prognostic_studies / Risk_factors_studies Limite: Humans Idioma: En Ano de publicação: 2021 Tipo de documento: Article