Your browser doesn't support javascript.
loading
Machine learning-based prediction of diabetic patients using blood routine data.
Li, Honghao; Su, Dongqing; Zhang, Xinpeng; He, Yuanyuan; Luo, Xu; Xiong, Yuqiang; Zou, Min; Wei, Huiyan; Wen, Shaoran; Xi, Qilemuge; Zuo, Yongchun; Yang, Lei.
Afiliación
  • Li H; College of Bioinformatics Science and Technology, Harbin Medical University, Harbin 150081, China.
  • Su D; College of Bioinformatics Science and Technology, Harbin Medical University, Harbin 150081, China.
  • Zhang X; College of Bioinformatics Science and Technology, Harbin Medical University, Harbin 150081, China.
  • He Y; College of Bioinformatics Science and Technology, Harbin Medical University, Harbin 150081, China.
  • Luo X; College of Bioinformatics Science and Technology, Harbin Medical University, Harbin 150081, China.
  • Xiong Y; College of Bioinformatics Science and Technology, Harbin Medical University, Harbin 150081, China.
  • Zou M; College of Bioinformatics Science and Technology, Harbin Medical University, Harbin 150081, China.
  • Wei H; Biotechnology Experimental Center, Harbin Medical University, Harbin 150081, China.
  • Wen S; The State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, College of Life Sciences, Inner Mongolia University, Hohhot 010070, China.
  • Xi Q; The State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, College of Life Sciences, Inner Mongolia University, Hohhot 010070, China.
  • Zuo Y; The State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, College of Life Sciences, Inner Mongolia University, Hohhot 010070, China; Inner Mongolia International Mongolian Hospital, Hohhot 010065, China. Electronic address: yczuo@imu.edu.cn.
  • Yang L; College of Bioinformatics Science and Technology, Harbin Medical University, Harbin 150081, China. Electronic address: leiyang@hrbmu.edu.cn.
Methods ; 229: 156-162, 2024 Sep.
Article en En | MEDLINE | ID: mdl-39019099
ABSTRACT
Diabetes stands as one of the most prevalent chronic diseases globally. The conventional methods for diagnosing diabetes are frequently overlooked until individuals manifest noticeable symptoms of the condition. This study aimed to address this gap by collecting comprehensive datasets, including 1000 instances of blood routine data from diabetes patients and an equivalent dataset from healthy individuals. To differentiate diabetes patients from their healthy counterparts, a computational framework was established, encompassing eXtreme Gradient Boosting (XGBoost), random forest, support vector machine, and elastic net algorithms. Notably, the XGBoost model emerged as the most effective, exhibiting superior predictive results with an area under the receiver operating characteristic curve (AUC) of 99.90% in the training set and 98.51% in the testing set. Moreover, the model showcased commendable performance during external validation, achieving an overall accuracy of 81.54%. The probability generated by the model serves as a risk score for diabetes susceptibility. Further interpretability was achieved through the utilization of the Shapley additive explanations (SHAP) algorithm, identifying pivotal indicators such as mean corpuscular hemoglobin concentration (MCHC), lymphocyte ratio (LY%), standard deviation of red blood cell distribution width (RDW-SD), and mean corpuscular hemoglobin (MCH). This enhances our understanding of the predictive mechanisms underlying diabetes. To facilitate the application in clinical and real-life settings, a nomogram was created based on the logistic regression algorithm, which can provide a preliminary assessment of the likelihood of an individual having diabetes. Overall, this research contributes valuable insights into the predictive modeling of diabetes, offering potential applications in clinical practice for more effective and timely diagnoses.
Asunto(s)
Palabras clave

Texto completo: 1 Banco de datos: MEDLINE Asunto principal: Diabetes Mellitus / Aprendizaje Automático Límite: Adult / Female / Humans / Male / Middle aged Idioma: En Revista: Methods Asunto de la revista: BIOQUIMICA Año: 2024 Tipo del documento: Article País de afiliación: China

Texto completo: 1 Banco de datos: MEDLINE Asunto principal: Diabetes Mellitus / Aprendizaje Automático Límite: Adult / Female / Humans / Male / Middle aged Idioma: En Revista: Methods Asunto de la revista: BIOQUIMICA Año: 2024 Tipo del documento: Article País de afiliación: China