Your browser doesn't support javascript.
loading
Dataset dependency of low-density lipoprotein-cholesterol estimation by machine learning.
Hidekazu, Ishida; Nagasawa, Hiroki; Yamamoto, Yasuko; Doi, Hiroki; Saito, Midori; Ishihara, Yuya; Fujita, Takashi; Ishida, Mariko; Kato, Yohei; Kikuchi, Ryosuke; Matsunami, Hidetoshi; Takemura, Masao; Ito, Hiroyasu; Saito, Kuniaki.
Affiliation
  • Hidekazu I; Department of Clinical Laboratory, Fujita Health University Hospital, Toyoake, Japan.
  • Nagasawa H; Division of Clinical Laboratory, Gifu University Hospital, Gifu, Japan.
  • Yamamoto Y; M2DS Co., Ltd, Japan.
  • Doi H; Department of Disease Control and Prevention, Fujita Health University Graduate School of Health Sciences, Toyoake, Japan.
  • Saito M; Advanced Diagnostic System Research Laboratory, Fujita Health University, Toyoake, Aichi, Japan.
  • Ishihara Y; Department of Clinical Laboratory, Fujita Health University Hospital, Toyoake, Japan.
  • Fujita T; Department of Clinical Laboratory, Fujita Health University Hospital, Toyoake, Japan.
  • Ishida M; Department of Clinical Laboratory, Fujita Health University Hospital, Toyoake, Japan.
  • Kato Y; Department of Clinical Laboratory, Fujita Health University Hospital, Toyoake, Japan.
  • Kikuchi R; Division of Clinical Laboratory, Gifu University Hospital, Gifu, Japan.
  • Matsunami H; Division of Clinical Laboratory, Gifu University Hospital, Gifu, Japan.
  • Takemura M; Division of Clinical Laboratory, Gifu University Hospital, Gifu, Japan.
  • Ito H; Matsunami Research Park, Japan.
  • Saito K; Department of Disease Control and Prevention, Fujita Health University Graduate School of Health Sciences, Toyoake, Japan.
Ann Clin Biochem ; 60(6): 396-405, 2023 11.
Article in En | MEDLINE | ID: mdl-37218090
ABSTRACT

OBJECTIVES:

We evaluated the applicability of a machine learning-based low-density lipoprotein-cholesterol (LDL-C) estimation method and the influence of the characteristics of the training datasets.

METHODS:

Three training datasets were chosen from training datasets health check-up participants at the Resource Center for Health Science (N = 2664), clinical patients at Gifu University Hospital (N = 7409), and clinical patients at Fujita Health University Hospital (N = 14,842). Nine different machine learning models were constructed through hyperparameter tuning and 10-fold cross-validation. Another test dataset of another 3711 clinical patients at Fujita Health University Hospital was selected as the test set used for comparing and validating the model against the Friedewald formula and the Martin method.

RESULTS:

The coefficients of determination of the models trained on the health check-up dataset produced coefficients of determination that were equal to or inferior to those of the Martin method. In contrast, the coefficients of determination of several models trained on clinical patients exceeded those of the Martin method. The means of the differences and the convergences to the direct method were higher for the models trained on the clinical patients' dataset than for those trained on the health check-up participants' dataset. The models trained on the latter dataset tended to overestimate the 2019 ESC/EAS Guideline for LDL-cholesterol classification.

CONCLUSION:

Although machine learning models provide valuable method for LDL-C estimates, they should be trained on datasets with matched characteristics. The versatility of machine learning methods is another important consideration.
Subject(s)
Key words

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Research Design / Machine Learning Type of study: Guideline / Prognostic_studies Limits: Humans Language: En Journal: Ann Clin Biochem Year: 2023 Document type: Article Affiliation country: Japón

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Research Design / Machine Learning Type of study: Guideline / Prognostic_studies Limits: Humans Language: En Journal: Ann Clin Biochem Year: 2023 Document type: Article Affiliation country: Japón