Your browser doesn't support javascript.
loading
Exploring the diagnostic performance of machine learning in prediction of metabolic phenotypes focusing on thyroid function.
Ahn, Hyeong Jun; Ishikawa, Kyle; Kim, Min-Hee.
Affiliation
  • Ahn HJ; Department of Quantitative Health Sciences, John A. Burns School of Medicine, University of Hawaii at Manoa, Honolulu, Hawaii, United States of America.
  • Ishikawa K; Department of Quantitative Health Sciences, John A. Burns School of Medicine, University of Hawaii at Manoa, Honolulu, Hawaii, United States of America.
  • Kim MH; Division of Endocrinology and Metabolism, Department of Internal Medicine, Eunpyeong St. Mary's Hospital, College of Medicine, The Catholic University of Korea Seoul, South Korea.
PLoS One ; 19(6): e0304785, 2024.
Article in En | MEDLINE | ID: mdl-38941283
ABSTRACT
In this study, we employed various machine learning models to predict metabolic phenotypes, focusing on thyroid function, using a dataset from the National Health and Nutrition Examination Survey (NHANES) from 2007 to 2012. Our analysis utilized laboratory parameters relevant to thyroid function or metabolic dysregulation in addition to demographic features, aiming to uncover potential associations between thyroid function and metabolic phenotypes by various machine learning methods. Multinomial Logistic Regression performed best to identify the relationship between thyroid function and metabolic phenotypes, achieving an area under receiver operating characteristic curve (AUROC) of 0.818, followed closely by Neural Network (AUROC 0.814). Following the above, the performance of Random Forest, Boosted Trees, and K Nearest Neighbors was inferior to the first two methods (AUROC 0.811, 0.811, and 0.786, respectively). In Random Forest, homeostatic model assessment for insulin resistance, serum uric acid, serum albumin, gamma glutamyl transferase, and triiodothyronine/thyroxine ratio were positioned in the upper ranks of variable importance. These results highlight the potential of machine learning in understanding complex relationships in health data. However, it's important to note that model performance may vary depending on data characteristics and specific requirements. Furthermore, we emphasize the significance of accounting for sampling weights in complex survey data analysis and the potential benefits of incorporating additional variables to enhance model accuracy and insights. Future research can explore advanced methodologies combining machine learning, sample weights, and expanded variable sets to further advance survey data analysis.
Subject(s)

Full text: 1 Database: MEDLINE Main subject: Phenotype / Thyroid Gland / Machine Learning Limits: Adult / Female / Humans / Male / Middle aged Language: En Journal: PLoS One Journal subject: CIENCIA / MEDICINA Year: 2024 Type: Article Affiliation country: United States

Full text: 1 Database: MEDLINE Main subject: Phenotype / Thyroid Gland / Machine Learning Limits: Adult / Female / Humans / Male / Middle aged Language: En Journal: PLoS One Journal subject: CIENCIA / MEDICINA Year: 2024 Type: Article Affiliation country: United States