Selecting, optimizing and externally validating a preexisting machine-learning regression algorithm for estimating waist circumference.
Comput Biol Med
; 169: 107909, 2024 Feb.
Article
en En
| MEDLINE
| ID: mdl-38181609
ABSTRACT
Obesity, typically defined by the body mass index (BMI), has well known negative health effects. However, the BMI has serious deficiencies in predicting the adverse risks associated to obesity. Waist circumference (WC) is an alternative to define obesity and a better disease predictor according to the literature. However, old databases often lack this information, it is inaccurate (collected via self-report) or it is incomplete. Thus, this study accurately assesses WC using machine learning. The novel approaches are 1) predictor variables (weight, height, age and sex) likely to appear in most data sets are used. 2) Publicly available data (including non-adults) and algorithms are used. 3) Systematic methods for data cleanup, model selection, hyperparameter optimization and external validation are performed. DATA ARE CLEANED one variable per column, no special codes, missing values or outliers. Preexisting regression algorithms are gaged by cross-validation, using one data set. The hyperparameters of the best performing algorithm are optimized. The tuned algorithm is externally validated with other data sets by cross-validation. In spite of the limited number of features, the tuned algorithm outperforms prior WC approximations, using the same or similar predictor variables. The tuned algorithm enables using data where WC is not measured, is incomplete or is unreliable. A similar approach would be useful to estimate other variables of interest.
Palabras clave
Texto completo:
1
Colección:
01-internacional
Banco de datos:
MEDLINE
Asunto principal:
Obesidad
Tipo de estudio:
Etiology_studies
/
Prognostic_studies
/
Risk_factors_studies
Límite:
Humans
Idioma:
En
Revista:
Comput Biol Med
Año:
2024
Tipo del documento:
Article