Your browser doesn't support javascript.
loading
Are the Relevant Risk Factors Being Adequately Captured in Empirical Studies of Smoking Initiation? A Machine Learning Analysis Based on the Population Assessment of Tobacco and Health Study.
Le, Thuy T T; Issabakhsh, Mona; Li, Yameng; María Sánchez-Romero, Luz; Tan, Jiale; Meza, Rafael; Levy, David; Mendez, David.
Afiliação
  • Le TTT; Department of Health Management and Policy, School of Public Health, University of Michigan, Ann Arbor, MI, USA.
  • Issabakhsh M; Department of Oncology, School of Medicine, Georgetown University, Washington, DC, USA.
  • Li Y; Department of Oncology, School of Medicine, Georgetown University, Washington, DC, USA.
  • María Sánchez-Romero L; Department of Oncology, School of Medicine, Georgetown University, Washington, DC, USA.
  • Tan J; Department of Epidemiology, School of Public Health, University of Michigan, Ann Arbor, MI, USA.
  • Meza R; Integrative Oncology, BC Cancer Research Institute, Vancouver BC, USA.
  • Levy D; Department of Oncology, School of Medicine, Georgetown University, Washington, DC, USA.
  • Mendez D; Department of Health Management and Policy, School of Public Health, University of Michigan, Ann Arbor, MI, USA.
Nicotine Tob Res ; 25(8): 1481-1488, 2023 Jul 14.
Article em En | MEDLINE | ID: mdl-37099744
ABSTRACT

INTRODUCTION:

Cigarette smoking continues to pose a threat to public health. Identifying individual risk factors for smoking initiation is essential to further mitigate this epidemic. To the best of our knowledge, no study today has used machine learning (ML) techniques to automatically uncover informative predictors of smoking onset among adults using the Population Assessment of Tobacco and Health (PATH) study. AIMS AND

METHODS:

In this work, we employed random forest paired with Recursive Feature Elimination to identify relevant PATH variables that predict smoking initiation among adults who have never smoked at baseline between two consecutive PATH waves. We included all potentially informative baseline variables in wave 1 (wave 4) to predict past 30-day smoking status in wave 2 (wave 5). Using the first and most recent pairs of PATH waves was found sufficient to identify the key risk factors of smoking initiation and test their robustness over time. The eXtreme Gradient Boosting method was employed to test the quality of these selected variables.

RESULTS:

As a result, classification models suggested about 60 informative PATH variables among many candidate variables in each baseline wave. With these selected predictors, the resulting models have a high discriminatory power with the area under the specificity-sensitivity curves of around 80%. We examined the chosen variables and discovered important features. Across the considered waves, two factors, (1) BMI, and (2) dental and oral health status, robustly appeared as important predictors of smoking initiation, besides other well-established predictors.

CONCLUSIONS:

Our work demonstrates that ML methods are useful to predict smoking initiation with high accuracy, identifying novel smoking initiation predictors, and to enhance our understanding of tobacco use behaviors. IMPLICATIONS Understanding individual risk factors for smoking initiation is essential to prevent smoking initiation. With this methodology, a set of the most informative predictors of smoking onset in the PATH data were identified. Besides reconfirming well-known risk factors, the findings suggested additional predictors of smoking initiation that have been overlooked in previous work. More studies that focus on the newly discovered factors (BMI and dental and oral health status,) are needed to confirm their predictive power against the onset of smoking as well as determine the underlying mechanisms.
Assuntos

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Produtos do Tabaco / Sistemas Eletrônicos de Liberação de Nicotina / Fumar Cigarros Tipo de estudo: Etiology_studies / Observational_studies / Prognostic_studies / Risk_factors_studies Limite: Adult / Humans Idioma: En Ano de publicação: 2023 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Produtos do Tabaco / Sistemas Eletrônicos de Liberação de Nicotina / Fumar Cigarros Tipo de estudo: Etiology_studies / Observational_studies / Prognostic_studies / Risk_factors_studies Limite: Adult / Humans Idioma: En Ano de publicação: 2023 Tipo de documento: Article