Your browser doesn't support javascript.
loading
A longitudinal transition imputation model for categorical data applied to a large registry dataset.
Mamouris, Pavlos; Nassiri, Vahid; Verbeke, Geert; Janssens, Arne; Vaes, Bert; Molenberghs, Geert.
Afiliação
  • Mamouris P; Department of Public Health and Primary Care, KU Leuven, Leuven, Belgium.
  • Nassiri V; Open Analytics NV, Antwerp, Belgium.
  • Verbeke G; I-BioStat, KU Leuven University of Leuven, Leuven, Belgium.
  • Janssens A; I-BioStat, Hasselt University, Diepenbeek, Belgium.
  • Vaes B; Department of Public Health and Primary Care, KU Leuven, Leuven, Belgium.
  • Molenberghs G; Department of Public Health and Primary Care, KU Leuven, Leuven, Belgium.
Stat Med ; 42(29): 5405-5418, 2023 12 20.
Article em En | MEDLINE | ID: mdl-37752860
ABSTRACT
Imputation of longitudinal categorical covariates with several waves and many predictors is cumbersome in terms of implausible transitions, colinearity, and overfitting. We designed a simulation study with data obtained from a general practitioners' morbidity registry in Belgium for three waves, with smoking as the longitudinal covariate of interest. We set varying proportions of data on smoking to missing completely at random and missing not at random with proportions of missingness equal to 10%, 30%, 50%, and 70%. This study proposed a 3-stage approach that allows flexibility when imputing time-dependent categorical covariates. First, multiple imputation using fully conditional specification or multiple imputation for the predictor variables was deployed using the wide format such that previous and future information of the same patient was utilized. Second, a joint Markov transition model for initial, forward, backward, and intermittent probabilities was developed for each imputed dataset. Finally, this transition model was used for imputation. We compared the performance of this methodology with an analyses of the complete data and with listwise deletion in terms of bias and root mean square error. Next, we applied this methodology in a clinical case for years 2017 to 2021, where we estimated the effect of several covariates on the pneumococcal vaccination. This methodological framework ensures that the plausibility of transitions is preserved, overfitting and colinearity issues are resolved, and confounders can be utilized. Finally, a companion R package was developed to enable the replication and easy application of this methodology.
Assuntos
Palavras-chave

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Fumar Tipo de estudo: Prognostic_studies / Risk_factors_studies Limite: Humans Idioma: En Ano de publicação: 2023 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Fumar Tipo de estudo: Prognostic_studies / Risk_factors_studies Limite: Humans Idioma: En Ano de publicação: 2023 Tipo de documento: Article