Your browser doesn't support javascript.
loading
Impute-then-exclude versus exclude-then-impute: Lessons when imputing a variable used both in cohort creation and as an independent variable in the analysis model.
Austin, Peter C; Giardiello, Daniele; van Buuren, Stef.
Afiliação
  • Austin PC; ICES, Toronto, Ontario, Canada.
  • Giardiello D; Institute of Health Policy, Management and Evaluation, University of Toronto, Toronto, Ontario, Canada.
  • van Buuren S; Sunnybrook Research Institute, Toronto, Ontario, Canada.
Stat Med ; 42(10): 1525-1541, 2023 05 10.
Article em En | MEDLINE | ID: mdl-36807923
We examined the setting in which a variable that is subject to missingness is used both as an inclusion/exclusion criterion for creating the analytic sample and subsequently as the primary exposure in the analysis model that is of scientific interest. An example is cancer stage, where patients with stage IV cancer are often excluded from the analytic sample, and cancer stage (I to III) is an exposure variable in the analysis model. We considered two analytic strategies. The first strategy, referred to as "exclude-then-impute," excludes subjects for whom the observed value of the target variable is equal to the specified value and then uses multiple imputation to complete the data in the resultant sample. The second strategy, referred to as "impute-then-exclude," first uses multiple imputation to complete the data and then excludes subjects based on the observed or filled-in values in the completed samples. Monte Carlo simulations were used to compare five methods (one based on "exclude-then-impute" and four based on "impute-then-exclude") along with the use of a complete case analysis. We considered both missing completely at random and missing at random missing data mechanisms. We found that an impute-then-exclude strategy using substantive model compatible fully conditional specification tended to have superior performance across 72 different scenarios. We illustrated the application of these methods using empirical data on patients hospitalized with heart failure when heart failure subtype was used for cohort creation (excluding subjects with heart failure with preserved ejection fraction) and was also an exposure in the analysis model.
Assuntos
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Projetos de Pesquisa Tipo de estudo: Health_economic_evaluation / Prognostic_studies Limite: Humans Idioma: En Revista: Stat Med Ano de publicação: 2023 Tipo de documento: Article País de afiliação: Canadá

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Projetos de Pesquisa Tipo de estudo: Health_economic_evaluation / Prognostic_studies Limite: Humans Idioma: En Revista: Stat Med Ano de publicação: 2023 Tipo de documento: Article País de afiliação: Canadá