Your browser doesn't support javascript.
loading
Interaction analysis under misspecification of main effects: Some common mistakes and simple solutions.
Zhang, Min; Yu, Youfei; Wang, Shikun; Salvatore, Maxwell; G Fritsche, Lars; He, Zihuai; Mukherjee, Bhramar.
Afiliação
  • Zhang M; Department of Biostatistics, University of Michigan School of Public Health, Ann Arbor, Michigan.
  • Yu Y; Department of Biostatistics, University of Michigan School of Public Health, Ann Arbor, Michigan.
  • Wang S; Department of Biostatistics, MD Anderson Cancer Center, Houston, Texas.
  • Salvatore M; Department of Biostatistics, University of Michigan School of Public Health, Ann Arbor, Michigan.
  • G Fritsche L; Department of Biostatistics, University of Michigan School of Public Health, Ann Arbor, Michigan.
  • He Z; Department of Neurology and Neurological Sciences, Stanford University, Stanford, California.
  • Mukherjee B; Department of Biostatistics, University of Michigan School of Public Health, Ann Arbor, Michigan.
Stat Med ; 39(11): 1675-1694, 2020 05 20.
Article em En | MEDLINE | ID: mdl-32101638
The statistical practice of modeling interaction with two linear main effects and a product term is ubiquitous in the statistical and epidemiological literature. Most data modelers are aware that the misspecification of main effects can potentially cause severe type I error inflation in tests for interactions, leading to spurious detection of interactions. However, modeling practice has not changed. In this article, we focus on the specific situation where the main effects in the model are misspecified as linear terms and characterize its impact on common tests for statistical interaction. We then propose some simple alternatives that fix the issue of potential type I error inflation in testing interaction due to main effect misspecification. We show that when using the sandwich variance estimator for a linear regression model with a quantitative outcome and two independent factors, both the Wald and score tests asymptotically maintain the correct type I error rate. However, if the independence assumption does not hold or the outcome is binary, using the sandwich estimator does not fix the problem. We further demonstrate that flexibly modeling the main effect under a generalized additive model can largely reduce or often remove bias in the estimates and maintain the correct type I error rate for both quantitative and binary outcomes regardless of the independence assumption. We show, under the independence assumption and for a continuous outcome, overfitting and flexibly modeling the main effects does not lead to power loss asymptotically relative to a correctly specified main effect model. Our simulation study further demonstrates the empirical fact that using flexible models for the main effects does not result in a significant loss of power for testing interaction in general. Our results provide an improved understanding of the strengths and limitations for tests of interaction in the presence of main effect misspecification. Using data from a large biobank study "The Michigan Genomics Initiative", we present two examples of interaction analysis in support of our results.
Assuntos
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Interpretação Estatística de Dados Tipo de estudo: Prognostic_studies Limite: Humans País/Região como assunto: America do norte Idioma: En Ano de publicação: 2020 Tipo de documento: Article

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Interpretação Estatística de Dados Tipo de estudo: Prognostic_studies Limite: Humans País/Região como assunto: America do norte Idioma: En Ano de publicação: 2020 Tipo de documento: Article