Your browser doesn't support javascript.
loading
Cluster-specific nonignorably missing, endogenous, and continuous regressors in multilevel model for binary outcome.
Kim, Gi-Soo; Lee, Youngjo; Kim, Hongsoo; Paik, Myunghee Cho.
Affiliation
  • Kim GS; Department of Statistics, Seoul National University, Seoul, South Korea.
  • Lee Y; Department of Statistics, Seoul National University, Seoul, South Korea.
  • Kim H; Graduate School of Public Health, Dept. of Public Health Sciences, Seoul National University, Seoul, South Korea.
  • Paik MC; Institute of Aging, Seoul National University, Seoul, South Korea.
Stat Methods Med Res ; 29(7): 1818-1830, 2020 07.
Article in En | MEDLINE | ID: mdl-31552805
ABSTRACT
In multilevel regression models for observational clustered data, regressors can be correlated with cluster-level error components, namely endogenous, due to omitted cluster-level covariates, measurement error, and simultaneity. When endogeneity is ignored, regression coefficient estimators can be severely biased. To deal with endogeneity, instrument variable methods have been widely used. However, the instrument variable method often requires external instrument variables with certain conditions that cannot be verified empirically. Methods that use the within-cluster variations of the endogenous variable work under the restriction that either the outcome or the endogenous variable has a linear relationship with the cluster-level random effect. We propose a new method for binary outcome when it follows a logistic mixed-effects model and the endogenous variable is normally distributed but not linear in the random effect. The proposed estimator capitalizes on the nested data structure without requiring external instrument variables. We show that the proposed estimator is consistent and asymptotically normal. Furthermore, our method can be applied when the endogenous variable is missing in a cluster-specific nonignorable mechanism, without requiring that the missing mechanism be correctly specified. We evaluate the finite sample performance of the proposed approach via simulation and apply the method to a health care study using a San Diego inpatient dataset. Our study demonstrates that the clustered structure can be exploited to draw valid analysis of multilevel data with correlated effects.
Subject(s)
Key words

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Research Design Type of study: Prognostic_studies / Risk_factors_studies Language: En Journal: Stat Methods Med Res Year: 2020 Document type: Article Affiliation country: South Korea

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Research Design Type of study: Prognostic_studies / Risk_factors_studies Language: En Journal: Stat Methods Med Res Year: 2020 Document type: Article Affiliation country: South Korea