Your browser doesn't support javascript.
loading
A note on dichotomization of continuous response variable in the presence of contamination and model misspecification.
Shentu, Yue; Xie, Minge.
Afiliación
  • Shentu Y; Department of Statistics and Biostatistics, Hill Center for Mathematical Sciences, Rutgers University, Piscataway, NJ 08854, U.S.A.
Stat Med ; 29(21): 2200-14, 2010 Sep 20.
Article en En | MEDLINE | ID: mdl-20812301
ABSTRACT
The purpose of this note is to raise awareness of the complexity of the practice involving dichotomization. It is well known that the regular regression models are effective tools for analyzing Gaussian-type response variables, and researchers are often told that it is a 'bad idea' to practice dichotomization if continuous measurements are available. We demonstrate through special cases, however, that there is another side of the story if the response variable is contaminated. Although dichotomization causes loss of information, it can also reduce input of contamination. If the reduction of contamination input outweighs the loss of information, analysis based on dichotomization can sometimes provide better results. We derive formulas of bias and variance for binary regression estimators under a contamination model of unknown additive errors, and compare them with both the least squares and robust M-estimators from the corresponding linear regression analysis using continuous responses. As a case study, we study extensively the case in which the observed response is contaminated by an error with a mean and a variance proportional to the mean and the variance of the uncontaminated true response. Conditions under which dichotomization is preferred are obtained. A simulation study based on a real data setting is provided, which supports the theoretical developments.
Asunto(s)

Texto completo: 1 Base de datos: MEDLINE Asunto principal: Bioestadística / Modelos Estadísticos Tipo de estudio: Prognostic_studies / Risk_factors_studies Idioma: En Revista: Stat Med Año: 2010 Tipo del documento: Article

Texto completo: 1 Base de datos: MEDLINE Asunto principal: Bioestadística / Modelos Estadísticos Tipo de estudio: Prognostic_studies / Risk_factors_studies Idioma: En Revista: Stat Med Año: 2010 Tipo del documento: Article