Your browser doesn't support javascript.
loading
Modeling continuous covariates with a "spike" at zero: Bivariate approaches.
Jenkner, Carolin; Lorenz, Eva; Becher, Heiko; Sauerbrei, Willi.
Affiliation
  • Jenkner C; Institute of Medical Biometry and Statistics, Freiburg University Medical Centre, Stefan-Meier-Str. 26, 79104 Freiburg, Germany.
  • Lorenz E; Epidemiology and Biostatistics Unit, Medical Faculty, University of Heidelberg, Germany.
  • Becher H; Epidemiology and Biostatistics Unit, Medical Faculty, University of Heidelberg, Germany.
  • Sauerbrei W; Institute of Medical Biometry and Epidemiology, University Hospital Hamburg-Eppendorf, Germany.
Biom J ; 58(4): 783-96, 2016 Jul.
Article in En | MEDLINE | ID: mdl-27072783
In epidemiology and clinical research, predictors often take value zero for a large amount of observations while the distribution of the remaining observations is continuous. These predictors are called variables with a spike at zero. Examples include smoking or alcohol consumption. Recently, an extension of the fractional polynomial (FP) procedure, a technique for modeling nonlinear relationships, was proposed to deal with such situations. To indicate whether or not a value is zero, a binary variable is added to the model. In a two stage procedure, called FP-spike, the necessity of the binary variable and/or the continuous FP function for the positive part are assessed for a suitable fit. In univariate analyses, the FP-spike procedure usually leads to functional relationships that are easy to interpret. This paper introduces four approaches for dealing with two variables with a spike at zero (SAZ). The methods depend on the bivariate distribution of zero and nonzero values. Bi-Sep is the simplest of the four bivariate approaches. It uses the univariate FP-spike procedure separately for the two SAZ variables. In Bi-D3, Bi-D1, and Bi-Sub, proportions of zeros in both variables are considered simultaneously in the binary indicators. Therefore, these strategies can account for correlated variables. The methods can be used for arbitrary distributions of the covariates. For illustration and comparison of results, data from a case-control study on laryngeal cancer, with smoking and alcohol intake as two SAZ variables, is considered. In addition, a possible extension to three or more SAZ variables is outlined. A combination of log-linear models for the analysis of the correlation in combination with the bivariate approaches is proposed.
Subject(s)
Key words

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Data Interpretation, Statistical / Models, Statistical Type of study: Observational_studies / Prognostic_studies / Risk_factors_studies Limits: Humans Language: En Journal: Biom J Year: 2016 Type: Article Affiliation country: Germany

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Data Interpretation, Statistical / Models, Statistical Type of study: Observational_studies / Prognostic_studies / Risk_factors_studies Limits: Humans Language: En Journal: Biom J Year: 2016 Type: Article Affiliation country: Germany