Your browser doesn't support javascript.
loading
Ridge regression in prediction problems: automatic choice of the ridge parameter.
Cule, Erika; De Iorio, Maria.
Afiliação
  • Cule E; Department of Epidemiology and Biostatistics, Imperial College London, London, United Kingdom; Statistical Consulting Group, GlaxoSmithKline, Stevenage, United Kingdom.
Genet Epidemiol ; 37(7): 704-14, 2013 Nov.
Article em En | MEDLINE | ID: mdl-23893343
ABSTRACT
To date, numerous genetic variants have been identified as associated with diverse phenotypic traits. However, identified associations generally explain only a small proportion of trait heritability and the predictive power of models incorporating only known-associated variants has been small. Multiple regression is a popular framework in which to consider the joint effect of many genetic variants simultaneously. Ordinary multiple regression is seldom appropriate in the context of genetic data, due to the high dimensionality of the data and the correlation structure among the predictors. There has been a resurgence of interest in the use of penalised regression techniques to circumvent these difficulties. In this paper, we focus on ridge regression, a penalised regression approach that has been shown to offer good performance in multivariate prediction problems. One challenge in the application of ridge regression is the choice of the ridge parameter that controls the amount of shrinkage of the regression coefficients. We present a method to determine the ridge parameter based on the data, with the aim of good performance in high-dimensional prediction problems. We establish a theoretical justification for our approach, and demonstrate its performance on simulated genetic data and on a real data example. Fitting a ridge regression model to hundreds of thousands to millions of genetic variants simultaneously presents computational challenges. We have developed an R package, ridge, which addresses these issues. Ridge implements the automatic choice of ridge parameter presented in this paper, and is freely available from CRAN.
Assuntos
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Fenótipo / Variação Genética / Modelos Genéticos Tipo de estudo: Diagnostic_studies / Prognostic_studies / Risk_factors_studies Limite: Humans Idioma: En Revista: Genet Epidemiol Assunto da revista: EPIDEMIOLOGIA / GENETICA MEDICA Ano de publicação: 2013 Tipo de documento: Article País de afiliação: Reino Unido

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Fenótipo / Variação Genética / Modelos Genéticos Tipo de estudo: Diagnostic_studies / Prognostic_studies / Risk_factors_studies Limite: Humans Idioma: En Revista: Genet Epidemiol Assunto da revista: EPIDEMIOLOGIA / GENETICA MEDICA Ano de publicação: 2013 Tipo de documento: Article País de afiliação: Reino Unido