Your browser doesn't support javascript.
loading
Subsampling versus bootstrapping in resampling-based model selection for multivariable regression.
De Bin, Riccardo; Janitza, Silke; Sauerbrei, Willi; Boulesteix, Anne-Laure.
Afiliação
  • De Bin R; Department of Medical Informatics, Biometry and Epidemiology, University of Munich, Marchioninistr. 15, 81377 Munich, Germany.
  • Janitza S; Department of Medical Informatics, Biometry and Epidemiology, University of Munich, Marchioninistr. 15, 81377 Munich, Germany.
  • Sauerbrei W; Department of Medical Biometry and Medical Informatics, University Medical Center Freiburg, Stefan-Meier-Str. 26, 79106 Freiburg im Breisgau, Germany.
  • Boulesteix AL; Department of Medical Informatics, Biometry and Epidemiology, University of Munich, Marchioninistr. 15, 81377 Munich, Germany.
Biometrics ; 72(1): 272-80, 2016 Mar.
Article em En | MEDLINE | ID: mdl-26288150
ABSTRACT
In recent years, increasing attention has been devoted to the problem of the stability of multivariable regression models, understood as the resistance of the model to small changes in the data on which it has been fitted. Resampling techniques, mainly based on the bootstrap, have been developed to address this issue. In particular, the approaches based on the idea of "inclusion frequency" consider the repeated implementation of a variable selection procedure, for example backward elimination, on several bootstrap samples. The analysis of the variables selected in each iteration provides useful information on the model stability and on the variables' importance. Recent findings, nevertheless, show possible pitfalls in the use of the bootstrap, and alternatives such as subsampling have begun to be taken into consideration in the literature. Using model selection frequencies and variable inclusion frequencies, we empirically compare these two different resampling techniques, investigating the effect of their use in selected classical model selection procedures for multivariable regression. We conduct our investigations by analyzing two real data examples and by performing a simulation study. Our results reveal some advantages in using a subsampling technique rather than the bootstrap in this context.
Assuntos
Palavras-chave

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Algoritmos / Análise Multivariada / Análise de Regressão / Modelos Estatísticos / Tamanho da Amostra Tipo de estudo: Diagnostic_studies / Prognostic_studies / Risk_factors_studies Idioma: En Revista: Biometrics Ano de publicação: 2016 Tipo de documento: Article País de afiliação: Alemanha

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Algoritmos / Análise Multivariada / Análise de Regressão / Modelos Estatísticos / Tamanho da Amostra Tipo de estudo: Diagnostic_studies / Prognostic_studies / Risk_factors_studies Idioma: En Revista: Biometrics Ano de publicação: 2016 Tipo de documento: Article País de afiliação: Alemanha