Your browser doesn't support javascript.
loading
Forward variable selection for random forest models.
Velthoen, Jasper; Cai, Juan-Juan; Jongbloed, Geurt.
Afiliação
  • Velthoen J; Department of Applied Mathematics, Delft University of Technology, Delft, The Netherlands.
  • Cai JJ; Department of Econometrics and Data Science, Vrije Universiteit Amsterdam, De Boelelaan 1105, Amsterdam 1081 HV, The Netherlands.
  • Jongbloed G; Department of Applied Mathematics, Delft University of Technology, Delft, The Netherlands.
J Appl Stat ; 50(13): 2836-2856, 2023.
Article em En | MEDLINE | ID: mdl-37720244
Random forest is a popular prediction approach for handling high dimensional covariates. However, it often becomes infeasible to interpret the obtained high dimensional and non-parametric model. Aiming for an interpretable predictive model, we develop a forward variable selection method using the continuous ranked probability score (CRPS) as the loss function. eOur stepwise procedure selects at each step a variable that minimizes the CRPS risk and a stopping criterion for selection is designed based on an estimation of the CRPS risk difference of two consecutive steps. We provide mathematical motivation for our method by proving that in a population sense, the method attains the optimal set. In a simulation study, we compare the performance of our method with an existing variable selection method, for different sample sizes and correlation strength of covariates. Our method is observed to have a much lower false positive rate. We also demonstrate an application of our method to statistical post-processing of daily maximum temperature forecasts in the Netherlands. Our method selects about 10% covariates while retaining the same predictive power.
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Tipo de estudo: Clinical_trials / Prognostic_studies Idioma: En Revista: J Appl Stat Ano de publicação: 2023 Tipo de documento: Article País de afiliação: Holanda

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Tipo de estudo: Clinical_trials / Prognostic_studies Idioma: En Revista: J Appl Stat Ano de publicação: 2023 Tipo de documento: Article País de afiliação: Holanda