Tree-Values: Selective Inference for Regression Trees.
Neufeld, Anna C; Gao, Lucy L; Witten, Daniela M.
Affiliation
  • Neufeld AC; Department of Statistics, University of Washington, Seattle, WA 98195, USA.
  • Gao LL; Department of Statistics, University of British Columbia, Vancouver, British Columbia, V6T 1Z4, Canada.
  • Witten DM; Departments of Statistics and Biostatistics, University of Washington, Seattle, WA 98195, USA.
Article in English | MEDLINE | ID: mdl-38481523
ABSTRACT
We consider conducting inference on the output of the Classification and Regression Tree (CART) (Breiman et al., 1984) algorithm. A naive approach to inference that does not account for the fact that the tree was estimated from the data will not achieve standard guarantees, such as Type 1 error rate control and nominal coverage. Thus, we propose a selective inference framework for conducting inference on a fitted CART tree. In a nutshell, we condition on the fact that the tree was estimated from the data. We propose a test for the difference in the mean response between a pair of terminal nodes that controls the selective Type 1 error rate, and a confidence interval for the mean response within a single terminal node that attains the nominal selective coverage. Efficient algorithms for computing the necessary conditioning sets are provided. We apply these methods in simulation and to a dataset involving the association between portion control interventions and caloric intake.
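The abstract's claim that naive inference fails to control the Type 1 error rate can be illustrated with a small simulation. The sketch below is not the authors' method or code: it fits only a depth-1 tree (a single SSE-minimizing split, with no pruning), assumes a known noise standard deviation of 1, and applies an ordinary two-sample z-test to the two resulting leaves. All function names and parameters (`best_stump_split`, `naive_p_value`, `rejection_rate`, `min_leaf`) are hypothetical choices made for this illustration.

```python
import math
import numpy as np

def best_stump_split(x, y, min_leaf=5):
    """Split y into two leaves at the SSE-minimizing threshold on x (depth-1 tree)."""
    order = np.argsort(x)
    y_sorted = y[order]
    n = len(y)
    csum = np.cumsum(y_sorted)
    sizes = np.arange(min_leaf, n - min_leaf + 1)  # candidate left-leaf sizes
    left_sum = csum[sizes - 1]
    right_sum = csum[-1] - left_sum
    # Minimizing the within-leaf SSE is equivalent to maximizing this quantity.
    gain = left_sum**2 / sizes + right_sum**2 / (n - sizes)
    s = sizes[np.argmax(gain)]
    return y_sorted[:s], y_sorted[s:]

def naive_p_value(left, right, sigma=1.0):
    """Two-sided z-test for equal leaf means, ignoring how the leaves were chosen."""
    se = sigma * math.sqrt(1 / len(left) + 1 / len(right))
    z = (left.mean() - right.mean()) / se
    return math.erfc(abs(z) / math.sqrt(2))

def rejection_rate(data_driven, n=100, reps=2000, alpha=0.05, seed=0):
    """Fraction of null datasets on which the naive test rejects at level alpha."""
    rng = np.random.default_rng(seed)
    rejections = 0
    for _ in range(reps):
        x = rng.uniform(size=n)
        y = rng.normal(size=n)  # global null: y is unrelated to x
        if data_driven:
            left, right = best_stump_split(x, y)
        else:
            # Assumed baseline: a split fixed in advance at the median of x.
            y_sorted = y[np.argsort(x)]
            left, right = y_sorted[: n // 2], y_sorted[n // 2 :]
        rejections += naive_p_value(left, right) < alpha
    return rejections / reps

print("fixed split:      ", rejection_rate(data_driven=False))
print("data-driven split:", rejection_rate(data_driven=True))
```

Under these assumptions, the fixed-split rate should sit near the nominal 0.05, while the data-driven rate should be far higher, because the split was chosen to make the two leaf means look as different as possible. Conditioning on the selected tree, as the abstract proposes, is intended to correct for exactly this.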
Full text: 1 Database: MEDLINE Language: English Publication year: 2022 Document type: Article