Your browser doesn't support javascript.
loading
Building more accurate decision trees with the additive tree.
Luna, José Marcio; Gennatas, Efstathios D; Ungar, Lyle H; Eaton, Eric; Diffenderfer, Eric S; Jensen, Shane T; Simone, Charles B; Friedman, Jerome H; Solberg, Timothy D; Valdes, Gilmer.
Afiliação
  • Luna JM; Department of Radiation Oncology, University of Pennsylvania, Philadelphia, PA 19104; gilmer.valdes@ucsf.edu jose.luna@pennmedicine.upenn.edu jhf@stanford.edu.
  • Gennatas ED; Department of Radiation Oncology, University of California, San Francisco, CA 94115.
  • Ungar LH; Department of Computing and Information Science, University of Pennsylvania, Philadelphia, PA 19104.
  • Eaton E; Department of Computing and Information Science, University of Pennsylvania, Philadelphia, PA 19104.
  • Diffenderfer ES; Department of Radiation Oncology, University of Pennsylvania, Philadelphia, PA 19104.
  • Jensen ST; Department of Statistics, University of Pennsylvania, Philadelphia, PA 19104.
  • Simone CB; Department of Radiation Oncology, New York Proton Center, New York, NY 10035.
  • Friedman JH; Department of Statistics, Stanford University, Stanford, CA 94305 gilmer.valdes@ucsf.edu jose.luna@pennmedicine.upenn.edu jhf@stanford.edu.
  • Solberg TD; Department of Radiation Oncology, University of California, San Francisco, CA 94115.
  • Valdes G; Department of Radiation Oncology, University of California, San Francisco, CA 94115; gilmer.valdes@ucsf.edu jose.luna@pennmedicine.upenn.edu jhf@stanford.edu.
Proc Natl Acad Sci U S A ; 116(40): 19887-19893, 2019 10 01.
Article em En | MEDLINE | ID: mdl-31527280
ABSTRACT
The expansion of machine learning to high-stakes application domains such as medicine, finance, and criminal justice, where making informed decisions requires clear understanding of the model, has increased the interest in interpretable machine learning. The widely used Classification and Regression Trees (CART) have played a major role in health sciences, due to their simple and intuitive explanation of predictions. Ensemble methods like gradient boosting can improve the accuracy of decision trees, but at the expense of the interpretability of the generated model. Additive models, such as those produced by gradient boosting, and full interaction models, such as CART, have been investigated largely in isolation. We show that these models exist along a spectrum, revealing previously unseen connections between these approaches. This paper introduces a rigorous formalization for the additive tree, an empirically validated learning technique for creating a single decision tree, and shows that this method can produce models equivalent to CART or gradient boosted stumps at the extremes by varying a single parameter. Although the additive tree is designed primarily to provide both the model interpretability and predictive performance needed for high-stakes applications like medicine, it also can produce decision trees represented by hybrid models between CART and boosted stumps that can outperform either of these approaches.
Assuntos
Palavras-chave

Texto completo: 1 Temas: ECOS / Financiamentos_gastos Bases de dados: MEDLINE Assunto principal: Algoritmos / Árvores de Decisões / Aprendizado de Máquina Tipo de estudo: Health_economic_evaluation / Prognostic_studies / Risk_factors_studies Idioma: En Revista: Proc Natl Acad Sci U S A Ano de publicação: 2019 Tipo de documento: Article

Texto completo: 1 Temas: ECOS / Financiamentos_gastos Bases de dados: MEDLINE Assunto principal: Algoritmos / Árvores de Decisões / Aprendizado de Máquina Tipo de estudo: Health_economic_evaluation / Prognostic_studies / Risk_factors_studies Idioma: En Revista: Proc Natl Acad Sci U S A Ano de publicação: 2019 Tipo de documento: Article