Your browser doesn't support javascript.
loading
Constructing bi-plots for random forest: Tutorial.
Blanchet, Lionel; Vitale, Raffaele; van Vorstenbosch, Robert; Stavropoulos, George; Pender, John; Jonkers, Daisy; Schooten, Frederik-Jan van; Smolinska, Agnieszka.
Afiliação
  • Blanchet L; Department of Pharmacology and Toxicology, School of Nutrition, Toxicology and Translational Research in Metabolism (NUTRIM), Maastricht University Medical Center+, Maastricht, the Netherlands.
  • Vitale R; Laboratoire de Spectrochimie Infrarouge et Raman - LASIR CNRS - UMR 8516, Université de Lille, Bâtiment C5, F-59000, Lille, France; Molecular Imaging and Photonics Unit, Department of Chemistry, Katholieke Universiteit Leuven, Celestijnenlaan 200F, B-3001, Leuven, Belgium.
  • van Vorstenbosch R; Department of Pharmacology and Toxicology, School of Nutrition, Toxicology and Translational Research in Metabolism (NUTRIM), Maastricht University Medical Center+, Maastricht, the Netherlands.
  • Stavropoulos G; Department of Pharmacology and Toxicology, School of Nutrition, Toxicology and Translational Research in Metabolism (NUTRIM), Maastricht University Medical Center+, Maastricht, the Netherlands.
  • Pender J; Department of Medical Microbiology, School of Nutrition and Translational Research in Metabolism (NUTRIM), Maastricht University Medical Centre+, Maastricht, the Netherlands; Department of Medical Microbiology, School for Public Health and Primary Care (CAPHRI), Maastricht University Medical Centre+
  • Jonkers D; Division Gastroenterology-Hepatology, Department of Internal Medicine, NUTRIM School of Nutrition and Translational Research in Metabolism, Maastricht University Medical Centre+, 6202 AZ, Maastricht, the Netherlands.
  • Schooten FV; Department of Pharmacology and Toxicology, School of Nutrition, Toxicology and Translational Research in Metabolism (NUTRIM), Maastricht University Medical Center+, Maastricht, the Netherlands.
  • Smolinska A; Department of Pharmacology and Toxicology, School of Nutrition, Toxicology and Translational Research in Metabolism (NUTRIM), Maastricht University Medical Center+, Maastricht, the Netherlands. Electronic address: a.smolinska@maastrichtuniversity.nl.
Anal Chim Acta ; 1131: 146-155, 2020 Sep 22.
Article em En | MEDLINE | ID: mdl-32928475
ABSTRACT
Current technological developments have allowed for a significant increase and availability of data. Consequently, this has opened enormous opportunities for the machine learning and data science field, translating into the development of new algorithms in a wide range of applications in medical, biomedical, daily-life, and national security areas. Ensemble techniques are among the pillars of the machine learning field, and they can be defined as approaches in which multiple, complex, independent/uncorrelated, predictive models are subsequently combined by either averaging or voting to yield a higher model performance. Random forest (RF), a popular ensemble method, has been successfully applied in various domains due to its ability to build predictive models with high certainty and little necessity of model optimization. RF provides both a predictive model and an estimation of the variable importance. However, the estimation of the variable importance is based on thousands of trees, and therefore, it does not specify which variable is important for which sample group. The present study demonstrates an approach based on the pseudo-sample principle that allows for construction of bi-plots (i.e. spin plots) associated with RF models. The pseudo-sample principle for RF. is explained and demonstrated by using two simulated datasets, and three different types of real data, which include political sciences, food chemistry and the human microbiome data. The pseudo-sample bi-plots, associated with RF and its unsupervised version, allow for a versatile visualization of multivariate models, and the variable importance and the relation among them.
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Tipo de estudo: Clinical_trials / Prognostic_studies Idioma: En Revista: Anal Chim Acta Ano de publicação: 2020 Tipo de documento: Article País de afiliação: Holanda

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Tipo de estudo: Clinical_trials / Prognostic_studies Idioma: En Revista: Anal Chim Acta Ano de publicação: 2020 Tipo de documento: Article País de afiliação: Holanda