Your browser doesn't support javascript.
loading
Federated Random Forests can improve local performance of predictive models for various healthcare applications.
Hauschild, Anne-Christin; Lemanczyk, Marta; Matschinske, Julian; Frisch, Tobias; Zolotareva, Olga; Holzinger, Andreas; Baumbach, Jan; Heider, Dominik.
Afiliación
  • Hauschild AC; Department of Mathematics and Computer Science, University of Marburg, Marburg, Germany.
  • Lemanczyk M; Department of Mathematics and Computer Science, University of Marburg, Marburg, Germany.
  • Matschinske J; TUM School of Life Sciences Weihenstephan, Technical University of Munich, Freising-Weihenstephan, Germany.
  • Frisch T; Computational Systems Biology, University of Hamburg, Hamburg, Germany.
  • Zolotareva O; Department of Mathematics and Computer Science, University of Southern Denmark, Odense, Denmark.
  • Holzinger A; TUM School of Life Sciences Weihenstephan, Technical University of Munich, Freising-Weihenstephan, Germany.
  • Baumbach J; Institut für Medizinische Informatik, Statistik und Dokumentation, Medizinische Universität Graz, Graz, Austria.
  • Heider D; Computational Systems Biology, University of Hamburg, Hamburg, Germany.
Bioinformatics ; 38(8): 2278-2286, 2022 04 12.
Article en En | MEDLINE | ID: mdl-35139148
MOTIVATION: Limited data access has hindered the field of precision medicine from exploring its full potential, e.g. concerning machine learning and privacy and data protection rules.Our study evaluates the efficacy of federated Random Forests (FRF) models, focusing particularly on the heterogeneity within and between datasets. We addressed three common challenges: (i) number of parties, (ii) sizes of datasets and (iii) imbalanced phenotypes, evaluated on five biomedical datasets. RESULTS: The FRF outperformed the average local models and performed comparably to the data-centralized models trained on the entire data. With an increasing number of models and decreasing dataset size, the performance of local models decreases drastically. The FRF, however, do not decrease significantly. When combining datasets of different sizes, the FRF vastly improve compared to the average local models. We demonstrate that the FRF remain more robust and outperform the local models by analyzing different class-imbalances.Our results support that FRF overcome boundaries of clinical research and enables collaborations across institutes without violating privacy or legal regulations. Clinicians benefit from a vast collection of unbiased data aggregated from different geographic locations, demographics and other varying factors. They can build more generalizable models to make better clinical decisions, which will have relevance, especially for patients in rural areas and rare or geographically uncommon diseases, enabling personalized treatment. In combination with secure multi-party computation, federated learning has the power to revolutionize clinical practice by increasing the accuracy and robustness of healthcare AI and thus paving the way for precision medicine. AVAILABILITY AND IMPLEMENTATION: The implementation of the federated random forests can be found at https://featurecloud.ai/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Asunto(s)

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: Privacidad / Bosques Aleatorios Tipo de estudio: Clinical_trials / Prognostic_studies / Risk_factors_studies Idioma: En Revista: Bioinformatics Asunto de la revista: INFORMATICA MEDICA Año: 2022 Tipo del documento: Article País de afiliación: Alemania

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: Privacidad / Bosques Aleatorios Tipo de estudio: Clinical_trials / Prognostic_studies / Risk_factors_studies Idioma: En Revista: Bioinformatics Asunto de la revista: INFORMATICA MEDICA Año: 2022 Tipo del documento: Article País de afiliación: Alemania