Your browser doesn't support javascript.
loading
Federated unsupervised random forest for privacy-preserving patient stratification.
Pfeifer, Bastian; Sirocchi, Christel; Bloice, Marcus D; Kreuzthaler, Markus; Urschler, Martin.
Afiliação
  • Pfeifer B; Institute for Medical Informatics, Statistics and Documentation, Medical University Graz, Graz, 8010, Austria.
  • Sirocchi C; Department of Pure and Applied Sciences, University of Urbino, Urbino, 61029, Italy.
  • Bloice MD; Institute for Medical Informatics, Statistics and Documentation, Medical University Graz, Graz, 8010, Austria.
  • Kreuzthaler M; Institute for Medical Informatics, Statistics and Documentation, Medical University Graz, Graz, 8010, Austria.
  • Urschler M; Institute for Medical Informatics, Statistics and Documentation, Medical University Graz, Graz, 8010, Austria.
Bioinformatics ; 40(Suppl 2): ii198-ii207, 2024 09 01.
Article em En | MEDLINE | ID: mdl-39230698
ABSTRACT
MOTIVATION In the realm of precision medicine, effective patient stratification and disease subtyping demand innovative methodologies tailored for multi-omics data. Clustering techniques applied to multi-omics data have become instrumental in identifying distinct subgroups of patients, enabling a finer-grained understanding of disease variability. Meanwhile, clinical datasets are often small and must be aggregated from multiple hospitals. Online data sharing, however, is seen as a significant challenge due to privacy concerns, potentially impeding big data's role in medical advancements using machine learning. This work establishes a powerful framework for advancing precision medicine through unsupervised random forest-based clustering in combination with federated computing.

RESULTS:

We introduce a novel multi-omics clustering approach utilizing unsupervised random forests. The unsupervised nature of the random forest enables the determination of cluster-specific feature importance, unraveling key molecular contributors to distinct patient groups. Our methodology is designed for federated execution, a crucial aspect in the medical domain where privacy concerns are paramount. We have validated our approach on machine learning benchmark datasets as well as on cancer data from The Cancer Genome Atlas. Our method is competitive with the state-of-the-art in terms of disease subtyping, but at the same time substantially improves the cluster interpretability. Experiments indicate that local clustering performance can be improved through federated computing. AVAILABILITY AND IMPLEMENTATION The proposed methods are available as an R-package (https//github.com/pievos101/uRF).
Assuntos

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Medicina de Precisão Limite: Humans Idioma: En Revista: Bioinformatics / Bioinformatics (Oxford. Online) Assunto da revista: INFORMATICA MEDICA Ano de publicação: 2024 Tipo de documento: Article País de afiliação: Áustria

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Medicina de Precisão Limite: Humans Idioma: En Revista: Bioinformatics / Bioinformatics (Oxford. Online) Assunto da revista: INFORMATICA MEDICA Ano de publicação: 2024 Tipo de documento: Article País de afiliação: Áustria