Your browser doesn't support javascript.
loading
dsMTL: a computational framework for privacy-preserving, distributed multi-task machine learning.
Cao, Han; Zhang, Youcheng; Baumbach, Jan; Burton, Paul R; Dwyer, Dominic; Koutsouleris, Nikolaos; Matschinske, Julian; Marcon, Yannick; Rajan, Sivanesan; Rieg, Thilo; Ryser-Welch, Patricia; Späth, Julian; Herrmann, Carl; Schwarz, Emanuel.
Afiliación
  • Cao H; Department of Psychiatry and Psychotherapy, Central Institute of Mental Health, Medical Faculty Mannheim, Heidelberg University, Mannheim 68158, Germany.
  • Zhang Y; Health Data Science Unit, Medical Faculty Heidelberg & BioQuant, Heidelberg 69120, Germany.
  • Baumbach J; Chair of Computational Systems Biology, University of Hamburg, Hamburg 22607, Germany.
  • Burton PR; Computational Biomedicine Lab, Department of Mathematics and Computer Science, University of Southern Denmark, Odense 5230, Denmark.
  • Dwyer D; Population Health Sciences Institute, Newcastle University, Newcastle upon Tyne NE2 4AX, UK.
  • Koutsouleris N; Department of Psychiatry and Psychotherapy, Section for Neurodiagnostic Applications, Ludwig-Maximilian University, Munich 80638, Germany.
  • Matschinske J; Department of Psychiatry and Psychotherapy, Section for Neurodiagnostic Applications, Ludwig-Maximilian University, Munich 80638, Germany.
  • Marcon Y; Chair of Computational Systems Biology, University of Hamburg, Hamburg 22607, Germany.
  • Rajan S; Epigeny, St Ouen, France.
  • Rieg T; Department of Psychiatry and Psychotherapy, Central Institute of Mental Health, Medical Faculty Mannheim, Heidelberg University, Mannheim 68158, Germany.
  • Ryser-Welch P; Department of Psychiatry and Psychotherapy, Central Institute of Mental Health, Medical Faculty Mannheim, Heidelberg University, Mannheim 68158, Germany.
  • Späth J; Population Health Sciences Institute, Newcastle University, Newcastle upon Tyne NE2 4AX, UK.
  • Schwarz E; Health Data Science Unit, Medical Faculty Heidelberg & BioQuant, Heidelberg 69120, Germany.
Bioinformatics ; 38(21): 4919-4926, 2022 10 31.
Article en En | MEDLINE | ID: mdl-36073911
MOTIVATION: In multi-cohort machine learning studies, it is critical to differentiate between effects that are reproducible across cohorts and those that are cohort-specific. Multi-task learning (MTL) is a machine learning approach that facilitates this differentiation through the simultaneous learning of prediction tasks across cohorts. Since multi-cohort data can often not be combined into a single storage solution, there would be the substantial utility of an MTL application for geographically distributed data sources. RESULTS: Here, we describe the development of 'dsMTL', a computational framework for privacy-preserving, distributed multi-task machine learning that includes three supervised and one unsupervised algorithms. First, we derive the theoretical properties of these methods and the relevant machine learning workflows to ensure the validity of the software implementation. Second, we implement dsMTL as a library for the R programming language, building on the DataSHIELD platform that supports the federated analysis of sensitive individual-level data. Third, we demonstrate the applicability of dsMTL for comorbidity modeling in distributed data. We show that comorbidity modeling using dsMTL outperformed conventional, federated machine learning, as well as the aggregation of multiple models built on the distributed datasets individually. The application of dsMTL was computationally efficient and highly scalable when applied to moderate-size (n < 500), real expression data given the actual network latency. AVAILABILITY AND IMPLEMENTATION: dsMTL is freely available at https://github.com/transbioZI/dsMTLBase (server-side package) and https://github.com/transbioZI/dsMTLClient (client-side package). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Asunto(s)

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Asunto principal: Privacidad / Aprendizaje Automático Tipo de estudio: Prognostic_studies Límite: Humans Idioma: En Revista: Bioinformatics Asunto de la revista: INFORMATICA MEDICA Año: 2022 Tipo del documento: Article País de afiliación: Alemania

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Asunto principal: Privacidad / Aprendizaje Automático Tipo de estudio: Prognostic_studies Límite: Humans Idioma: En Revista: Bioinformatics Asunto de la revista: INFORMATICA MEDICA Año: 2022 Tipo del documento: Article País de afiliación: Alemania
...