Your browser doesn't support javascript.
loading
Federated learning for multi-omics: A performance evaluation in Parkinson's disease.
Danek, Benjamin P; Makarious, Mary B; Dadu, Anant; Vitale, Dan; Lee, Paul Suhwan; Singleton, Andrew B; Nalls, Mike A; Sun, Jimeng; Faghri, Faraz.
Afiliação
  • Danek BP; Department of Computer Science, University of Illinois at Urbana-Champaign, Champaign, IL 61820, USA.
  • Makarious MB; Center for Alzheimer's and Related Dementias (CARD), National Institute on Aging and National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda, MD 20892, USA.
  • Dadu A; DataTecnica, Washington, DC 20037, USA.
  • Vitale D; Laboratory of Neurogenetics, National Institute on Aging, National Institutes of Health, Bethesda, MD 20892, USA.
  • Lee PS; Department of Clinical and Movement Neurosciences, UCL Queen Square Institute of Neurology, London, UK.
  • Singleton AB; UCL Movement Disorders Centre, University College London, London, UK.
  • Nalls MA; Center for Alzheimer's and Related Dementias (CARD), National Institute on Aging and National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda, MD 20892, USA.
  • Sun J; DataTecnica, Washington, DC 20037, USA.
  • Faghri F; Center for Alzheimer's and Related Dementias (CARD), National Institute on Aging and National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda, MD 20892, USA.
Patterns (N Y) ; 5(3): 100945, 2024 Mar 08.
Article em En | MEDLINE | ID: mdl-38487808
ABSTRACT
While machine learning (ML) research has recently grown more in popularity, its application in the omics domain is constrained by access to sufficiently large, high-quality datasets needed to train ML models. Federated learning (FL) represents an opportunity to enable collaborative curation of such datasets among participating institutions. We compare the simulated performance of several models trained using FL against classically trained ML models on the task of multi-omics Parkinson's disease prediction. We find that FL model performance tracks centrally trained ML models, where the most performant FL model achieves an AUC-PR of 0.876 ± 0.009, 0.014 ± 0.003 less than its centrally trained variation. We also determine that the dispersion of samples within a federation plays a meaningful role in model performance. Our study implements several open-source FL frameworks and aims to highlight some of the challenges and opportunities when applying these collaborative methods in multi-omics studies.
Palavras-chave

Texto completo: 1 Base de dados: MEDLINE Idioma: En Ano de publicação: 2024 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Idioma: En Ano de publicação: 2024 Tipo de documento: Article