Your browser doesn't support javascript.
loading
FedSPL: federated self-paced learning for privacy-preserving disease diagnosis.
Wang, Qingyong; Zhou, Yun.
Affiliation
  • Wang Q; Science and Technology on Information Systems Engineering Laboratory, National University of Defense Technology, China.
  • Zhou Y; Science and Technology on Information Systems Engineering Laboratory, National University of Defense Technology, China.
Brief Bioinform ; 23(1)2022 01 17.
Article in En | MEDLINE | ID: mdl-34874995
ABSTRACT
The growing expansion of data availability in medical fields could help improve the performance of machine learning methods. However, with healthcare data, using multi-institutional datasets is challenging due to privacy and security concerns. Therefore, privacy-preserving machine learning methods are required. Thus, we use a federated learning model to train a shared global model, which is a central server that does not contain private data, and all clients maintain the sensitive data in their own institutions. The scattered training data are connected to improve model performance, while preserving data privacy. However, in the federated training procedure, data errors or noise can reduce learning performance. Therefore, we introduce the self-paced learning, which can effectively select high-confidence samples and drop high noisy samples to improve the performances of the training model and reduce the risk of data privacy leakage. We propose the federated self-paced learning (FedSPL), which combines the advantage of federated learning and self-paced learning. The proposed FedSPL model was evaluated on gene expression data distributed across different institutions where the privacy concerns must be considered. The results demonstrate that the proposed FedSPL model is secure, i.e. it does not expose the original record to other parties, and the computational overhead during training is acceptable. Compared with learning methods based on the local data of all parties, the proposed model can significantly improve the predicted F1-score by approximately 4.3%. We believe that the proposed method has the potential to benefit clinicians in gene selections and disease prognosis.
Subject(s)
Key words

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Privacy / Machine Learning Type of study: Diagnostic_studies / Prognostic_studies Limits: Humans Language: En Journal: Brief Bioinform Journal subject: BIOLOGIA / INFORMATICA MEDICA Year: 2022 Type: Article Affiliation country: China

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Privacy / Machine Learning Type of study: Diagnostic_studies / Prognostic_studies Limits: Humans Language: En Journal: Brief Bioinform Journal subject: BIOLOGIA / INFORMATICA MEDICA Year: 2022 Type: Article Affiliation country: China