Mining multi-center heterogeneous medical data with distributed synthetic learning.

Chang, Qi; Yan, Zhennan; Zhou, Mu; Qu, Hui; He, Xiaoxiao; Zhang, Han; Baskaran, Lohendran; Al'Aref, Subhi; Li, Hongsheng; Zhang, Shaoting; Metaxas, Dimitris N

Chang, Qi; Yan, Zhennan; Zhou, Mu; Qu, Hui; He, Xiaoxiao; Zhang, Han; Baskaran, Lohendran; Al'Aref, Subhi; Li, Hongsheng; Zhang, Shaoting; Metaxas, Dimitris N.

Affiliation

Chang Q; Department of Computer Science, Rutgers University, Piscataway, NJ, USA.
Yan Z; SenseBrain Research, Princeton, NJ, USA.
Zhou M; SenseBrain Research, Princeton, NJ, USA.
Qu H; Shanghai Artificial Intelligence Laboratory, Shanghai, China.
He X; Department of Computer Science, Rutgers University, Piscataway, NJ, USA.
Zhang H; Department of Computer Science, Rutgers University, Piscataway, NJ, USA.
Baskaran L; Department of Computer Science, Rutgers University, Piscataway, NJ, USA.
Al'Aref S; Department of Cardiovascular Medicine, National Heart Centre Singapore, and Duke-National University Of Singapore, Singapore, Singapore.
Li H; Department of Medicine, Division of Cardiology, University of Arkansas for Medical Sciences, Little Rock, AR, USA.
Zhang S; Chinese University of Hong Kong, Hong Kong SAR, China. hsli@ee.cuhk.edu.hk.
Metaxas DN; Centre for Perceptual and Interactive Intelligence (CPII), Hong Kong SAR, China. hsli@ee.cuhk.edu.hk.

Nat Commun ; 14(1): 5510, 2023 09 07.

Article in En | MEDLINE | ID: mdl-37679325

ABSTRACT

ABSTRACT

Overcoming barriers on the use of multi-center data for medical analytics is challenging due to privacy protection and data heterogeneity in the healthcare system. In this study, we propose the Distributed Synthetic Learning (DSL) architecture to learn across multiple medical centers and ensure the protection of sensitive personal information. DSL enables the building of a homogeneous dataset with entirely synthetic medical images via a form of GAN-based synthetic learning. The proposed DSL architecture has the following key functionalities multi-modality learning, missing modality completion learning, and continual learning. We systematically evaluate the performance of DSL on different medical applications using cardiac computed tomography angiography (CTA), brain tumor MRI, and histopathology nuclei datasets. Extensive experiments demonstrate the superior performance of DSL as a high-quality synthetic medical image provider by the use of an ideal synthetic quality metric called Dist-FID. We show that DSL can be adapted to heterogeneous data and remarkably outperforms the real misaligned modalities segmentation model by 55% and the temporal datasets segmentation model by 8%.

Subject(s)

Brain Neoplasms; Learning; Humans; Angiography; Cell Nucleus; Computed Tomography Angiography

Fulltext

Add to My VHL

XML

PubMed Links

Search on Google

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Brain Neoplasms / Learning Type of study: Clinical_trials Limits: Humans Language: En Journal: Nat Commun Journal subject: BIOLOGIA / CIENCIA Year: 2023 Document type: Article Affiliation country: United States

Fulltext

Add to My VHL

XML

PubMed Links

Search on Google