Mining multi-center heterogeneous medical data with distributed synthetic learning-Reference-Cited by-同舟云学术

Mining multi-center heterogeneous medical data with distributed synthetic learning

Published:2023-09-07 Issue:1 Volume:14 Page:
ISSN:2041-1723
Container-title:Nature Communications
language:en
Short-container-title:Nat Commun

Author:

Chang Qi^ORCID,Yan Zhennan^ORCID,Zhou Mu,Qu Hui,He Xiaoxiao,Zhang Han,Baskaran Lohendran,Al’Aref Subhi,Li Hongsheng,Zhang Shaoting,Metaxas Dimitris N.^ORCID

Abstract

AbstractOvercoming barriers on the use of multi-center data for medical analytics is challenging due to privacy protection and data heterogeneity in the healthcare system. In this study, we propose the Distributed Synthetic Learning (DSL) architecture to learn across multiple medical centers and ensure the protection of sensitive personal information. DSL enables the building of a homogeneous dataset with entirely synthetic medical images via a form of GAN-based synthetic learning. The proposed DSL architecture has the following key functionalities: multi-modality learning, missing modality completion learning, and continual learning. We systematically evaluate the performance of DSL on different medical applications using cardiac computed tomography angiography (CTA), brain tumor MRI, and histopathology nuclei datasets. Extensive experiments demonstrate the superior performance of DSL as a high-quality synthetic medical image provider by the use of an ideal synthetic quality metric called Dist-FID. We show that DSL can be adapted to heterogeneous data and remarkably outperforms the real misaligned modalities segmentation model by 55% and the temporal datasets segmentation model by 8%.

Funder

National Science Foundation

Publisher

Springer Science and Business Media LLC

Subject

General Physics and Astronomy,General Biochemistry, Genetics and Molecular Biology,General Chemistry,Multidisciplinary

Link

https://www.nature.com/articles/s41467-023-40687-y.pdf

Reference92 articles.

1. Domingos, P. M. A few useful things to know about machine learning. Commun. ACM 55, 78–87 (2012).

2. Vogt, N. Machine learning in neuroscience. Nat. Methods 15, 33–33 (2018).

3. Libbrecht, M. W. & Noble, W. S. Machine learning applications in genetics and genomics. Nat. Rev. Genet. 16, 321–332 (2015).