Preserving fairness and diagnostic accuracy in private large-scale AI models for medical imaging

Published:2024-03-14 Issue:1 Volume:4
ISSN:2730-664X
Container-title:Communications Medicine
language:en
Short-container-title:Commun Med

Author:

Tayebi Arasteh Soroosh, Ziller Alexander, Kuhl Christiane, Makowski Marcus, Nebelung Sven, Braren Rickmer, Rueckert Daniel, Truhn Daniel, Kaissis Georgios

Abstract

Background: Artificial intelligence (AI) models are increasingly used in the medical domain. However, as medical data is highly sensitive, special precautions to ensure its protection are required. The gold standard for privacy preservation is the introduction of differential privacy (DP) to model training. Prior work indicates that DP has negative implications on model accuracy and fairness, which are unacceptable in medicine and represent a main barrier to the widespread use of privacy-preserving techniques. In this work, we evaluated the effect of privacy-preserving training of AI models regarding accuracy and fairness compared to non-private training.

Methods: We used two datasets: (1) a large dataset (N = 193,311) of high-quality clinical chest radiographs, and (2) a dataset (N = 1625) of 3D abdominal computed tomography (CT) images, with the task of classifying the presence of pancreatic ductal adenocarcinoma (PDAC). Both were retrospectively collected and manually labeled by experienced radiologists. We then compared non-private deep convolutional neural networks (CNNs) and privacy-preserving (DP) models with respect to privacy-utility trade-offs, measured as area under the receiver operating characteristic curve (AUROC), and privacy-fairness trade-offs, measured as Pearson's r or Statistical Parity Difference.

Results: We find that, while the privacy-preserving training yields lower accuracy, it largely does not amplify discrimination against age, sex or co-morbidity. However, we find an indication that difficult diagnoses and subgroups suffer stronger performance hits in private training.

Conclusions: Our study shows that, under the challenging realistic circumstances of a real-life clinical dataset, the privacy-preserving training of diagnostic deep learning models is possible with excellent diagnostic accuracy and fairness.
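The two kinds of metric named in the Methods can be illustrated with a minimal Python sketch. This is not the authors' code: the function names, the toy data, the 0.5 decision threshold, and the sign convention for Statistical Parity Difference (positive-prediction rate inside a subgroup minus the rate outside it) are all assumptions made for illustration only.

```python
def auroc(labels, scores):
    """Rank-based AUROC: the fraction of (positive, negative) pairs in which
    the positive case receives the higher score; ties count as half."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative label")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def statistical_parity_difference(predictions, in_group):
    """Positive-prediction rate inside the subgroup minus the rate outside it.
    (Assumed convention; 0 means both groups are flagged equally often.)"""
    inside = [p for p, g in zip(predictions, in_group) if g]
    outside = [p for p, g in zip(predictions, in_group) if not g]
    if not inside or not outside:
        raise ValueError("both groups must be non-empty")
    return sum(inside) / len(inside) - sum(outside) / len(outside)


# Hypothetical toy data: 8 cases with true labels, model scores,
# and a binary subgroup flag (e.g. one sex or one age bracket).
labels = [1, 0, 1, 1, 0, 0, 1, 0]
scores = [0.9, 0.2, 0.7, 0.4, 0.6, 0.1, 0.8, 0.3]
in_group = [1, 0, 1, 0, 0, 0, 1, 0]

# Binarize scores at an assumed threshold of 0.5.
predictions = [1 if s >= 0.5 else 0 for s in scores]

print("AUROC:", auroc(labels, scores))
print("SPD:", statistical_parity_difference(predictions, in_group))
```

Computing both quantities once for a non-private model and once for a DP-trained model, on the same test cases, gives the kind of privacy-utility and privacy-fairness comparison the abstract describes.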

Funder

Bundesministerium für Bildung, Wissenschaft, Forschung und Technologie

The Bavarian State Ministry for Science and the Arts through the Munich Centre for Machine Learning.

Bundesministerium für Bildung und Forschung

Deutsches Konsortium für Translationale Krebsforschung

ERC Grant Deep4MI

EC | Horizon 2020 Framework Programme

Publisher

Springer Science and Business Media LLC

Link

https://www.nature.com/articles/s43856-024-00462-6.pdf

References: 57 articles.

1. Usynin, D. et al. Adversarial interference and its mitigations in privacy-preserving collaborative machine learning. Nat. Mach. Intell. 3, 749–758 (2021).

2. Konečný, J., McMahan, H. B., Ramage, D. & Richtárik, P. Federated optimization: Distributed machine learning for on-device intelligence. arXiv preprint arXiv:1610.02527 (2016).

3. Konečný, J. et al. Federated learning: Strategies for improving communication efficiency. arXiv preprint arXiv:1610.05492 (2016).

4. McMahan, B., Moore, E., Ramage, D., Hampson, S. & y Arcas, B. A. Communication-efficient learning of deep networks from decentralized data. In Artificial Intelligence and Statistics, 1273–1282 (PMLR, 2017).

5. Truhn, D. et al. Encrypted federated learning for secure decentralized collaboration in cancer image analysis. Med. Image Anal. (2024). https://doi.org/10.1016/j.media.2023.103059.

Cited by 4 articles.

1. Privacy preserving technology in ophthalmology;Current Opinion in Ophthalmology;2024-08-27

2. Shielding sensitive medical imaging data;Nature Machine Intelligence;2024-07-11

3. Application of ChatGPT as a support tool in the diagnosis and management of acute bacterial tonsillitis;Health and Technology;2024-04-11

4. Enhancing Privacy and Preserving Accuracy in Medical Image Classification with Limited Labeled Samples;Lecture Notes in Computer Science;2024