Federated Learning on Clinical Benchmark Data: Performance Assessment-Reference-Cited by-同舟云学术

Federated Learning on Clinical Benchmark Data: Performance Assessment

Published:2020-10-26 Issue:10 Volume:22 Page:e20891
ISSN:1438-8871
Container-title:Journal of Medical Internet Research
language:en
Short-container-title:J Med Internet Res

Author:

Lee Geun Hyeong^ORCID,Shin Soo-Yong^ORCID

Abstract

Background Federated learning (FL) is a newly proposed machine-learning method that uses a decentralized dataset. Since data transfer is not necessary for the learning process in FL, there is a significant advantage in protecting personal privacy. Therefore, many studies are being actively conducted in the applications of FL for diverse areas. Objective The aim of this study was to evaluate the reliability and performance of FL using three benchmark datasets, including a clinical benchmark dataset. Methods To evaluate FL in a realistic setting, we implemented FL using a client-server architecture with Python. The implemented client-server version of the FL software was deployed to Amazon Web Services. Modified National Institute of Standards and Technology (MNIST), Medical Information Mart for Intensive Care-III (MIMIC-III), and electrocardiogram (ECG) datasets were used to evaluate the performance of FL. To test FL in a realistic setting, the MNIST dataset was split into 10 different clients, with one digit for each client. In addition, we conducted four different experiments according to basic, imbalanced, skewed, and a combination of imbalanced and skewed data distributions. We also compared the performance of FL to that of the state-of-the-art method with respect to in-hospital mortality using the MIMIC-III dataset. Likewise, we conducted experiments comparing basic and imbalanced data distributions using MIMIC-III and ECG data. Results FL on the basic MNIST dataset with 10 clients achieved an area under the receiver operating characteristic curve (AUROC) of 0.997 and an F1-score of 0.946. The experiment with the imbalanced MNIST dataset achieved an AUROC of 0.995 and an F1-score of 0.921. The experiment with the skewed MNIST dataset achieved an AUROC of 0.992 and an F1-score of 0.905. Finally, the combined imbalanced and skewed experiment achieved an AUROC of 0.990 and an F1-score of 0.891. The basic FL on in-hospital mortality using MIMIC-III data achieved an AUROC of 0.850 and an F1-score of 0.944, while the experiment with the imbalanced MIMIC-III dataset achieved an AUROC of 0.850 and an F1-score of 0.943. For ECG classification, the basic FL achieved an AUROC of 0.938 and an F1-score of 0.807, and the imbalanced ECG dataset achieved an AUROC of 0.943 and an F1-score of 0.807. Conclusions FL demonstrated comparative performance on different benchmark datasets. In addition, FL demonstrated reliable performance in cases where the distribution was imbalanced, skewed, and extreme, reflecting the real-life scenario in which data distributions from various hospitals are different. FL can achieve high performance while maintaining privacy protection because there is no requirement to centralize the data.

Publisher

JMIR Publications Inc.

Subject

Health Informatics

Reference49 articles.

1. A Study on Global Data Leaks in 2018InfoWatch Analytics Center2020-10-13https://infowatch.com/sites/default/files/report/analytics/Global_Data_Breaches_2018.pdf

2. Various Database Attacks and its Prevention Techniques

3. Evaluating the Risk of Re-identification of Patients from Hospital Prescription Records

Cited by 81 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A Survey of Blockchain Applicability, Challenges, and Key Threats;Computers;2024-09-06

2. Privacy-by-design generation of two virtual clinical trials in multiple sclerosis and their release as open datasets;2024-09-02

3. Toward Free-Riding Attack on Cross-Silo Federated Learning Through Evolutionary Game;2024 IEEE 44th International Conference on Distributed Computing Systems (ICDCS);2024-07-23

4. Retina Fundus Photograph-Based Artificial Intelligence Algorithms in Medicine: A Systematic Review;Ophthalmology and Therapy;2024-06-24

5. A multifaceted survey on privacy preservation of federated learning: progress, challenges, and opportunities;Artificial Intelligence Review;2024-06-21