Abstract
In recent years, Federated Learning (FL) has gained traction as a privacy-centric approach in medical imaging. This study explores the challenges posed by data heterogeneity on FL algorithms, using the COVIDx CXR-3 dataset as a case study. We contrast the performance of the Federated Averaging (FedAvg) algorithm on non-identically and independently distributed (non-IID) data against identically and independently distributed (IID) data. Our findings reveal a notable performance decline with increased data heterogeneity, emphasizing the need for innovative strategies to enhance FL in diverse environments. This research contributes to the practical implementation of FL, extending beyond theoretical concepts and addressing the nuances in medical imaging applications. This research uncovers the inherent challenges in FL due to data diversity. It sets the stage for future advancements in FL strategies to effectively manage data heterogeneity, especially in sensitive fields like healthcare.
Publisher
Public Library of Science (PLoS)
Reference47 articles.
1. Client Selection in Federated Learning: Principles, Challenges, and Opportunities;L Fu;IEEE Internet of Things Journal,2023
2. Blockchain-empowered federated learning: Challenges, solutions, and future directions;J Zhu;ACM Computing Surveys,2023
3. Hospital patients’ length of stay prediction: A federated learning approach;MM Rahman;Journal of King Saud University-Computer and Information Sciences,2022
4. A contemplative perspective on federated machine learning: Taxonomy, threats & vulnerability assessment and challenges;D Jatain;Journal of King Saud University-Computer and Information Sciences,2022
5. A comprehensive survey of privacy-preserving federated learning: A taxonomy, review, and future directions;X Yin;ACM Computing Surveys (CSUR),2021