FedCSS: Joint Client-and-Sample Selection for Hard Sample-Aware Noise-Robust Federated Learning-Reference-Cited by-同舟云学术

FedCSS: Joint Client-and-Sample Selection for Hard Sample-Aware Noise-Robust Federated Learning

Published:2023-11-13 Issue:3 Volume:1 Page:1-24
ISSN:2836-6573
Container-title:Proceedings of the ACM on Management of Data
language:en
Short-container-title:Proc. ACM Manag. Data

Author:

Li Anran¹^ORCID,Cao Yue¹^ORCID,Guo Jiabao²^ORCID,Peng Hongyi¹^ORCID,Guo Qing³^ORCID,Yu Han¹^ORCID

Affiliation:

1. Nanyang Technological University, Singapore, Singapore

2. Wuhan University, Wuhan, China

3. A*STAR, Singapore, China

Abstract

Federated Learning (FL) enables a large number of data owners (a.k.a. FL clients) to jointly train a machine learning model without disclosing private local data. The importance of local data samples to the FL model vary widely. This is exacerbated by the presence of noisy data, which exhibit large losses similar to important (hard) samples. Currently, there lacks an FL approach that can effectively distinguish hard samples (which are beneficial) from noisy samples (which are harmful). To bridge this gap, we propose the Federated Client and Sample Selection (FedCSS) approach. It is a bilevel optimization approach for FL client-and-sample selection to achieve hard sample-aware noise-robust learning in a privacy preserving manner. It performs meta-learning based online approximation to iteratively update global FL models, select the most positively influential samples and deal with training data noise. Theoretical analysis shows that it is guaranteed to converge in an efficient manner. Experimental comparison against six state-of-the-art baselines on five real-world datasets in the presence of data noise and heterogeneity shows that it achieves up to 26.4% higher test accuracy, while saving communication and computation costs by at least 41.5% and 1.2%, respectively.

Funder

National Satellite of Excellence in Trustworthy Software Systems, National University of Singapore

Nanyang Technological University, under SUG Grant

NRF Investigatorship

the Cyber Security Agency under its National Cybersecurity R&D Programme

National Research Foundation Singapore and DSO National Laboratories under the AI Singapore Programme

Publisher

Association for Computing Machinery (ACM)

Link

https://dl.acm.org/doi/pdf/10.1145/3617332

Reference66 articles.

1. Brendan McMahan , Eider Moore , and Ramage. Communication-efficient learning of deep networks from decentralized data . In ICML , 2017 . Brendan McMahan, Eider Moore, and Ramage. Communication-efficient learning of deep networks from decentralized data. In ICML, 2017.

2. Skellam mixture mechanism

3. Anran Li , Hongyi Peng , Lan Zhang , Jiahui Huang , Qing Guo , Han Yu , and Yang Liu . Fedsdg-fs: Efficient and secure feature selection for vertical federated learning. arXiv preprint arXiv:2302.10417 , 2023 . Anran Li, Hongyi Peng, Lan Zhang, Jiahui Huang, Qing Guo, Han Yu, and Yang Liu. Fedsdg-fs: Efficient and secure feature selection for vertical federated learning. arXiv preprint arXiv:2302.10417, 2023.

4. GFL: Federated Learning on Non-IID Data via Privacy-Preserving Synthetic Data

5. Andrew Hard , Kanishka Rao , Rajiv Mathews , and Ramaswamy. Federated learning for mobile keyboard prediction . DeepAI , 2018 . Andrew Hard, Kanishka Rao, Rajiv Mathews, and Ramaswamy. Federated learning for mobile keyboard prediction. DeepAI, 2018.

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. FedCross: Towards Accurate Federated Learning via Multi-Model Cross-Aggregation;2024 IEEE 40th International Conference on Data Engineering (ICDE);2024-05-13