Federated Learning Using Multi-Modal Sensors with Heterogeneous Privacy Sensitivity Levels-Reference-Cited by-同舟云学术

Federated Learning Using Multi-Modal Sensors with Heterogeneous Privacy Sensitivity Levels

Published:2024-08-05 Issue: Volume: Page:
ISSN:1551-6857
Container-title:ACM Transactions on Multimedia Computing, Communications, and Applications
language:en
Short-container-title:ACM Trans. Multimedia Comput. Commun. Appl.

Author:

Hsu Chih-Fan¹^ORCID,Li Yi-Chen²^ORCID,Tsai Chung-Chi³^ORCID,Wang Jian-Kai⁴^ORCID,Hsu Cheng-Hsin²^ORCID

Affiliation:

1. National Yang Ming Chiao Tung University, Taiwan

2. National Tsing Hua University, Taiwan

3. Qualcomm Technologies, Inc., USA

4. Qualcomm Technologies, Inc., Taiwan

Abstract

Data from multi-modal sensors, such as RGB cameras, thermal cameras, microphones, and mmWave radars, have gradually been adopted in various classification problems for better accuracy. Some sensors, like RGB cameras and microphones, however, capture privacy-invasive data, which are less likely to be used in centralized learning. Although the Federated Learning (FL) paradigm frees clients from sharing their sensor data, doing so results in reduced classification accuracy and increased training time. In this article, we introduce a novel Heterogeneous Privacy Federated Learning (HPFL) paradigm to better capitalize on the less privacy-invasive sensor data, such as thermal images and mmWave point clouds, by uploading them to the server for closing the performance gap between FL and centralized learning. HPFL not only allows clients to keep the more privacy-invasive sensor data private, such as RGB images and human voices, but also gives each client total freedom to define the levels of their privacy concern on individual sensor modalities. For example, more sensitive users may prefer to keep their thermal images private, while others do not mind sharing these images. We carry out extensive experiments to evaluate the HPFL paradigm using two representative classification problems: semantic segmentation and emotion recognition. Several key findings demonstrate the merits of HPFL: (i) compared to FedAvg, it improves foreground accuracy by 18.20% in semantic segmentation and boosts the F1-score by 4.20% in emotion recognition, (ii) with heterogeneous privacy concern levels, it achieves an even larger F1-score improvement of 6.17–16.05% in emotion recognition, and (iii) it also outperforms the state-of-the-art FL approaches by 12.04–17.70% in foreground accuracy and 2.54–4.10% in F1-score.

Publisher

Association for Computing Machinery (ACM)

Link

https://dl.acm.org/doi/pdf/10.1145/3686801

Reference87 articles.

1. Durmus Acar, Yue Zhao, Ramon Matas, Matthew Mattina, Paul Whatmough, and Venkatesh Saligrama. 2020. Federated learning based on dynamic regularization. In ICLR. IEEE.

2. Network information flow

3. Panagiotis Antoniadis, Ioannis Pikoulis, Panagiotis Filntisis, and Petros Maragos. 2021. An audiovisual and contextual approach for categorical and continuous emotion recognition in-the-wild. In ICCV. IEEE/CVF, 3645–3651.

4. Multimodal Machine Learning: A Survey and Taxonomy

5. Decentralised Learning in Federated Deployment Environments