Affiliation:
1. Eindhoven University of Technology, Eindhoven, The Netherlands
Abstract
Federated Learning is a distributed machine learning paradigm dealing with decentralized and personal datasets. Since data reside on devices such as smartphones and virtual assistants, labeling is entrusted to the clients or labels are extracted in an automated way. Specifically, in the case of audio data, acquiring semantic annotations can be prohibitively expensive and time-consuming. As a result, an abundance of audio data remains unlabeled and unexploited on users’ devices. Most existing federated learning approaches focus on supervised learning without harnessing the unlabeled data. In this work, we study the problem of semi-supervised learning of audio models via self-training in conjunction with federated learning. We propose
FedSTAR
to exploit large-scale on-device unlabeled data to improve the generalization of audio recognition models. We further demonstrate that self-supervised pre-trained models can accelerate the training of on-device models, significantly improving convergence within fewer training rounds. We conduct experiments on diverse public audio classification datasets and investigate the performance of our models under varying percentages of labeled and unlabeled data. Notably, we show that with as little as 3% labeled data available,
FedSTAR
on average can improve the recognition rate by 13.28% compared to the fully supervised federated model.
Publisher
Association for Computing Machinery (ACM)
Subject
Hardware and Architecture,Software
Reference46 articles.
1. Pseudo-Labeling and Confirmation Bias in Deep Semi-Supervised Learning
2. David Berthelot Nicholas Carlini Ian Goodfellow Nicolas Papernot Avital Oliver and Colin Raffel. 2019. MixMatch: A Holistic Approach to Semi-Supervised Learning. arXiv:1905.02249 [cs.LG].
3. Flower: A friendly federated learning research framework;Beutel Daniel J.;arXiv preprint arXiv:2007.14390,2020
4. Contactless cardiac arrest detection using smart devices;Chan Justin;NPJ Dig. Med.,2019
5. Audio Surveillance of Roads: A System for Detecting Anomalous Sounds
Cited by
6 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献