Improving Semi-Supervised Learning for Audio Classification with FixMatch-Reference-Cited by-同舟云学术

Improving Semi-Supervised Learning for Audio Classification with FixMatch

Published:2021-07-28 Issue:15 Volume:10 Page:1807
ISSN:2079-9292
Container-title:Electronics
language:en
Short-container-title:Electronics

Author:

Grollmisch Sascha^ORCID,Cano Estefanía^ORCID

Abstract

Including unlabeled data in the training process of neural networks using Semi-Supervised Learning (SSL) has shown impressive results in the image domain, where state-of-the-art results were obtained with only a fraction of the labeled data. The commonality between recent SSL methods is that they strongly rely on the augmentation of unannotated data. This is vastly unexplored for audio data. In this work, SSL using the state-of-the-art FixMatch approach is evaluated on three audio classification tasks, including music, industrial sounds, and acoustic scenes. The performance of FixMatch is compared to Convolutional Neural Networks (CNN) trained from scratch, Transfer Learning, and SSL using the Mean Teacher approach. Additionally, a simple yet effective approach for selecting suitable augmentation methods for FixMatch is introduced. FixMatch with the proposed modifications always outperformed Mean Teacher and the CNNs trained from scratch. For the industrial sounds and music datasets, the CNN baseline performance using the full dataset was reached with less than 5% of the initial training data, demonstrating the potential of recent SSL methods for audio data. Transfer Learning outperformed FixMatch only for the most challenging dataset from acoustic scene classification, showing that there is still room for improvement.

Funder

Deutsche Forschungsgemeinschaft

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Computer Networks and Communications,Hardware and Architecture,Signal Processing,Control and Systems Engineering

Link

https://www.mdpi.com/2079-9292/10/15/1807/pdf

Reference44 articles.

1. MixMatch: A Holistic Approach to Semi-Supervised Learning;Berthelot,2019

2. ReMixMatch: Semi-Supervised Learning with Distribution Alignment and Augmentation Anchoring;Berthelot;arXiv,2019

3. FixMatch: Simplifying Semi-Supervised Learning with Consistency and Confidence;Sohn,2020

4. Meta Pseudo Labels;Pham;arXiv,2020

Cited by 19 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A holistic semi-supervised method for imbalanced fault diagnosis of rotational machinery with out-of-distribution samples;Reliability Engineering & System Safety;2024-10

2. Visual and audio scene classification for detecting discrepancies in video: a baseline method and experimental protocol;3rd ACM International Workshop on Multimedia AI against Disinformation;2024-06-10

3. A novel extension of FixMatch using uncertainty for semi-supervised audio classification;Science Talks;2024-06

4. Unleashing potentials with deep learning: decoding the complex events for distributed fiber optic sensing applications;Science China Information Sciences;2024-04-22

5. Acoustic scene classification: A comprehensive survey;Expert Systems with Applications;2024-03