Affiliation:
1. Artificial Intelligence Research Laboratory, Electronics and Telecommunications Research Institute, 218 Gajeong-ro, Daejeon 34129, Republic of Korea
2. Artificial Intelligence, University of Science and Technology, 217 Gajeong-ro, Daejeon 34113, Republic of Korea
Abstract
Numerous deep learning methods for acoustic scene classification (ASC) have been proposed to improve the classification accuracy of sound events. However, only a few studies have focused on continual learning (CL) wherein a model continually learns to solve issues with task changes. Therefore, in this study, we systematically analyzed the performance of ten recent CL methods to provide guidelines regarding their performances. The CL methods included two regularization-based methods and eight replay-based methods. First, we defined realistic and difficult scenarios such as online class-incremental (OCI) and online domain-incremental (ODI) cases for three public sound datasets. Then, we systematically analyzed the performance of each CL method in terms of average accuracy, average forgetting, and training time. In OCI scenarios, iCaRL and SCR showed the best performance for small buffer sizes, and GDumb showed the best performance for large buffer sizes. In ODI scenarios, SCR adopting supervised contrastive learning consistently outperformed the other methods, regardless of the memory buffer size. Most replay-based methods have an almost constant training time, regardless of the memory buffer size, and their performance increases with an increase in the memory buffer size. Based on these results, we must first consider GDumb/SCR for the continual learning methods for ASC.
Funder
Electronics and Telecommunications Research Institute
Subject
Electrical and Electronic Engineering,Biochemistry,Instrumentation,Atomic and Molecular Physics, and Optics,Analytical Chemistry
Reference57 articles.
1. Environmental sound classification using temporal-frequency attention based convolutional neural network;Mu;Sci. Rep.,2021
2. Environmental sound recognition: A survey;Chachada;APSIPA Trans. Signal Inf. Process.,2014
3. Sophiya, E., and Jothilakshmi, S. (2017). Proceedings of the International Conference on Computational Intelligence, Cyber Security, and Computational Models, Springer.
4. Abeßer, J. (2020). A Review of Deep Learning Based Methods for Acoustic Scene Classification. Appl. Sci., 10.
5. A real-time bird sound recognition system using a low-cost microcontroller;Masazade;Appl. Acoust.,2019