Affiliation:
1. Department of Marine Technology, Ocean University of China, Qingdao 266100, China
Abstract
In indoor environments, reverberation can distort the signalseceived by active noise cancelation devices, posing a challenge to sound classification. Therefore, we combined three speech spectral features based on different frequency scales into a densely connected network (DenseNet) to accomplish sound classification with reverberation effects. We adopted the DenseNet structure to make the model lightweight A dataset was created based on experimental and simulation methods, andhe classification goal was to distinguish between music signals, song signals, and speech signals. Using this framework, effectivexperiments were conducted. It was shown that the classification accuracy of the approach based on DenseNet and fused features reached 95.90%, betterhan the results based on other convolutional neural networks (CNNs). The size of the optimized DenseNet model is only 3.09 MB, which is only 7.76% of the size before optimization. We migrated the model to the Android platform. The modified model can discriminate sound clips faster on Android thanhe network before the modification. This shows that the approach based on DenseNet and fused features can dealith sound classification tasks in different indoor scenes, and the lightweight model can be deployed on embedded devices.
Funder
National Natural Science Foundation of China
Subject
Electrical and Electronic Engineering,Biochemistry,Instrumentation,Atomic and Molecular Physics, and Optics,Analytical Chemistry
Reference32 articles.
1. Tawara, N., Ogawa, A., Iwata, T., Delcroix, M., and Ogawa, T. (2020, January 4–8). Frame-level phoneme-invariant speaker embedding for text-independent speaker recognition on extremely short utterances. Proceedings of the ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Barcelona, Spain.
2. Environmental sound classification based on adding noise;Zhao;Proceedings of the 2021 IEEE 2nd International Conference on Information Technology, Big Data and Artificial Intelligence (ICIBA),2021
3. Liang, B., and Gu, M. (2020, January 6–8). Music genre classification using transfer learning. Proceedings of the 2020 IEEE Conference on Multimedia Information Processing and Retrieval (MIPR), Shenzhen, China.
4. Heart sounds classification based on feature fusion using lightweight neural networks;Li;IEEE Trans. Instrum. Meas.,2021
5. Respiratory Sound Classification: From Fluid-Solid Coupling Analysis to Feature-Band Attention;Tong;IEEE Access,2022