An Automatic Classification System for Environmental Sound in Smart Cities

Published: 2023-07-31
Journal: Sensors
Volume: 23, Issue: 15, Page: 6823
ISSN: 1424-8220
Language: en

Author:

Zhang Dongping¹, Zhong Ziyin¹, Xia Yuejian¹, Wang Zhutao¹, Xiong Wenbo²

Affiliation:

1. Key Laboratory of Electromagnetic Wave Information Technology and Metrology of Zhejiang Province, China Jiliang University, Hangzhou 310018, China

2. Hangzhou Aihua Intelligent Technology Co., Ltd., 359 Shuxin Road, Hangzhou 311100, China

Abstract

As “smart cities” continue to be promoted worldwide, how to combine them with modern technologies (Internet of Things, cloud computing, artificial intelligence) has become a hot topic. However, because environmental sound is non-stationary and urban noise interferes with it, a model with a single input struggles to extract features fully and reach ideal classification results, even with deep learning methods. To improve the recognition accuracy of environmental sound classification (ESC), we propose a dual-branch residual network (dual-resnet) based on feature fusion. For data pre-processing, we propose a loop-padding method that pads shorter clips so that they carry more useful information. To prevent overfitting, we also expand the dataset with time-frequency data augmentation. After all the original audio has been pre-processed uniformly, the dual-branch residual network automatically extracts frequency-domain features from the log-Mel spectrogram and the log-spectrogram. The two audio features are then fused to give a more comprehensive representation of the audio. Experimental results show that, compared with other models, the proposed method improves classification accuracy on the UrbanSound8k dataset to varying degrees.
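The two pre-processing ideas named in the abstract can be illustrated with a minimal sketch. The abstract does not give the details of either step, so everything below is an assumption: `loop_pad` assumes that loop-padding means repeating a short clip from its start until it reaches a fixed length, and `time_freq_mask` assumes that the time-frequency augmentation masks one random band of frequency rows and one band of time columns. The function names, parameters, and default widths are hypothetical and are not taken from the paper.

```python
import random
from typing import List, Optional, Sequence


def loop_pad(samples: Sequence[float], target_len: int) -> List[float]:
    """Repeat a short clip end to end until it fills target_len samples.

    Assumed behaviour: the paper's exact padding rule is not given in the abstract.
    """
    if not samples:
        raise ValueError("cannot loop-pad an empty clip")
    if len(samples) >= target_len:
        return list(samples[:target_len])  # long clips are truncated
    repeats = -(-target_len // len(samples))  # ceiling division
    return (list(samples) * repeats)[:target_len]


def time_freq_mask(
    spec: Sequence[Sequence[float]],
    max_freq: int = 2,
    max_time: int = 2,
    rng: Optional[random.Random] = None,
) -> List[List[float]]:
    """Zero one random band of frequency rows and one band of time columns.

    spec is indexed as spec[frequency_bin][time_frame]. This masking scheme
    is an assumption, not the augmentation confirmed by the paper.
    """
    rng = rng or random.Random()
    n_freq, n_time = len(spec), len(spec[0])
    out = [list(row) for row in spec]  # copy so the input is left untouched

    f_width = rng.randint(0, min(max_freq, n_freq))
    f_start = rng.randint(0, n_freq - f_width)
    t_width = rng.randint(0, min(max_time, n_time))
    t_start = rng.randint(0, n_time - t_width)

    for f in range(n_freq):
        for t in range(n_time):
            in_freq_band = f_start <= f < f_start + f_width
            in_time_band = t_start <= t < t_start + t_width
            if in_freq_band or in_time_band:
                out[f][t] = 0.0
    return out
```

For example, `loop_pad([1, 2, 3], 8)` repeats the three samples and cuts the result at eight, whereas zero-padding would fill the remaining five positions with silence. In a pipeline of the kind the abstract describes, padding would be applied to the waveform before the spectrograms are computed, and masking to each spectrogram afterwards.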

Funder

Key Research and Development Projects in Zhejiang Province

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering; Biochemistry; Instrumentation; Atomic and Molecular Physics, and Optics; Analytical Chemistry

Link

https://www.mdpi.com/1424-8220/23/15/6823/pdf

Cited by 3 articles.

1. Self-Adaptable Software for Pre-Programmed Internet Tasks: Enhancing Reliability and Efficiency;Applied Sciences;2024-08-05

2. Artificial Intelligence in Smart Cities—Applications, Barriers, and Future Directions: A Review;Smart Cities;2024-06-10

3. Noise Source Diagnosis Method Based on Transfer Path Analysis and Neural Network;Applied Sciences;2023-11-11