Abstract
The last decade has seen growing interest in environmental sound classification (ESC) due to the complexity and rich information content of ambient sounds. State-of-the-art ESC methods are based on transfer learning paradigms that often reuse representations learned on common image-classification problems. This paper examines the effectiveness of employing pre-trained convolutional neural networks (CNNs) for audio categorization and the feasibility of retraining them. The study investigated key hyper-parameters, such as the learning rate and number of epochs, together with the Adam, Adamax, and RMSprop optimizers, for several pre-trained models, including Inception, VGG, and ResNet. First, the raw sound signals were converted into an image-like representation (log-Mel spectrograms). The selected pre-trained models were then applied to the resulting spectrogram data. In addition, the effect of essential retraining factors on classification accuracy and processing time was investigated during CNN training. Various optimizers and hyper-parameter settings were used to evaluate the proposed method on the publicly available UrbanSound8K sound dataset. The proposed method achieves 97.25% and 95.5% accuracy on this dataset using the pre-trained DenseNet201 and ResNet50V2 CNN models, respectively.
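The following is a minimal sketch of the pipeline described in the abstract, assuming librosa for log-Mel spectrogram extraction and TensorFlow/Keras for the pre-trained DenseNet201 backbone; the input size, normalization, and learning rate shown here are illustrative assumptions, not the authors' exact settings.

# Sketch only: log-Mel spectrogram "images" fed to a pre-trained CNN for ESC.
import numpy as np
import librosa
import tensorflow as tf

def audio_to_logmel_image(path, sr=22050, n_mels=128, size=(224, 224)):
    """Convert a raw waveform into a 3-channel log-Mel 'image' for an ImageNet CNN."""
    y, sr = librosa.load(path, sr=sr)
    mel = librosa.feature.melspectrogram(y=y, sr=sr, n_mels=n_mels)
    log_mel = librosa.power_to_db(mel, ref=np.max)              # log-Mel spectrogram (dB)
    log_mel = (log_mel - log_mel.min()) / (log_mel.max() - log_mel.min() + 1e-8)
    img = tf.image.resize(log_mel[..., np.newaxis], size)       # resize to CNN input size
    return tf.repeat(img, 3, axis=-1)                           # replicate to 3 RGB channels

def build_transfer_model(num_classes=10):
    """Pre-trained DenseNet201 backbone with a new head (UrbanSound8K has 10 classes)."""
    base = tf.keras.applications.DenseNet201(weights="imagenet", include_top=False,
                                             input_shape=(224, 224, 3))
    x = tf.keras.layers.GlobalAveragePooling2D()(base.output)
    out = tf.keras.layers.Dense(num_classes, activation="softmax")(x)
    model = tf.keras.Model(base.input, out)
    # Adamax or RMSprop can be swapped in here, as in the optimizer comparison above.
    model.compile(optimizer=tf.keras.optimizers.Adam(learning_rate=1e-4),
                  loss="sparse_categorical_crossentropy", metrics=["accuracy"])
    return model

In this sketch the whole backbone is left trainable, so calling model.fit on the spectrogram images retrains the pre-trained weights end to end; freezing base.trainable = False would instead use the network purely as a fixed feature extractor.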
Funder
Supported by the Science and Technology Research Program of Chongqing Municipal Education Commission
Subject
Electrical and Electronic Engineering, Computer Networks and Communications, Hardware and Architecture, Signal Processing, Control and Systems Engineering
Cited by
7 articles.