Sound Event Detection Using Derivative Features in Deep Neural Networks-Reference-Cited by-同舟云学术

Sound Event Detection Using Derivative Features in Deep Neural Networks

Published:2020-07-17 Issue:14 Volume:10 Page:4911
ISSN:2076-3417
Container-title:Applied Sciences
language:en
Short-container-title:Applied Sciences

Author:

Kwak Jin-Yeol,Chung Yong-Joo

Abstract

We propose using derivative features for sound event detection based on deep neural networks. As input to the networks, we used log-mel-filterbank and its first and second derivative features for each frame of the audio signal. Two deep neural networks were used to evaluate the effectiveness of these derivative features. Specifically, a convolutional recurrent neural network (CRNN) was constructed by combining a convolutional neural network and a recurrent neural networks (RNN) followed by a feed-forward neural network (FNN) acting as a classification layer. In addition, a mean-teacher model based on an attention CRNN was used. Both models had an average pooling layer at the output so that weakly labeled and unlabeled audio data may be used during model training. Under the various training conditions, depending on the neural network architecture and training set, the use of derivative features resulted in a consistent performance improvement by using the derivative features. Experiments on audio data from the Detection and Classification of Acoustic Scenes and Events 2018 and 2019 challenges indicated that a maximum relative improvement of 16.9% was obtained in terms of the F-score.

Funder

Ministry of Education, Science and Technology

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science

Link

https://www.mdpi.com/2076-3417/10/14/4911/pdf

Reference24 articles.

1. Audio Surveillance

Cited by 6 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Automatic Multiple Sounds Detection with Recurrent Neural Networks (LSTM);Lecture Notes in Networks and Systems;2024

2. A Survey of Sound Source Localization and Detection Methods and Their Applications;Sensors;2023-12-22

3. Data augmentation using Variational Autoencoders for improvement of respiratory disease classification;PLOS ONE;2022-08-12

4. Environment Detection Methods using Speech Signals-A Review;2022 International Conference on Computational Intelligence and Sustainable Engineering Solutions (CISES);2022-05-20

5. Hand-crafted versus learned representations for audio event detection;Multimedia Tools and Applications;2022-04-07