Sound Event Detection Based on Convolutional Neural Networks with Overlapping Pooling Structure

Published:2021-05-01 Issue:1 Volume:1924 Page:012008
ISSN:1742-6588
Container-title:Journal of Physics: Conference Series
Short-container-title:J. Phys.: Conf. Ser.

Author:

Zhu Hang, Wan Hongjie

Abstract

In this paper, a sound event detection method is proposed, based on convolutional neural networks (CNNs) with an overlapping pooling structure. Unlike the traditional GMM-HMM and DNN-HMM models, the CNN model uses convolutional layers, which can speed up training by reducing the number of trainable parameters. The extracted sound features are mel-frequency cepstral coefficients (MFCCs). Because over-fitting can decrease detection accuracy, a dropout layer is added to the convolutional layer to prevent the model from over-fitting. Moreover, an overlapping pooling structure is used in the CNN: the stride is smaller than the pooling kernel size, so adjacent pooling windows overlap and the output of the pooling layer carries overlapping information, which can increase the richness of the features. The final experimental results show that, in terms of precision, the proposed CNN model is more robust than the GMM-HMM model and the baseline model.
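
The overlapping pooling idea in the abstract (stride smaller than the pooling kernel size) can be illustrated with a minimal sketch. The abstract does not state the kernel sizes or strides used in the paper, so the values below, the helper name `max_pool_1d`, and the sample input are assumptions chosen only for illustration. The sketch uses 1-D max pooling in plain Python rather than the paper's actual CNN layers.

```python
def max_pool_1d(x, kernel_size, stride):
    """Max pooling over a 1-D sequence with no padding (hypothetical helper)."""
    # Number of full windows that fit in the input.
    n_out = (len(x) - kernel_size) // stride + 1
    return [max(x[i * stride : i * stride + kernel_size]) for i in range(n_out)]


# Illustrative input, not data from the paper.
x = [1, 2, 5, 3, 4, 0, 6]

# Non-overlapping pooling: stride equals kernel size, each input is used once.
non_overlapping = max_pool_1d(x, kernel_size=2, stride=2)

# Overlapping pooling: stride is smaller than kernel size, so neighbouring
# windows share one input element (index 2 and index 4 are each seen twice).
overlapping = max_pool_1d(x, kernel_size=3, stride=2)

print(non_overlapping)  # [2, 5, 4]
print(overlapping)      # [5, 5, 6]
```

With a kernel of 3 and a stride of 2, each pair of adjacent windows shares one element, which is the sense in which the pooled outputs overlap.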

Publisher

IOP Publishing

Subject

General Physics and Astronomy

Link

https://iopscience.iop.org/article/10.1088/1742-6596/1924/1/012008/pdf

References (10 articles)

1. Acoustic monitoring and localization for social care;Goetze;J. Comput. Sci. Eng.,2012

2. Feature learning with deep scattering for urban sound analysis;Salamon,2015

3. Discriminative Learning for Speech Recognition: Theory and Practice;He,2008

4. Bearing Fault Diagnosis Method Based on GMM and Coupled Hidden Markov Model;Cao,2018

5. Investigating End-to-end Speech Recognition for Mandarin-English Code-switching;Shan

Cited by 1 article.

1. An Enhanced Deep Learning Method for Skin Cancer Detection and Classification;Computers, Materials & Continua;2022