Abstract
Abstract
In this paper, a sound event detection measure is proposed. This measure is based on convolutional neural networks with overlapping pooling structure Different from the traditional GMM-HMM model and DNN-HMM model, the CNN model uses the convolutional layer which can speed up training by reducing training parameters. In this paper, the extracted sound feature is the mel-frequency cepstrum coefficient (MFCC). The dropout layer is added to the convolutional layer. Over-fitting can decrease the accuracy of the detection, dropout layer can prevent the model from over-fitting. Moreover, the overlapping pooling structure is used in CNN, the stride size is smaller than the pooling kernel size. The output of pooling layer has overlapping parameters, which can increase the richness of features. The final experimental results show that the precision of the proposed CNN model more robust than the GMM-HMM model and baseline model.
Subject
General Physics and Astronomy
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献