Affiliation:
1. School of Computer Science and Engineering, Vellore Institute of Technology, Vellore, Tamilnadu, India
2. Petroleum-Gas University of Ploiesti, Ploiesti, Romania
Abstract
In the recent time, enviromental sound classification has received much popularity. This area of research comes under domain of non-speech audio classification. In this work, we have proposed a dilated Convolutional Neural Network approch to classify urban sound. We have carried out feature extraction, data augmentation techniques to carry out our experimental strategy smoothly. We also found out the activation maps of each layers of dilated convolution neural network. An increamental dilation rate has exploited Overall we achieved 84.16% of accuracy from the proposed dilated convolutional method. The gradual increaments of dilation rate has exploited the worse effect of grindding and has lowered down the computational cost. Also, overall classification performance, precision, recall,overall truth and kappa value have been obtained from our proposed method. We have considered 10 fold cross validation for the implementation of the dilated CNN model.
Subject
Artificial Intelligence,General Engineering,Statistics and Probability
Reference31 articles.
1. Environmental sound classification using a regularized deep convolutional neural network with data augmentation;Mushtaq;Applied Acoustics,2020
2. Speaker recognition with hybrid features from a deep belief network;Ali;Neural Comput Appl,2018
3. A new pyramidal concatenated CNN approach for environmental sound classification;Demir;Applied Acoustics,2020
4. Environmental sound recognition with timefrequency audio features;Chu;IEEE Trans Audio Speech Lang Process,2009
5. Radhakrishnan R. , Divakaran A. and Smaragdis P. , Audio analysis for surveillance applications. In Proc. IEEE Workshop Appl Signal Process. Audio Acoust., New Paltz, USA; (2005), pp. 158–161.
Cited by
11 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献