Affiliation:
1. Intelligent Systems Group, School of Computing, SASTRA University, Tamil Nadu, India
2. Velammal Engineering College, Tamil Nadu, India
Abstract
Monitoring of human and social activities is becoming increasingly pervasive in our living environment for public security and safety applications. The recognition of suspicious events is important in both indoor and outdoor environments, such as child-care centers, smart-homes, old-age homes, residential areas, office environments, elevators, and smart cities. Environmental audio scene and sound event recognition are the fundamental tasks involved in many audio surveillance applications. Although numerous approaches have been proposed, robust environmental audio surveillance remains a huge challenge due to various reasons, such as various types of overlapping audio sounds, background noises, and lack of universal and multi-modal datasets. The goal of this article is to review various features of representing audio scenes and sound events and provide appropriate machine learning algorithms for audio surveillance tasks. Benchmark datasets are categorized based on the real-world scenarios of audio surveillance applications. To have a quantitative understanding, some of the state-of-the-art approaches are evaluated based on two benchmark datasets for audio scenes and sound event recognition tasks. Finally, we outline the possible future directions for improving the recognition of environmental audio scenes and sound events.
Funder
Department of Science and Technology, Government of India
Cognitive Science Research Initiative
Publisher
Association for Computing Machinery (ACM)
Subject
General Computer Science,Theoretical Computer Science
Reference148 articles.
1. Convolutional Neural Networks for Speech Recognition
2. Spectrotemporal Analysis Using Local Binary Pattern Variants for Acoustic Scene Classification
3. Sharath Adavanne and Tuomas Virtanen. 2017. A report on sound event detection with different binaural features. Retrieved from: arXiv preprint arXiv:1710.02997. Sharath Adavanne and Tuomas Virtanen. 2017. A report on sound event detection with different binaural features. Retrieved from: arXiv preprint arXiv:1710.02997.
4. Optimization of amplitude modulation features for low-resource acoustic scene classification
Cited by
60 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献