Capturing Discriminative Information Using a Deep Architecture in Acoustic Scene Classification

Author:

Shim Hye-jin,Jung Jee-weonORCID,Kim Ju-ho,Yu Ha-jinORCID

Abstract

Acoustic scene classification contains frequently misclassified pairs of classes that share many common acoustic properties. Specific details can provide vital clues for distinguishing such pairs of classes. However, these details are generally not noticeable and are hard to generalize for different data distributions. In this study, we investigate various methods for capturing discriminative information and simultaneously improve the generalization ability. We adopt a max feature map method that replaces conventional non-linear activation functions in deep neural networks; therefore, we apply an element-wise comparison between the different filters of a convolution layer’s output. Two data augmentation methods and two deep architecture modules are further explored to reduce overfitting and sustain the system’s discriminative power. Various experiments are conducted using the “detection and classification of acoustic scenes and events 2020 task1-a” dataset to validate the proposed methods. Our results show that the proposed system consistently outperforms the baseline, where the proposed system demonstrates an accuracy of 70.4% compared to the baseline at 65.1%.

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science

Reference33 articles.

1. Proceedings of the Detection and Classification of Acoustic Scenes and Events 2018 Workshop (DCASE2018), Surrey, UK, 19–20 November 2018;Plumbley,2018

2. Proceedings of the Detection and Classification of Acoustic Scenes and Events 2019 Workshop (DCASE2019), New York, NY, USA, 25–26 October 2019;Mandel,2019

3. Robust acoustic scene classification using a multi-spectrogram encoder-decoder framework

4. Knowledge Distillation in Acoustic Scene Classification

Cited by 5 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Acoustic Scene Classification Using Deep C-RNN Based on Log Mel Spectrogram and Gammatone Frequency Cepstral Coefficients Features;2024 3rd International Conference on Artificial Intelligence For Internet of Things (AIIoT);2024-05-03

2. A Genetic Algorithm Approach to Automate Architecture Design for Acoustic Scene Classification;IEEE Transactions on Evolutionary Computation;2023-04

3. Multi-Feature Convergence Network for Acoustic Scene Classification;Proceedings of the 2022 6th International Conference on Electronic Information Technology and Computer Engineering;2022-10-21

4. Semi-Supervised Domain Adaptation for Acoustic Scene Classification by Minimax Entropy and Self-Supervision Approaches;2022 International Workshop on Acoustic Signal Enhancement (IWAENC);2022-09-05

5. Attentive Max Feature Map and Joint Training for Acoustic Scene Classification;ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP);2022-05-23

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3