Benchmarking Audio Signal Representation Techniques for Classification with Convolutional Neural Networks-Reference-Cited by-同舟云学术

Benchmarking Audio Signal Representation Techniques for Classification with Convolutional Neural Networks

Published:2021-05-14 Issue:10 Volume:21 Page:3434
ISSN:1424-8220
Container-title:Sensors
language:en
Short-container-title:Sensors

Author:

Sharan Roneel V.^ORCID,Xiong Hao,Berkovsky Shlomo

Abstract

Audio signal classification finds various applications in detecting and monitoring health conditions in healthcare. Convolutional neural networks (CNN) have produced state-of-the-art results in image classification and are being increasingly used in other tasks, including signal classification. However, audio signal classification using CNN presents various challenges. In image classification tasks, raw images of equal dimensions can be used as a direct input to CNN. Raw time-domain signals, on the other hand, can be of varying dimensions. In addition, the temporal signal often has to be transformed to frequency-domain to reveal unique spectral characteristics, therefore requiring signal transformation. In this work, we overview and benchmark various audio signal representation techniques for classification using CNN, including approaches that deal with signals of different lengths and combine multiple representations to improve the classification accuracy. Hence, this work surfaces important empirical evidence that may guide future works deploying CNN for audio signal classification purposes.

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Biochemistry,Instrumentation,Atomic and Molecular Physics, and Optics,Analytical Chemistry

Link

https://www.mdpi.com/1424-8220/21/10/3434/pdf

Reference83 articles.

1. A Weakly Supervised Learning Framework for Detecting Social Anxiety and Depression

2. Electronic control of a wheelchair guided by voice commands

3. A Comparative Survey of Feature Extraction and Machine Learning Methods in Diverse Acoustic Environments

4. Automatic Croup Diagnosis Using Cough Sound Recognition

Cited by 19 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Infant cry classification using an efficient graph structure and attention-based model;Kuwait Journal of Science;2024-07

2. TRespNET: A dual-route exploratory CNN model for pediatric adventitious respiratory sound identification;Biomedical Signal Processing and Control;2024-07

3. Utilizing CNN Architectures for Non-invasive Diagnosis of Speech Disorders;Lecture Notes in Networks and Systems;2024

4. BanglaBeats: A Comprehensive Dataset of Bengali Songs for Music Genre Classification Tasks;2023 26th International Conference on Computer and Information Technology (ICCIT);2023-12-13

5. Guided deep embedded clustering regularization for multifeature medical signal classification;Pattern Recognition;2023-11