Robust Audio Content Classification Using Hybrid-Based SMD and Entropy-Based VAD-Reference-Cited by-同舟云学术

Robust Audio Content Classification Using Hybrid-Based SMD and Entropy-Based VAD

Published:2020-02-06 Issue:2 Volume:22 Page:183
ISSN:1099-4300
Container-title:Entropy
language:en
Short-container-title:Entropy

Author:

Wang Kun-Ching^ORCID

Abstract

A robust approach for the application of audio content classification (ACC) is proposed in this paper, especially in variable noise-level conditions. We know that speech, music, and background noise (also called silence) are usually mixed in the noisy audio signal. Based on the findings, we propose a hierarchical ACC approach consisting of three parts: voice activity detection (VAD), speech/music discrimination (SMD), and post-processing. First, entropy-based VAD is successfully used to segment input signal into noisy audio and noise even if variable-noise level is happening. The determinations of one-dimensional (1D)-subband energy information (1D-SEI) and 2D-textural image information (2D-TII) are then formed as a hybrid feature set. The hybrid-based SMD is achieved because the hybrid feature set is input into the classification of the support vector machine (SVM). Finally, a rule-based post-processing of segments is utilized to smoothly determine the output of the ACC system. The noisy audio is successfully classified into noise, speech, and music. Experimental results show that the hierarchical ACC system using hybrid feature-based SMD and entropy-based VAD is successfully evaluated against three available datasets and is comparable with existing methods even in a variable noise-level environment. In addition, our test results with the VAD scheme and hybrid features also shows that the proposed architecture increases the performance of audio content discrimination.

Publisher

MDPI AG

Subject

General Physics and Astronomy

Link

https://www.mdpi.com/1099-4300/22/2/183/pdf

Reference85 articles.

1. Real-Time Robust Voice Activity Detection Using the Upper Envelope Weighted Entropy Measure and the Dual-Rate Adaptive Nonlinear Filter

2. Features for voice activity detection: a comparative analysis

3. Robust Voice Activity Detection Using Long-Term Signal Variability

4. Efficient voice activity detection algorithm using long-term spectral flatness measure

5. Deep Belief Networks Based Voice Activity Detection

Cited by 5 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Audio Scene Classification Based on Topic Modelling and Audio Events Using LDA and LSA;Lecture Notes in Electrical Engineering;2023

2. Simulation of Electronic Music Signal Identification Model Based on Big Data Algorithm;Learning and Analytics in Intelligent Systems;2023

3. Music Classification Method Using Big Data Feature Extraction and Neural Networks;Journal of Environmental and Public Health;2022-07-30

4. Noise invariant feature pooling for the internet of audio things;Multimedia Tools and Applications;2022-04-11

5. A Preprocessing Strategy for Denoising of Speech Data Based on Speech Segment Detection;Applied Sciences;2020-10-21