Employing Robust Principal Component Analysis for Noise-Robust Speech Feature Extraction in Automatic Speech Recognition with the Structure of a Deep Neural Network

Published: 2018-08-15, Issue: 3, Volume: 1, Page: 28
ISSN: 2571-5577
Container-title: Applied System Innovation
Language: en
Short-container-title: ASI

Author:

Hung Jeih-weih, Lin Jung-Shan, Wu Po-Jen

Abstract

In recent decades, researchers have focused on developing noise-robust methods that compensate for noise effects in automatic speech recognition (ASR) systems and enhance their performance. In this paper, we propose a feature-based noise-robust method that employs a novel data analysis technique, robust principal component analysis (RPCA). In the proposed scenario, RPCA is applied to a noise-corrupted speech feature matrix, and the resulting sparse partition is shown to reveal speech-dominant characteristics. One apparent advantage of using RPCA to enhance noise robustness is that no prior knowledge about the noise is required. The proposed RPCA-based method is evaluated on the Aurora-4 database and task, using a state-of-the-art deep neural network (DNN) architecture for the acoustic models. The evaluation results indicate that the proposed method significantly improves the recognition accuracy obtained with the original speech features, and that it can be cascaded with mean normalization (MN), mean and variance normalization (MVN), and relative spectral (RASTA) processing, three well-known and widely used feature robustness algorithms, to achieve better performance than each of these methods used alone.
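To make the decomposition step concrete, the following is a minimal sketch of one common RPCA formulation, principal component pursuit, which splits a matrix M into a low-rank part L and a sparse part S by minimizing ||L||_* + lambda * ||S||_1 subject to M = L + S. The abstract does not say which solver, parameter values, or feature-matrix layout the authors use, so everything here is an assumption for illustration: the function names `soft_threshold`, `rpca_pcp`, and `mean_variance_normalize` are hypothetical, the ADMM-style iteration and the default choices of `lam` and `mu` follow widely used conventions, and the matrix is assumed to hold feature dimensions in rows and frames in columns.

```python
import numpy as np


def soft_threshold(x, tau):
    """Element-wise shrinkage: move every entry toward zero by tau."""
    return np.sign(x) * np.maximum(np.abs(x) - tau, 0.0)


def rpca_pcp(M, lam=None, mu=None, tol=1e-7, max_iter=500):
    """Split M into low-rank L and sparse S with M ~= L + S.

    Illustrative ADMM-style solver for principal component pursuit.
    The defaults for lam and mu are conventional choices, not values
    taken from the paper.
    """
    M = np.asarray(M, dtype=float)
    m, n = M.shape
    if lam is None:
        lam = 1.0 / np.sqrt(max(m, n))
    if mu is None:
        mu = m * n / (4.0 * np.abs(M).sum() + 1e-12)

    S = np.zeros_like(M)
    Y = np.zeros_like(M)  # scaled Lagrange multiplier for M = L + S
    norm_M = np.linalg.norm(M, "fro") + 1e-12

    for _ in range(max_iter):
        # Low-rank update: shrink the singular values.
        U, sig, Vt = np.linalg.svd(M - S + Y / mu, full_matrices=False)
        L = (U * soft_threshold(sig, 1.0 / mu)) @ Vt
        # Sparse update: shrink the individual entries.
        S = soft_threshold(M - L + Y / mu, lam / mu)
        # Multiplier update on the constraint residual.
        R = M - L - S
        Y = Y + mu * R
        if np.linalg.norm(R, "fro") <= tol * norm_M:
            break
    return L, S


def mean_variance_normalize(F, eps=1e-8):
    """MVN along time: zero mean, unit variance per feature row."""
    mean = F.mean(axis=1, keepdims=True)
    std = F.std(axis=1, keepdims=True)
    return (F - mean) / (std + eps)
```

Under these assumptions, a noisy feature matrix `F` would be processed as `L, S = rpca_pcp(F)`, with `S` playing the role of the sparse partition described in the abstract. Applying `mean_variance_normalize(S)` afterwards is one possible cascade order; the abstract does not state the order the authors use.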

Publisher

MDPI AG

Subject

Artificial Intelligence, Applied Mathematics, Industrial and Manufacturing Engineering, Human-Computer Interaction, Information Systems, Control and Systems Engineering

Link

http://www.mdpi.com/2571-5577/1/3/28/pdf

References: 37 articles.

1. Perceptual linear predictive (PLP) analysis of speech

2. Springer Handbook of Speech Processing;Benesty,2008

3. Suppression of acoustic noise in speech using spectral subtraction

Cited by 9 articles.

1. Improved wireless acoustic sensor network for analysing audio properties;International Journal of Information Technology;2023-08-22

2. Wavelet-Based Weighted Low-Rank Sparse Decomposition Model for Speech Enhancement Using Gammatone Filter Bank Under Low SNR Conditions;Fluctuation and Noise Letters;2023-03-10

3. Spectrogram-based classification on vehicles with modified loud exhausts via convolutional neural networks;Applied Acoustics;2023-03

4. Content-based encrypted speech retrieval scheme with deep hashing;Multimedia Tools and Applications;2022-02-14

5. Speech Recognition using EfficientNet;Proceedings of the 2020 5th International Conference on Multimedia Systems and Signal Processing;2020-05-28