Wavelet-Based Weighted Low-Rank Sparse Decomposition Model for Speech Enhancement Using Gammatone Filter Bank Under Low SNR Conditions

Published:2023-03-10 Issue:02 Volume:22 Page:
ISSN:0219-4775
Container-title:Fluctuation and Noise Letters
language:en
Short-container-title:Fluct. Noise Lett.

Author:

Sridhar K. Venkata¹, Kumar T. Kishore¹

Affiliation:

1. National Institute of Technology, Warangal, Telangana 506004, India

Abstract

Estimating noise-related parameters in unsupervised speech enhancement (SE) techniques is challenging in low-SNR and non-stationary noise environments. Among recent SE approaches, the best results are achieved by partitioning the noisy speech spectrogram into a low-rank noise part and a sparse speech part. However, several limitations reduce the performance of these methods: the use of overlap-add in the STFT process, the noisy phase, inaccurate estimation of the low-rank component in nuclear norm minimization, and the Euclidean distance measure in the cost function. These can cause a loss of information in the reconstructed signal relative to clean speech. To address this, we propose a novel wavelet-based weighted low-rank sparse decomposition model for speech enhancement that incorporates a gammatone filter bank and the Kullback–Leibler divergence. The proposed framework differs from other strategies in that SE is carried out entirely in the time domain, without the need for noise estimation. Further, to reduce the word error rate, these algorithms were trained and tested on a typical automatic speech recognition module. The experimental findings indicate that the proposed cascaded model significantly improves on individual and traditional methods under low SNR conditions in terms of SDR, PESQ, STOI, SIG, BAK and OVL.
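
For readers unfamiliar with the low-rank plus sparse partition mentioned in the abstract, the sketch below shows the generic baseline idea only: robust PCA by nuclear norm minimization, solved with a fixed-penalty ADMM loop. It is not the authors' wavelet-based weighted model and uses neither a gammatone filter bank nor the Kullback–Leibler divergence. The function names (`soft_threshold`, `svd_threshold`, `rpca`) and the default values of `lam` and `mu` are assumptions chosen for illustration, following common robust PCA practice.

```python
import numpy as np


def soft_threshold(x, tau):
    """Element-wise shrinkage: the proximal operator of tau * L1 norm."""
    return np.sign(x) * np.maximum(np.abs(x) - tau, 0.0)


def svd_threshold(x, tau):
    """Shrink singular values by tau: the proximal operator of tau * nuclear norm."""
    u, s, vt = np.linalg.svd(x, full_matrices=False)
    return (u * soft_threshold(s, tau)) @ vt


def rpca(m, lam=None, mu=None, n_iter=500, tol=1e-7):
    """Split a nonzero matrix m into low + sparse (generic robust PCA sketch).

    Approximately solves  min ||low||_* + lam * ||sparse||_1
    subject to low + sparse = m.  In the speech setting of the abstract,
    m would be a magnitude spectrogram, low the noise part and sparse
    the speech part (an illustrative assumption, not the paper's model).
    """
    m = np.asarray(m, dtype=float)
    if lam is None:
        lam = 1.0 / np.sqrt(max(m.shape))      # common default weight
    if mu is None:
        mu = 0.25 * m.size / np.abs(m).sum()   # common default penalty
    low = np.zeros_like(m)
    sparse = np.zeros_like(m)
    dual = np.zeros_like(m)
    norm_m = np.linalg.norm(m)
    for _ in range(n_iter):
        # Update the low-rank part by singular value shrinkage.
        low = svd_threshold(m - sparse + dual / mu, 1.0 / mu)
        # Update the sparse part by element-wise shrinkage.
        sparse = soft_threshold(m - low + dual / mu, lam / mu)
        # Dual ascent on the constraint low + sparse = m.
        residual = m - low - sparse
        dual += mu * residual
        if np.linalg.norm(residual) <= tol * norm_m:
            break
    return low, sparse
```

Calling `rpca(spectrogram)` on a non-negative magnitude matrix returns the two components. The limitations listed in the abstract (noisy phase, overlap-add, rank estimation error, Euclidean cost) apply to exactly this kind of spectrogram-domain baseline.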

Publisher

World Scientific Pub Co Pte Ltd

Subject

General Physics and Astronomy, General Mathematics

Link

https://www.worldscientific.com/doi/pdf/10.1142/S0219477523500207
