A Modified MFCC for Improved Wavelet-Based Denoising on Robust Speech Recognition

Published: 2021-02-28 Issue: 1 Volume: 14 Page: 12-21
ISSN: 2185-3118
Container-title: International Journal of Intelligent Engineering and Systems
Short-container-title: IJIES

Author:

Hidayat Risanuri, Winursito Anggun

Abstract

Current research on speech recognition aims at noise-resistant systems. Mel Frequency Cepstral Coefficients (MFCC) are a popular feature extraction method in speech recognition, but their sensitivity to noise interference is a weakness, and it motivates the robust speech recognition system developed in this paper. The development improves denoising performance using a wavelet transform. The modifications were derived by analyzing the weakness of the wavelet denoising process in a recognition system that uses the MFCC method. The analysis was conducted at one of the MFCC stages, the Fast Fourier Transform (FFT) stage. The proposed method applies wavelet denoising only to the noise-related data identified by the analysis of the FFT stage. The study used speech data consisting of eleven isolated English words mixed with noise of several different characteristics. Results showed that the proposed method achieved better accuracy than conventional wavelet denoising methods at signal-to-noise ratios (SNR) of 10 dB, 15 dB, and 20 dB using a Fejer-Korovkin 6 wavelet. The largest accuracy increase was at an SNR of 15 dB, with a rise of 4.63%, followed by a 3.96% increase at 20 dB and 2.3% at 10 dB. The performance of the proposed method was then compared with other methods. The results show that the proposed method has the best performance on clean speech and on noisy speech at SNRs of 10 dB, 15 dB, and 20 dB.
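The abstract evaluates recognition at fixed SNR levels but does not say how the noisy test signals were generated. As an illustration only, the sketch below shows one common way to mix a noise recording into a clean signal at a target SNR, using the definition SNR(dB) = 10 log10(P_signal / P_noise). The function names (`add_noise_at_snr`, `measured_snr_db`), the synthetic sine "speech", and the Gaussian noise are all assumptions made for this example and do not come from the paper.

```python
import math
import random


def mean_power(x):
    """Average power (mean of squared samples) of a sequence."""
    return sum(v * v for v in x) / len(x)


def add_noise_at_snr(clean, noise, snr_db):
    """Scale `noise` so that clean + noise has the requested SNR in dB.

    Hypothetical helper for illustration; not the paper's procedure.
    """
    if len(noise) < len(clean):
        raise ValueError("noise must be at least as long as the clean signal")
    noise = noise[: len(clean)]
    # SNR_dB = 10 * log10(P_signal / P_noise)  =>  required noise power
    target_noise_power = mean_power(clean) / (10.0 ** (snr_db / 10.0))
    scale = math.sqrt(target_noise_power / mean_power(noise))
    return [c + scale * n for c, n in zip(clean, noise)]


def measured_snr_db(clean, noisy):
    """SNR of `noisy` relative to `clean`, treating the difference as noise."""
    residual = [y - c for c, y in zip(clean, noisy)]
    return 10.0 * math.log10(mean_power(clean) / mean_power(residual))


# Synthetic stand-ins (assumed): a 220 Hz tone sampled at 16 kHz and white noise.
random.seed(0)
sample_rate = 16000
clean = [math.sin(2.0 * math.pi * 220.0 * n / sample_rate) for n in range(sample_rate)]
noise = [random.gauss(0.0, 1.0) for _ in range(sample_rate)]

# One noisy version per SNR level reported in the abstract.
noisy_by_snr = {snr: add_noise_at_snr(clean, noise, snr) for snr in (10, 15, 20)}
```

Calling `measured_snr_db(clean, noisy_by_snr[15])` recovers the requested level up to floating-point rounding, which is a quick sanity check before feeding such signals into a denoising or feature extraction stage.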

Publisher

The Intelligent Networks and Systems Society

Subject

General Engineering, General Computer Science

Cited by 6 articles.

1. Optimal Wavelet Selection for Signal Denoising;IEEE Access;2024

2. Denoising of Surface Plasmon Resonance (SPR) Spectra Using the Generalized S-transform and the Bald Eagle Search (BES) Algorithm;Analytical Letters;2023-10-26

3. Exploration of English speech translation recognition based on the LSTM RNN algorithm;Neural Computing and Applications;2023-03-23

4. The application of generalized S-transform in the denoising of surface plasmon resonance (SPR) spectrum;Analytical Methods;2023

5. An Intelligent Recognition and Diagnosis System for English in Radio Land-Air Calls Based on Optimization Algorithm;Proceedings of the 7th International Conference on Cyber Security and Information Engineering;2022-09-23