An Experimental Study on Speech Enhancement Based on a Combination of Wavelets and Deep Learning-Reference-Cited by-同舟云学术

An Experimental Study on Speech Enhancement Based on a Combination of Wavelets and Deep Learning

Published:2022-06-20 Issue:6 Volume:10 Page:102
ISSN:2079-3197
Container-title:Computation
language:en
Short-container-title:Computation

Author:

Gutiérrez-Muñoz Michelle^ORCID,Coto-Jiménez Marvin^ORCID

Abstract

The purpose of speech enhancement is to improve the quality of speech signals degraded by noise, reverberation, or other artifacts that can affect the intelligibility, automatic recognition, or other attributes involved in speech technologies and telecommunications, among others. In such applications, it is essential to provide methods to enhance the signals to allow the understanding of the messages or adequate processing of the speech. For this purpose, during the past few decades, several techniques have been proposed and implemented for the abundance of possible conditions and applications. Recently, those methods based on deep learning seem to outperform previous proposals even on real-time processing. Among the new explorations found in the literature, the hybrid approaches have been presented as a possibility to extend the capacity of individual methods, and therefore increase their capacity for the applications. In this paper, we evaluate a hybrid approach that combines both deep learning and wavelet transformation. The extensive experimentation performed to select the proper wavelets and the training of neural networks allowed us to assess whether the hybrid approach is of benefit or not for the speech enhancement task under several types and levels of noise, providing relevant information for future implementations.

Publisher

MDPI AG

Subject

Applied Mathematics,Modeling and Simulation,General Computer Science,Theoretical Computer Science

Link

https://www.mdpi.com/2079-3197/10/6/102/pdf

Reference53 articles.

1. Research on Speech Signal Denoising Algorithm Based on Wavelet Analysis

2. Speech recognition with no speech or with noisy speech;Krishna;Proceedings of the ICASSP 2019—2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP),2019

3. Performance monitoring for automatic speech recognition in noisy multi-channel environments;Meyer;Proceedings of the 2016 IEEE Spoken Language Technology Workshop (SLT). IEEE,2016

4. Hybrid speech enhancement with wiener filters and deep LSTM denoising autoencoders;Coto-Jimenez;Proceedings of the 2018 IEEE International Work Conference on Bioinspired Intelligence (IWOBI),2018

5. Multi-objective learning based speech enhancement method to increase speech quality and intelligibility for hearing aid device users

Cited by 8 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Detection of E. coli concentration levels using CSI-D+ handheld with UV-C fluorescence imaging and deep learning on leaf surfaces;Sensing for Agriculture and Food Quality and Safety XVI;2024-06-06

2. SASEGAN-TCN: Speech enhancement algorithm based on self-attention generative adversarial network and temporal convolutional network;Mathematical Biosciences and Engineering;2024

3. An optimized convolutional neural network for speech enhancement;International Journal of Speech Technology;2023-12

4. Audio Noise Reduction Using Different Wavelet and Filtering Procedures;2023 IEEE North Karnataka Subsection Flagship International Conference (NKCon);2023-11-19

5. Speech signal analysis and enhancement using combined wavelet Fourier transform with stacked deep learning architecture;International Journal of Speech Technology;2023-09