Abstract
The purpose of speech enhancement is to improve the quality of speech signals degraded by noise, reverberation, or other artifacts that can affect the intelligibility, automatic recognition, or other attributes involved in speech technologies and telecommunications, among others. In such applications, it is essential to provide methods to enhance the signals to allow the understanding of the messages or adequate processing of the speech. For this purpose, during the past few decades, several techniques have been proposed and implemented for the abundance of possible conditions and applications. Recently, those methods based on deep learning seem to outperform previous proposals even on real-time processing. Among the new explorations found in the literature, the hybrid approaches have been presented as a possibility to extend the capacity of individual methods, and therefore increase their capacity for the applications. In this paper, we evaluate a hybrid approach that combines both deep learning and wavelet transformation. The extensive experimentation performed to select the proper wavelets and the training of neural networks allowed us to assess whether the hybrid approach is of benefit or not for the speech enhancement task under several types and levels of noise, providing relevant information for future implementations.
Subject
Applied Mathematics,Modeling and Simulation,General Computer Science,Theoretical Computer Science
Reference53 articles.
1. Research on Speech Signal Denoising Algorithm Based on Wavelet Analysis
2. Speech recognition with no speech or with noisy speech;Krishna;Proceedings of the ICASSP 2019—2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP),2019
3. Performance monitoring for automatic speech recognition in noisy multi-channel environments;Meyer;Proceedings of the 2016 IEEE Spoken Language Technology Workshop (SLT). IEEE,2016
4. Hybrid speech enhancement with wiener filters and deep LSTM denoising autoencoders;Coto-Jimenez;Proceedings of the 2018 IEEE International Work Conference on Bioinspired Intelligence (IWOBI),2018
5. Multi-objective learning based speech enhancement method to increase speech quality and intelligibility for hearing aid device users
Cited by
8 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献