Self-Supervised Speech Enhancement for Arabic Speech Recognition in Real-World Environments-Reference-Cited by-同舟云学术

Self-Supervised Speech Enhancement for Arabic Speech Recognition in Real-World Environments

Published:2021-04-30 Issue:2 Volume:38 Page:349-358
ISSN:0765-0019
Container-title:Traitement du Signal
language:
Short-container-title:TS

Author:

Dendani Bilal,Bahi Halima,Sari Toufik

Abstract

Mobile speech recognition attracts much attention in the ubiquitous context, however, background noises, speech coding, and transmission errors are prone to corrupt the incoming speech. Therein, building a robust speech recognizer requires the availability of a large number of real-world speech samples. Arabic language, like many other languages, lacks such resources; to overcome this limitation, we propose a speech enhancement step, before the recognition begins. For the speech enhancement purpose, we suggest the use of a deep autoencoder (DAE) algorithm. A two-step procedure is suggested: in the first step, an overcomplete DAE is trained in an unsupervised way, and in the second one, a denoising DAE is trained in a supervised way leveraging the clean speech produced in the previous step. Experimental results performed on a real-life mobile database confirmed the potentials of the proposed approach and show a reduction of the WER (Word Error Rate) of a ubiquitous Arabic speech recognizer. Further experiments show an improvement of the perceptual evaluation of speech quality (PESQ), and the short-time objective intelligibility (STOI) as well.

Publisher

International Information and Engineering Technology Association

Subject

Electrical and Electronic Engineering

Cited by 6 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Improving self-supervised learning model for audio spoofing detection with layer-conditioned embedding fusion;Computer Speech & Language;2024-06

2. Enhanced Emotion Recognition from Spoken Assamese Dialect: A Machine Learning Approach with Language-Independent Features;Traitement du Signal;2023-10-30

3. Deep Learning Methods for Arabic Autoencoder Speech Recognition System for Electro-Larynx Device;Advances in Human-Computer Interaction;2023-02-28

4. Modern Standard Arabic Speech Corpora: A Systematic Review;IEEE Access;2023

5. Acoustic Model with Multiple Lexicon Types for Indonesian Speech Recognition;Applied Computational Intelligence and Soft Computing;2022-09-16