Automatic Speech-to-Text Transcription in Arabic-Reference-Cited by-同舟云学术

Automatic Speech-to-Text Transcription in Arabic

Published:2009-12 Issue:4 Volume:8 Page:1-18
ISSN:1530-0226
Container-title:ACM Transactions on Asian Language Information Processing
language:en
Short-container-title:ACM Transactions on Asian Language Information Processing

Author:

Lamel Lori¹,Messaoudi Abdelkhalek¹,Gauvain Jean-Luc¹

Affiliation:

1. LIMSI-CNRS

Abstract

The Arabic language presents a number of challenges for speech recognition, arising in part from the significant differences in the spoken and written forms, in particular the conventional form of texts being non-vowelized. Being a highly inflected language, the Arabic language has a very large lexical variety and typically with several possible (generally semantically linked) vowelizations for each written form. This article summarizes research carried out over the last few years on speech-to-text transcription of broadcast data in Arabic. The initial research was oriented toward processing of broadcast news data in Modern Standard Arabic, and has since been extended to address a larger variety of broadcast data, which as a consequence results in the need to also be able to handle dialectal speech. While standard techniques in speech recognition have been shown to apply well to the Arabic language, taking into account language specificities help to significantly improve system performance.

Funder

Defense Advanced Research Projects Agency

Publisher

Association for Computing Machinery (ACM)

Subject

General Computer Science

Link

https://dl.acm.org/doi/pdf/10.1145/1644879.1644885

Reference38 articles.

1. Adda-Decker M. and Lamel L. 2000. The use of lexica in automatic speech recognition. F. Van Eynde and D. Gibbon Eds. Kluwer Academic Publishers 235--266. Adda-Decker M. and Lamel L. 2000. The use of lexica in automatic speech recognition . F. Van Eynde and D. Gibbon Eds. Kluwer Academic Publishers 235--266.

Cited by 7 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. End-to-End Speech Recognition For Arabic Dialects;Arabian Journal for Science and Engineering;2023-03-01

2. Real-time speech recognition of arabic language;AIP Conference Proceedings;2023

3. An Approach for Pronunciation Classification of Classical Arabic Phonemes Using Deep Learning;Applied Sciences;2021-12-27

4. Heterophonic speech recognition using composite phones;SpringerPlus;2016-11-24

5. Structured Output Layer Neural Network Language Models for Speech Recognition;IEEE Transactions on Audio, Speech, and Language Processing;2013-01