Affiliation:
1. Master in Web Sciences, Syrian Virtual University, Damascus, Syria
Abstract
This paper aims to find the relationship between the full diacritization of the Arabic text and the quality of the speech synthesized in screen readers and presents a new methodology to develop screen readers for the visually impaired, focusing on preprocessing and diacritization of the text before converting it to audio. First, the actual need for our proposal was measured by conducting a MOS (Mean Opinion Score) questionnaire to evaluate the quality of the speech synthesized before and after full diacritization in the NVDA (https://www.nvda-ar.org/) screen reader. Then, an e-reader was built by integrating two models: the first one is for automatic Arabic diacritization (depending on Shakkala), and the second is a TTS model (depending on Tacotron). The quality of our proposed system was measured in terms of (1) pronunciation and (2) intelligibility, in which our system outperformed the commercial screen readers, NVDA and IBSAR (https://www.sakhr.com), as it recorded 60.67%, 17.67%, and 21.67% as correct, incorrect, and partially correct, respectively, for the isolated word test, and 84% correct results for the homograph test, and 78.50% and 93% correct results, respectively, for the DRT and DMRT tests.
Subject
Human-Computer Interaction
Reference26 articles.
1. Automatic diacritization of Arabic text using recurrent neural networks;G. A. Abandah;International Journal on Document Analysis and Recognition,2015
2. A survey of automatic Arabic diacritization techniques
3. A rule based method for adding case ending diacritics for modern standard Arabic texts;A. Fashwan
4. Alserag: an automatic diacritization system for Arabic;S. Alansary,2018