Design and Development of a Text-to-Speech Synthesizer for Afan Oromo

Author:

Chala Tamrat DelessaORCID,Guta Ashenafi Chalchisa,Asebel Muluken Hussen

Abstract

AbstractSpeech is one of the natural ways of communication between humans, later extended as a means for human–computer interaction. It helps visually impaired people to read electronic texts and is used in information retrieval and language education. This paper proposed the development of a text-to-speech synthesizer for Afan Oromo (Oromo Language), using unit selection speech synthesizer approaches. Although several works have been conducted in the area of text-to-speech synthesis for technologically favored languages for many years, every language has its own unique features. So, speech synthesizer systems developed for one language cannot be used for another language, because the structures of one language are not presumably representative of others. It is clear that each program is based on the system corresponding to the phonetic rules of a certain language. Besides, the existing text-to-speech synthesizer for Afan Oromo was reviewed in this study and the result of developed prototype results are showing promising, however, still, their performance needs a lot of improvement in terms of intelligibility and naturalness using novel approaches and quality of corpus. Therefore, this research was initiated to develop the possibility of developing a prototype text-to-speech synthesizer to improve the performance of the text-to-speech synthesizer. In this study, Afan Oromo corpus was collected from genuine sources and prepared speech datasets both text and audio in collaboration with Afan Oromo experts. The performance of the synthesizer was tested by proper users for its intelligibility and naturalness using Mean Opinion Scale (MOS). The obtained result of naturalness of the prototype is 4.44 (very good) out of 5, which indicated that the result obtained is encouraging and better performance than the existing TTS of Afan Oromo in terms of intelligibility and naturalness. But the result scored in terms of intelligibility still needs further work. The main challenge is Afan Oromo has many dialects, so preparing a balanced text corpus from each dialect is very tough. Moreover, enhancement of the work is predicted to bring a reasonable level of intelligibility to the system.

Funder

Széchenyi István University

Publisher

Springer Science and Business Media LLC

Subject

General Medicine

Reference23 articles.

1. Hunton JE. Blending information and communication technology with accounting research. Account Horiz. 2002;16:55–67.

2. Sasirekha D, Chandra E. Text to speech: a simple tutorial. Int J Soft Comput Eng. 2012;2:275–8.

3. Palo P, Laine UK. A review of articulatory speech synthesis. Master's of thesis, University of Helsinki, 2006

4. Tolessa T. Early history of written Oromo language up to 1900. Sci Tech Arts Res J. 2013;1:76–80.

5. Melbaa Gadaa O. An introduction to the history of the Oromo people. Khartoum, Sudan, 1988

Cited by 2 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3