Building a natural sounding Text-To-Speech system for the Nepali language: research and development challenges and solutions
Author:
Bajracharya Roop Shree Ratna, Regmi Santosh, Bal Bal Krishna, Prasain Balaram
Abstract
Text-to-Speech (TTS) synthesis has come a long way from its primitive, monotone synthetic voices to more natural and intelligible voices. One direct application of natural-sounding TTS systems is screen readers for the blind and visually impaired community. The Festival Speech Synthesis System uses concatenative speech synthesis together with unit selection to generate a natural-sounding voice. This work primarily gives an account of the efforts put into developing a natural-sounding TTS system for Nepali using the Festival system. We also shed light on the issues faced and the solutions derived, which may overlap considerably with those of other similar under-resourced languages in the region.
Publisher
Nepal Journals Online (JOL)
Cited by
1 article.
1. Strategies for Corpus Development for Low-Resource Languages; Automatic Speech Recognition and Translation for Low Resource Languages; 2024-03-29