FORMULATION OF SYLLABLE BASED PRONUNCIATION MODELS FOR TAMIL TEXT-TO-SPEECH SYNTHESIZER
-
Published:2020-10-30
Issue:4
Volume:33
Page:282-297
-
ISSN:0127-9084
-
Container-title:Malaysian Journal of Computer Science
-
language:
-
Short-container-title:MJCS
Author:
Rajendran Vaibhavi,Kumar G Bharadwaja
Abstract
The primary aim of Human-Computer Interaction (HCI) is to deliver the power of computers and communication systems to people in an easily accessible and understandable form. HCI in a person’s native/first language is always invigorate. Developing a Tamil Text-To-Speech (TTS) system will facilitate a convenient medium of interaction for people who speak Tamil language. This paper emphasizes on the development of pronunciation models, a vital component of a Tamil TTS. Developing a pronunciation model for Tamil is more arduous when compared to other languages due to the non-triviality between the letter to sound correspondence. Veritably, two syllable-based pronunciation models developed by us are discussed in this paper. First, is a syllable-centric rule-based pronunciation model that generates a well-founded training data which is ingrained into the second, Conditional Random Field (CRF) enforced model. It is evident that both of these models are dominions with a high Mean Similarity Score of 0.97 and 0.94 respectively in comparison to the other existing rule driven and data driven models in the literature. These syllable-based pronunciation models will enrich the performance of a Tamil TTS.
Publisher
Univ. of Malaya
Subject
General Computer Science