Abstract
Abstract
In English phonetic synthesis, it is impossible to create a thesaurus containing all vocabulary as English has an almost unlimited vocabulary. Hence, for English words that are not included in the thesaurus, generating phonetic symbols through the “Grapheme-to-phoneme (G2P)” algorithm is the best solution. For this purpose, a dynamic finite generalization (DFGA) machine learning algorithm for the rules of G2P conversion is proposed in this paper. The dictionary library used for learning has 27,040 words, 90% of which are used for rule learning, and the remaining 10% are used for testing. After ten rounds of cross-validation, the average grapheme conversion accuracy in the learning and test sets is 99.78% and 93.14%, and the average vocabulary conversion accuracy is 99.56% and 73.51%, respectively.
Subject
General Physics and Astronomy
Reference6 articles.
1. Sentence Selection Based on Extended Entropy Using Phonetic and Prosodic Contexts for Statistical Parametric Speech Synthesis [J];Nose;IEEE/ACM Transactions on Audio Speech & Language Processing,2018
2. HMM-based database segmentation and unit selection for concatenative speech synthesis [J];Martin;Journal of the Acoustical Society of America,2017
3. A Parameter Generation Algorithm Using Local Variance for HMM-Based Speech Synthesis [J];Nose;IEEE Journal of Selected Topics in Signal Processing,2017
4. Homograph filter for speech synthesis system [J];Huynh;Journal of the Acoustical Society of America,2015
5. Arabic discourse analysis based on acoustic, prosodic and phonetic modeling: elocution evaluation, speech classification and pathological speech correction [J];Maraoui;International journal of speech technology,2016
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献