Design of English text-to-speech conversion algorithm based on machine learning-Reference-Cited by-同舟云学术

Design of English text-to-speech conversion algorithm based on machine learning

Published:2021-02-02 Issue:2 Volume:40 Page:2433-2444
ISSN:1064-1246
Container-title:Journal of Intelligent & Fuzzy Systems
language:
Short-container-title:IFS

Author:

Dongmei Li¹

Affiliation:

1. Department of Foreign Language, Inner Mongolia University of Technology, Huhhot, China

Abstract

English text-to-speech conversion is the key content of modern computer technology research. Its difficulty is that there are large errors in the conversion process of text-to-speech feature recognition, and it is difficult to apply the English text-to-speech conversion algorithm to the system. In order to improve the efficiency of the English text-to-speech conversion, based on the machine learning algorithm, after the original voice waveform is labeled with the pitch, this article modifies the rhythm through PSOLA, and uses the C4.5 algorithm to train a decision tree for judging pronunciation of polyphones. In order to evaluate the performance of pronunciation discrimination method based on part-of-speech rules and HMM-based prosody hierarchy prediction in speech synthesis systems, this study constructed a system model. In addition, the waveform stitching method and PSOLA are used to synthesize the sound. For words whose main stress cannot be discriminated by morphological structure, label learning can be done by machine learning methods. Finally, this study evaluates and analyzes the performance of the algorithm through control experiments. The results show that the algorithm proposed in this paper has good performance and has a certain practical effect.

Publisher

IOS Press

Subject

Artificial Intelligence,General Engineering,Statistics and Probability

Reference24 articles.

1. Healthcare Big Data Voice Pathology Assessment Framework;Hossain;IEEE Access,2016

2. Are there vocal cues to human developmental stability? Relationships between facial fluctuating asymmetry and voice attractiveness;Hill;Evolution & Human Behavior,2017

3. Voice recognition through the use of Gabor transform and heuristic algorithm;Woźniak;Nephron Clinical Practice,2017

4. Objective voice and speech analysis of persons with chronic hoarseness by prosodic analysis of speech samples;Haderlein;Logopedics Phoniatrics Vocology,2015

5. Human Recognition using Voice Print in LabVIEW;Nidhyananthan;International Journal of Applied Engineering Research,2018

Cited by 7 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Design of Computer Information Management System Based on Machine Learning Algorithms;Scalable Computing: Practice and Experience;2024-02-24

2. Automatic Assessment System for English Pronunciation Machine Quality Based on GloVe-CNN Algorithm;2023 3rd International Conference on Mobile Networks and Wireless Communications (ICMNWC);2023-12-04

3. Application of Intelligent Fuzzy Decision Tree Algorithm in English Machine Translation;2023 International Conference on Telecommunications, Electronics and Informatics (ICTEI);2023-09-11

4. Construction of English Translation Model Based on Improved Fuzzy Semantic Optimal Control of GLR Algorithm;Scientific Programming;2022-05-31

5. Design of Aging Smart Home Products Based on Radial Basis Function Speech Emotion Recognition;Frontiers in Psychology;2022-05-04