Affiliation:
1. Henan Medical College, Zhengzhou, Henan, China
Abstract
The posterior probability measures widely used in English speech recognition have two shortcomings: they cannot measure pronunciation quality consistently across different phonemes, and the acoustic modeling method used for speech recognition is inconsistent with the evaluation target. To improve the evaluation of English pronunciation quality in colleges and universities, this paper uses artificial emotion recognition and a high-speed hybrid model to analyze and filter the various kinds of clutter that affect speech quality, thereby improving recognition of students’ English speech. The approach exploits the fact that the clutter and the target in the data follow different distributions: based on statistically obtained clutter distribution characteristics, it suppresses the clutter to improve target detection performance. The proposed method also addresses the limitations of clutter suppression technology in traditional voice detection systems. To study the model’s pronunciation quality evaluation and its effect in English teaching, this paper designs a controlled experiment to analyze the model’s performance. The results show that the model constructed in this paper performs well.
Subject
Artificial Intelligence, General Engineering, Statistics and Probability
Cited by
15 articles.