Quality evaluation of English pronunciation based on artificial emotion recognition and gaussian mixture model

Author:

Gang Zhang1

Affiliation:

1. Henan Medical College, Henan Zhengzhou, China

Abstract

At present, the posterior probability measure widely used in English speech recognition has the situation that the posterior probability measure of different phonemes cannot be consistent to measure the pronunciation quality of the phoneme and the acoustic modeling method of voice recognition is inconsistent with the evaluation target. Therefore, in order to improve the evaluation effect of English pronunciation quality in colleges and universities, this article is based on artificial emotion recognition and high-speed hybrid model to analyze and filter various clutters that affect speech quality to improve students’ English speech recognition. Moreover, this article uses the characteristics of the clutter and the target in the data to conform to different distributions and based on the clutter distribution characteristics obtained by statistics, this article realizes the suppression of the clutter to improve the target detection performance. In addition, the method proposed in this paper solves the limitations of the clutter suppression technology in the traditional voice detection system and improves the target detection performance. In order to study the pronunciation quality evaluation effect of this model and its effect in English teaching, this paper designs a controlled experiment to analyze the model’s performance. The research results show that the model constructed in this paper has good performance.

Publisher

IOS Press

Subject

Artificial Intelligence,General Engineering,Statistics and Probability

Reference28 articles.

1. Aging effects on voice features used in forensic speaker comparison[J];Rhodes;International Journal of Speech Language & The Law,2017

2. A Review of Audio Features and Statistical Models Exploited for Voice Pattern Design[J];Duong;Computer Science,2015

3. The effects of whispered speech on state-of-the-art voice based biometrics systems[J];Sarria-Paja;Canadian Conference on Electrical and Computer Engineering,2015

4. Speaker-individuality in Fujisaki model f0 features: Implications for forensic voice comparison[J];Leeman;International Journal of Speech Language and the Law,2015

5. Are there vocal cues to human developmental stability? Relationships between facial fluctuating asymmetry and voice attractiveness[J];Hill;Evolution & Human Behavior,2017

Cited by 15 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3