Affiliation:
1. Hainan College of Foreign Studies, Wenchang, China
Abstract
The difference between English and Chinese expressions is that English emphasizes the stress of syllables, so the recognition of English speech emotions plays an important role in learning English. This study uses transfer learning as the technical support to study English speech emotion recognition. The acoustic model based on weight transfer has two different training strategies: single-stage training and two-stage training strategy. By comparing the performance of the English speech emotion recognition model based on CNN neural network and the model proposed in this paper, the statistical comparison data is drawn into a statistical graph. The research results show that transfer learning has certain advantages over other algorithms in English speech emotion recognition. In the subsequent teaching and real-time translation equipment research, transfer learning can be applied to English models.
Subject
Artificial Intelligence,General Engineering,Statistics and Probability
Reference22 articles.
1. Aging effects on voice features used in forensic speaker comparison;Rhodes;International Journal of Speech Language & the Law,2017
2. A review of audio features and statistical models exploited for voice pattern design;Ngoc Duong;Computer Science,2015
3. The effects of whispered speech on state-of-the-art voice based biometrics systems;Sarria-Paja;Canadian Conference on Electrical and Computer Engineering,2015
4. Speaker-individuality in Fujisaki model f0 features: Implications for forensic voice comparison;Leeman;International Journal of Speech Language and the Law,2015
5. Are there vocal cues to human developmental stability? Relationships between facial fluctuating asymmetry and voice attractiveness;Hill;Evolution & Human Behavior,2017
Cited by
16 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献