Simulation of English speech emotion recognition based on transfer learning and CNN neural network-Reference-Cited by-同舟云学术

Simulation of English speech emotion recognition based on transfer learning and CNN neural network

Published:2021-02-02 Issue:2 Volume:40 Page:2349-2360
ISSN:1064-1246
Container-title:Journal of Intelligent & Fuzzy Systems
language:
Short-container-title:IFS

Author:

Chen Xuehua¹

Affiliation:

1. Hainan College of Foreign Studies, Wenchang, China

Abstract

The difference between English and Chinese expressions is that English emphasizes the stress of syllables, so the recognition of English speech emotions plays an important role in learning English. This study uses transfer learning as the technical support to study English speech emotion recognition. The acoustic model based on weight transfer has two different training strategies: single-stage training and two-stage training strategy. By comparing the performance of the English speech emotion recognition model based on CNN neural network and the model proposed in this paper, the statistical comparison data is drawn into a statistical graph. The research results show that transfer learning has certain advantages over other algorithms in English speech emotion recognition. In the subsequent teaching and real-time translation equipment research, transfer learning can be applied to English models.

Publisher

IOS Press

Subject

Artificial Intelligence,General Engineering,Statistics and Probability

Reference22 articles.

1. Aging effects on voice features used in forensic speaker comparison;Rhodes;International Journal of Speech Language & the Law,2017

2. A review of audio features and statistical models exploited for voice pattern design;Ngoc Duong;Computer Science,2015

3. The effects of whispered speech on state-of-the-art voice based biometrics systems;Sarria-Paja;Canadian Conference on Electrical and Computer Engineering,2015

4. Speaker-individuality in Fujisaki model f0 features: Implications for forensic voice comparison;Leeman;International Journal of Speech Language and the Law,2015

5. Are there vocal cues to human developmental stability? Relationships between facial fluctuating asymmetry and voice attractiveness;Hill;Evolution & Human Behavior,2017

Cited by 16 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Applying RFID and NLP for efficient warehouse picking;International Journal of RF Technologies;2024-03-05

2. Design of Neural Network-Based Intelligent Robot-Assisted English Translation System;Lecture Notes in Networks and Systems;2024

3. Construction of English Speech Recognition Model by Fusing CNN and Random Deep Factorization TDNN;ACM Transactions on Asian and Low-Resource Language Information Processing;2023-05-23

4. Exploration of English speech translation recognition based on the LSTM RNN algorithm;Neural Computing and Applications;2023-03-23

5. A novel transfer learning model on complex fuzzy inference system;Journal of Intelligent & Fuzzy Systems;2023-03-09