Simulation of English speech emotion recognition based on transfer learning and CNN neural network

Author:

Chen Xuehua1

Affiliation:

1. Hainan College of Foreign Studies, Wenchang, China

Abstract

The difference between English and Chinese expressions is that English emphasizes the stress of syllables, so the recognition of English speech emotions plays an important role in learning English. This study uses transfer learning as the technical support to study English speech emotion recognition. The acoustic model based on weight transfer has two different training strategies: single-stage training and two-stage training strategy. By comparing the performance of the English speech emotion recognition model based on CNN neural network and the model proposed in this paper, the statistical comparison data is drawn into a statistical graph. The research results show that transfer learning has certain advantages over other algorithms in English speech emotion recognition. In the subsequent teaching and real-time translation equipment research, transfer learning can be applied to English models.

Publisher

IOS Press

Subject

Artificial Intelligence,General Engineering,Statistics and Probability

Reference22 articles.

1. Aging effects on voice features used in forensic speaker comparison;Rhodes;International Journal of Speech Language & the Law,2017

2. A review of audio features and statistical models exploited for voice pattern design;Ngoc Duong;Computer Science,2015

3. The effects of whispered speech on state-of-the-art voice based biometrics systems;Sarria-Paja;Canadian Conference on Electrical and Computer Engineering,2015

4. Speaker-individuality in Fujisaki model f0 features: Implications for forensic voice comparison;Leeman;International Journal of Speech Language and the Law,2015

5. Are there vocal cues to human developmental stability? Relationships between facial fluctuating asymmetry and voice attractiveness;Hill;Evolution & Human Behavior,2017

Cited by 16 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Applying RFID and NLP for efficient warehouse picking;International Journal of RF Technologies;2024-03-05

2. Design of Neural Network-Based Intelligent Robot-Assisted English Translation System;Lecture Notes in Networks and Systems;2024

3. Construction of English Speech Recognition Model by Fusing CNN and Random Deep Factorization TDNN;ACM Transactions on Asian and Low-Resource Language Information Processing;2023-05-23

4. Exploration of English speech translation recognition based on the LSTM RNN algorithm;Neural Computing and Applications;2023-03-23

5. A novel transfer learning model on complex fuzzy inference system;Journal of Intelligent & Fuzzy Systems;2023-03-09

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3