Speech emotion recognition based on emotion perception-Reference-Cited by-同舟云学术

Speech emotion recognition based on emotion perception

Published:2023-05-12 Issue:1 Volume:2023 Page:
ISSN:1687-4722
Container-title:EURASIP Journal on Audio, Speech, and Music Processing
language:en
Short-container-title:J AUDIO SPEECH MUSIC PROC.

Author:

Liu Gang^ORCID,Cai Shifang,Wang Ce

Abstract

AbstractSpeech emotion recognition (SER) is a hot topic in speech signal processing. With the advanced development of the cheap computing power and proliferation of research in data-driven methods, deep learning approaches are prominent solutions to SER nowadays. SER is a challenging task due to the scarcity of datasets and the lack of emotion perception. Most existing networks of SER are based on computer vision and natural language processing, so the applicability for extracting emotion is not strong. Drawing on the research results of brain science on emotion computing and inspired by the emotional perceptive process of the human brain, we propose an approach based on emotional perception, which designs a human-like implicit emotional attribute classification and introduces implicit emotional information through multi-task learning. Preliminary experiments show that the unweighted accuracy (UA) of the proposed method has increased by 2.44%, and weighted accuracy (WA) 3.18% (both absolute values) on the Interactive Emotional Dyadic Motion Capture (IEMOCAP) dataset, which verifies the effectiveness of our method.

Publisher

Springer Science and Business Media LLC

Subject

Electrical and Electronic Engineering,Acoustics and Ultrasonics

Link

https://link.springer.com/content/pdf/10.1186/s13636-023-00289-4.pdf

Reference31 articles.

1. L.S.A. Low, N.C. Maddage, M. Lech, L.B. Sheeber, N.B. Allen, Detection of clinical depression in adolescents’ speech during family interactions. IEEE Trans. Biomed. Eng. 58(3), 574–586 (2010)

2. X. Huahu, G. Jue, Y. Jian, in Proceedings of the 2010 International Conference on Artificial Intelligence and Computational Intelligence, vol. 1. Application of speech emotion recognition in intelligent household robot, (IEEE, Sanya, 2010), pp. 537–541

3. W.J. Yoon, Y.H. Cho, K.S. Park, in International Conference on Ubiquitous Intelligence and Computing. A study of speech emotion recognition and its application to mobile services (Springer, Hong Kong China, 2007), pp. 758–766

4. K. Han, D. Yu, I. Tashev, in Proceedings of Interspeech 2014. Speech emotion recognition using deep neural network and extreme learning machine (ISCA, Singapore, 2014)

5. M. Chen, X. He, J. Yang, H. Zhang, 3-d convolutional recurrent neural networks with attention model for speech emotion recognition. IEEE Signal Process. Lett. 25(10), 1440–1444 (2018)

Cited by 5 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Exploring the Effectiveness of Advanced Machine Learning Models in Speech Emotion Recognition;2024 International Conference on Communication, Computer Sciences and Engineering (IC3SE);2024-05-09

2. An Intelligent Emotion Recognition System based on Speech Terminologies using Artificial Intelligence Assisted Learning Scheme;2024 Ninth International Conference on Science Technology Engineering and Mathematics (ICONSTEM);2024-04-04

3. A Machine Learning and Deep Learning based Approach to Generate a Speech Emotion Recognition System;2024 11th International Conference on Computing for Sustainable Global Development (INDIACom);2024-02-28

4. Machine Learning Approach for Detection of Speech Emotions for RAVDESS Audio Dataset;2024 Fourth International Conference on Advances in Electrical, Computing, Communication and Sustainable Technologies (ICAECT);2024-01-11

5. CCTG-NET: Contextualized Convolutional Transformer-GRU Network for speech emotion recognition;International Journal of Speech Technology;2023-12