Abstract
The study of understanding sentiment and emotion in speech is a challenging task in human multimodal language. However, in certain cases, such as telephone calls, only audio data can be obtained. In this study, we independently evaluated sentiment analysis and emotion recognition from speech using recent self-supervised learning models—specifically, universal speech representations with speaker-aware pre-training models. Three different sizes of universal models were evaluated for three sentiment tasks and an emotion task. The evaluation revealed that the best results were obtained with two classes of sentiment analysis, based on both weighted and unweighted accuracy scores (81% and 73%). This binary classification with unimodal acoustic analysis also performed competitively compared to previous methods which used multimodal fusion. The models failed to make accurate predictionsin an emotion recognition task and in sentiment analysis tasks with higher numbers of classes. The unbalanced property of the datasets may also have contributed to the performance degradations observed in the six-class emotion, three-class sentiment, and seven-class sentiment tasks.
Funder
New Energy and Industrial Technology Development Organization
Subject
Electrical and Electronic Engineering,Biochemistry,Instrumentation,Atomic and Molecular Physics, and Optics,Analytical Chemistry
Reference28 articles.
1. Prosody, Information, and Modeling with Emphasis on Tonal Features of Speech;Fujisaki;Proceedings of the Workshop on Spoken Language Processing,2003
2. Sentiment-Aware Automatic Speech Recognition Pre-Training for Enhanced Speech Emotion Recognition
3. Evaluation of error- and correlation-based loss functions for multitask learning dimensional speech emotion recognition
4. Sentiment analysis and emotion recognition: Evolving the paradigm of communication within data classification;Gross;Appl. Mark. Anal.,2020
5. Sentiment analysis of online spoken reviews
Cited by
20 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献