Abstract
Multimodal methods are widely used for sentiment analysis, and for speech sentiment analysis in particular. Compared with text, speech conveys human emotion more directly because it carries richer emotional cues. Most current studies focus on the extraction of speech features, but the accuracy and inference speed of existing models still need to be improved. To improve the extraction and fusion of speech emotion features, we present a new framework that combines a hierarchical conformer model with an attention-based GRU (AUGRU) model to increase accuracy. The method has two main parts: a local feature learning group and a global feature learning group. The local feature learning group uses the conformer model to learn the spatio-temporal structure of speech emotion features, combining convolution and a transformer to strengthen the extraction of both long- and short-term information. The global features are then extracted by the AUGRU model, and feature fusion is performed by an attention mechanism that assigns weights to the feature information. Finally, emotions are recognized by a fully connected layer and classified using a center loss function together with a softmax function. Compared with existing speech sentiment analysis models, our framework obtains better classification results on the IEMOCAP and RAVDESS benchmark datasets.
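To make the described pipeline concrete, the following is a minimal PyTorch sketch of the architecture outlined in the abstract (conformer-style local blocks, an attention-gated GRU for the global features, and a fully connected softmax classifier). The class names, feature dimensions, kernel sizes, and number of blocks are illustrative assumptions, not the authors' implementation; the center loss term is omitted and only the embedding it would operate on is returned.

```python
# Minimal sketch of the described framework; all hyperparameters are assumptions.
import torch
import torch.nn as nn


class ConformerBlock(nn.Module):
    """Local feature learning: self-attention (long-term) + depthwise conv (short-term)."""
    def __init__(self, dim, heads=4, kernel_size=15):
        super().__init__()
        self.attn_norm = nn.LayerNorm(dim)
        self.attn = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.conv_norm = nn.LayerNorm(dim)
        self.conv = nn.Sequential(
            nn.Conv1d(dim, dim, kernel_size, padding=kernel_size // 2, groups=dim),
            nn.BatchNorm1d(dim),
            nn.SiLU(),
        )
        self.ff = nn.Sequential(
            nn.LayerNorm(dim), nn.Linear(dim, dim * 4), nn.SiLU(), nn.Linear(dim * 4, dim)
        )

    def forward(self, x):                                   # x: (batch, time, dim)
        a, _ = self.attn(*[self.attn_norm(x)] * 3)          # transformer branch
        x = x + a
        c = self.conv(self.conv_norm(x).transpose(1, 2)).transpose(1, 2)  # conv branch
        x = x + c
        return x + self.ff(x)


class AUGRU(nn.Module):
    """Global feature learning: GRU whose state update is scaled by attention weights."""
    def __init__(self, dim):
        super().__init__()
        self.cell = nn.GRUCell(dim, dim)
        self.score = nn.Linear(dim, 1)

    def forward(self, x):                                   # x: (batch, time, dim)
        att = torch.softmax(self.score(x), dim=1)           # per-frame attention weights
        h = x.new_zeros(x.size(0), x.size(2))
        for t in range(x.size(1)):
            h_new = self.cell(x[:, t], h)
            h = att[:, t] * h_new + (1 - att[:, t]) * h     # attention-gated update
        return h


class SpeechEmotionNet(nn.Module):
    def __init__(self, dim=80, num_classes=4, num_blocks=2):
        super().__init__()
        self.local = nn.Sequential(*[ConformerBlock(dim) for _ in range(num_blocks)])
        self.global_ = AUGRU(dim)
        self.fc = nn.Linear(dim, num_classes)               # logits for softmax classification

    def forward(self, feats):                               # feats: (batch, time, dim), e.g. log-Mel frames
        emb = self.global_(self.local(feats))
        return self.fc(emb), emb                            # logits + embedding for a center loss


logits, emb = SpeechEmotionNet()(torch.randn(2, 100, 80))
print(logits.shape, emb.shape)                              # torch.Size([2, 4]) torch.Size([2, 80])
```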
Funder
National Natural Science Foundation of China
Subject
Fluid Flow and Transfer Processes, Computer Science Applications, Process Chemistry and Technology, General Engineering, Instrumentation, General Materials Science
Cited by
3 articles.