Affiliation:
1. Anhui University, China
2. Anhui University, Hefei Comprehensive National Science Center, China
3. Hefei University of Technology, Hefei Comprehensive National Science Center, China
Abstract
Recently,
E
motion
R
ecognition in
C
onversation (ERC) has attracted much attention and has become a hot topic in the field of natural language processing. Conversation is conducted in chronological order; current utterance is more likely influenced by nearby utterances. At the same time, speaker dependency also plays a core role in the conversation dynamic. The combined effect of the sequence-aware information and the speaker-aware information makes the emotion’s dynamic change. However, past works used simple information fusion methods to model the two kinds of information but ignored their interactive influence. Thus, we propose a novel method entitled SIGAT (
S
peaker-aware
I
nteractive
G
raph
A
ttention Ne
t
work) to solve the problem. The core module is a mutual interactive module in which a dual-connection (self-connection and interact-connection) graph attention network is constructed. The advantage of SIGAT is modeling the speaker-aware and sequence-aware information in a unified graph and updating them simultaneously. In this way, we model the interactive influence of them and obtain the final representations, which have richer contextual clues. Experimental results on the four public datasets demonstrate that SIGAT outperforms the state-of-the-art models.
Funder
National Key Research Development Program of China
Major Project of Anhui Province
Anhui Province Key Research and Development Program
General Programmer of the National Natural Science Foundation of China
National Natural Science Foundation of China
Major Projects of Science and Technology in Anhui Province
Publisher
Association for Computing Machinery (ACM)
Reference44 articles.
1. IEMOCAP: interactive emotional dyadic motion capture database
2. Zi Chai and Xiaojun Wan. 2020. Learning to ask more: Semi-autoregressive sequential question generation under dual-graph interaction. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. 225–237.
3. Ming Chen, Zhewei Wei, Zengfeng Huang, Bolin Ding, and Yaliang Li. 2020. Simple and deep graph convolutional networks. In Proceedings of the 37th International Conference on Machine Learning, ICML, Vol. 119. 1725–1735.
4. Deepanway Ghosal, Navonil Majumder, Alexander Gelbukh, Rada Mihalcea, and Soujanya Poria. 2020. COSMIC: COmmonSense knowledge for eMotion Identification in Conversations. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: Findings. 2470–2481.
5. DialogueGCN: A Graph Convolutional Neural Network for Emotion Recognition in Conversation