Authors:
Dong Shaohua, Fan Xiaochao, Ma Xinchun
Abstract
Multimodal sentiment analysis is a downstream task of sentiment analysis that currently attracts considerable attention. Previous work in multimodal sentiment analysis has focused on the representation and fusion of modalities, capturing the underlying semantic relationships between modalities by considering contextual information. While this approach is adequate for simple contextual comments, more complex comments require the integration of external knowledge to obtain more accurate sentiment information. However, incorporating external knowledge into sentiment analysis to enhance information complementarity has not been thoroughly investigated. To address this, we propose a multichannel cross-modal feedback interaction model that incorporates a knowledge graph into multimodal sentiment analysis. The proposed model consists of two main components: a cross-modal feedback recurrent interaction module and an external knowledge module for capturing latent information. The cross-modal interaction employs a self-feedback mechanism during network training: it extracts a feature representation of each modality and uses that representation to mask the corresponding sensory input, allowing the model to perform feedback-based feature masking. The external knowledge module captures latent semantic representations in the textual data through knowledge graph embedding. Finally, a global feature fusion module integrates the multichannel multimodal information. On two publicly available datasets, our method achieves competitive accuracy and F1 scores compared with state-of-the-art models and several baselines.
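The abstract describes three components: feedback-based feature masking per modality, knowledge graph embedding for the text channel, and global multichannel fusion. The sketch below is a minimal, illustrative PyTorch interpretation of that pipeline, not the authors' released code; all layer types, dimensions, the sigmoid gating form, and the entity-averaging step are assumptions made for illustration.

```python
# Minimal sketch (assumed architecture, not the paper's implementation) of:
# (1) a feedback step that masks a modality's input with a gate derived from
#     its own learned representation,
# (2) a knowledge-graph embedding lookup supplying latent semantic features,
# (3) a global fusion module that concatenates all channels for classification.
import torch
import torch.nn as nn


class FeedbackMasking(nn.Module):
    """One feedback iteration: encode a modality, then re-weight (mask) its input."""

    def __init__(self, in_dim: int, hid_dim: int):
        super().__init__()
        self.encoder = nn.GRU(in_dim, hid_dim, batch_first=True)
        self.gate = nn.Linear(hid_dim, in_dim)  # maps representation back to a mask

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        _, h = self.encoder(x)                   # h: (1, B, hid_dim)
        mask = torch.sigmoid(self.gate(h[-1]))   # (B, in_dim), values in (0, 1)
        masked_x = x * mask.unsqueeze(1)         # feedback-masked sensory input
        _, h2 = self.encoder(masked_x)           # re-encode the masked sequence
        return h2[-1]


class MultichannelFusion(nn.Module):
    """Fuse text/audio/visual channels plus knowledge-graph features."""

    def __init__(self, dims: dict, hid_dim: int, num_entities: int,
                 kg_dim: int, num_classes: int):
        super().__init__()
        self.channels = nn.ModuleDict(
            {m: FeedbackMasking(d, hid_dim) for m, d in dims.items()}
        )
        # In practice these would be pre-trained KG embeddings; random init here.
        self.kg_embedding = nn.Embedding(num_entities, kg_dim)
        self.classifier = nn.Linear(hid_dim * len(dims) + kg_dim, num_classes)

    def forward(self, inputs: dict, entity_ids: torch.Tensor) -> torch.Tensor:
        feats = [self.channels[m](x) for m, x in inputs.items()]
        kg_feat = self.kg_embedding(entity_ids).mean(dim=1)   # average linked entities
        fused = torch.cat(feats + [kg_feat], dim=-1)          # global feature fusion
        return self.classifier(fused)


# Toy usage: batch of 2, sequence length 20, hypothetical feature sizes.
model = MultichannelFusion(
    dims={"text": 300, "audio": 74, "visual": 35},
    hid_dim=128, num_entities=1000, kg_dim=64, num_classes=3,
)
batch = {
    "text": torch.randn(2, 20, 300),
    "audio": torch.randn(2, 20, 74),
    "visual": torch.randn(2, 20, 35),
}
logits = model(batch, entity_ids=torch.randint(0, 1000, (2, 5)))
print(logits.shape)  # torch.Size([2, 3])
```

The gating choice (a sigmoid mask applied multiplicatively to the raw input before re-encoding) is one plausible reading of "feedback-based feature masking"; the paper itself may use a different masking or recurrence scheme.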
Publisher
Springer Science and Business Media LLC