Dynamic Invariant-Specific Representation Fusion Network for Multimodal Sentiment Analysis-Reference-Cited by-同舟云学术

Dynamic Invariant-Specific Representation Fusion Network for Multimodal Sentiment Analysis

Published:2022-01-24 Issue: Volume:2022 Page:1-14
ISSN:1687-5273
Container-title:Computational Intelligence and Neuroscience
language:en
Short-container-title:Computational Intelligence and Neuroscience

Author:

He Jing¹^ORCID,Yanga Haonan¹^ORCID,Zhang Changfan¹^ORCID,Chen Hongrun¹^ORCID,Xua Yifu¹

Affiliation:

1. College of Electrical and Information Engineering, Hunan University of Technology, Zhuzhou 412007, China

Abstract

Multimodal sentiment analysis (MSA) aims to infer emotions from linguistic, auditory, and visual sequences. Multimodal information representation method and fusion technology are keys to MSA. However, the problem of difficulty in fully obtaining heterogeneous data interactions in MSA usually exists. To solve these problems, a new framework, namely, dynamic invariant-specific representation fusion network (DISRFN), is put forward in this study. Firstly, in order to effectively utilize redundant information, the joint domain separation representations of all modes are obtained through the improved joint domain separation network. Then, the hierarchical graph fusion net (HGFN) is used for dynamically fusing each representation to obtain the interaction of multimodal data for guidance in the sentiment analysis. Moreover, comparative experiments are performed on popular MSA data sets MOSI and MOSEI, and the research on fusion strategy, loss function ablation, and similarity loss function analysis experiments is designed. The experimental results verify the effectiveness of the DISRFN framework and loss function.

Funder

National Natural Science Foundation of China

Publisher

Hindawi Limited

Subject

General Mathematics,General Medicine,General Neuroscience,General Computer Science

Link

http://downloads.hindawi.com/journals/cin/2022/2105593.pdf

Reference49 articles.

1. Quantum-inspired multimodal fusion for video sentiment analysis

2. Deep Multimodal Fusion Autoencoder for Saliency Prediction of RGB-D Images

3. Rapid and robust traffic accident detection based on orientation map