Memory Fusion Network for Multi-view Sequential Learning-Reference-Cited by-同舟云学术

Memory Fusion Network for Multi-view Sequential Learning

Published:2018-04-27 Issue:1 Volume:32 Page:
ISSN:2374-3468
Container-title:Proceedings of the AAAI Conference on Artificial Intelligence
language:
Short-container-title:AAAI

Author:

Zadeh Amir,Liang Paul Pu,Mazumder Navonil,Poria Soujanya,Cambria Erik,Morency Louis-Philippe

Abstract

Multi-view sequential learning is a fundamental problem in machine learning dealing with multi-view sequences. In a multi-view sequence, there exists two forms of interactions between different views: view-specific interactions and cross-view interactions. In this paper, we present a new neural architecture for multi-view sequential learning called the Memory Fusion Network (MFN) that explicitly accounts for both interactions in a neural architecture and continuously models them through time. The first component of the MFN is called the System of LSTMs, where view-specific interactions are learned in isolation through assigning an LSTM function to each view. The cross-view interactions are then identified using a special attention mechanism called the Delta-memory Attention Network (DMAN) and summarized through time with a Multi-view Gated Memory. Through extensive experimentation, MFN is compared to various proposed approaches for multi-view sequential learning on multiple publicly available benchmark datasets. MFN outperforms all the multi-view approaches. Furthermore, MFN outperforms all current state-of-the-art models, setting new state-of-the-art results for all three multi-view datasets.

Publisher

Association for the Advancement of Artificial Intelligence (AAAI)

Subject

General Medicine

Cited by 236 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Extracting method for fine-grained emotional features in videos;Knowledge-Based Systems;2024-10

2. AdaFN-AG: Enhancing multimodal interaction with Adaptive Feature Normalization for multimodal sentiment analysis;Intelligent Systems with Applications;2024-09

3. TEMM: text-enhanced multi-interactive attention and multitask learning network for multimodal sentiment analysis;The Journal of Supercomputing;2024-08-12

4. Prompt Learning for Multimodal Intent Recognition with Modal Alignment Perception;Cognitive Computation;2024-08-10

5. Machine-Learning-Based Multi-Modal Force Estimation for Steerable Ablation Catheters;IEEE Transactions on Medical Robotics and Bionics;2024-08