Words Can Shift: Dynamically Adjusting Word Representations Using Nonverbal Behaviors

Published: 2019-07-17 Volume: 33 Page: 7216-7223
ISSN: 2374-3468
Container-title: Proceedings of the AAAI Conference on Artificial Intelligence
Short-container-title: AAAI

Author:

Wang Yansen, Shen Ying, Liu Zhun, Liang Paul Pu, Zadeh Amir, Morency Louis-Philippe

Abstract

Humans convey their intentions through the usage of both verbal and nonverbal behaviors during face-to-face communication. Speaker intentions often vary dynamically depending on different nonverbal contexts, such as vocal patterns and facial expressions. As a result, when modeling human language, it is essential to not only consider the literal meaning of the words but also the nonverbal contexts in which these words appear. To better model human language, we first model expressive nonverbal representations by analyzing the fine-grained visual and acoustic patterns that occur during word segments. In addition, we seek to capture the dynamic nature of nonverbal intents by shifting word representations based on the accompanying nonverbal behaviors. To this end, we propose the Recurrent Attended Variation Embedding Network (RAVEN) that models the fine-grained structure of nonverbal subword sequences and dynamically shifts word representations based on nonverbal cues. Our proposed model achieves competitive performance on two publicly available datasets for multimodal sentiment analysis and emotion recognition. We also visualize the shifted word representations in different nonverbal contexts and summarize common patterns regarding multimodal variations of word representations.
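
The idea of shifting a word representation based on accompanying nonverbal cues can be illustrated with a small sketch. This is not the authors' RAVEN implementation: the abstract does not give the model's equations, so everything here is an assumption made for illustration. In particular, mean pooling stands in for the recurrent modeling of subword visual and acoustic sequences, the gates `gate_v` and `gate_a` are fixed numbers rather than learned attention weights, and the cap `beta` on the size of the shift is a hypothetical parameter. The function names are likewise hypothetical.

```python
import math


def mean_pool(frames):
    """Average a list of equal-length feature frames into one vector."""
    n = len(frames)
    return [sum(col) / n for col in zip(*frames)]


def norm(vec):
    """Euclidean norm of a vector given as a list of floats."""
    return math.sqrt(sum(x * x for x in vec))


def shift_word(word_vec, visual_frames, acoustic_frames,
               gate_v=0.5, gate_a=0.5, beta=1.0):
    """Displace word_vec by a gated mix of pooled nonverbal features.

    Assumption: the nonverbal frames are already projected to the same
    dimension as the word vector.
    """
    h_v = mean_pool(visual_frames)    # summary of frames within the word span
    h_a = mean_pool(acoustic_frames)
    shift = [gate_v * v + gate_a * a for v, a in zip(h_v, h_a)]
    shift_norm = norm(shift)
    # Cap the displacement so it never exceeds beta times the word's norm.
    if shift_norm == 0:
        scale = 1.0
    else:
        scale = min(1.0, beta * norm(word_vec) / shift_norm)
    return [w + scale * s for w, s in zip(word_vec, shift)]


# The same word vector lands in different places under different cues.
shifted = shift_word([1.0, 0.0],
                     visual_frames=[[0.0, 2.0], [0.0, 4.0]],
                     acoustic_frames=[[0.0, 1.0]])
print(shifted)
```

With all-zero nonverbal frames the function returns the word vector unchanged, which matches the intuition that a word keeps its literal representation when no nonverbal signal accompanies it.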

Publisher

Association for the Advancement of Artificial Intelligence (AAAI)

Subject

General Medicine

Cited by 170 articles.

1. Frame-level nonverbal feature enhancement based sentiment analysis;Expert Systems with Applications;2024-12

2. Granformer: A granular transformer net with linear complexity;Neurocomputing;2024-11

3. Hierarchical denoising representation disentanglement and dual-channel cross-modal-context interaction for multimodal sentiment analysis;Expert Systems with Applications;2024-10

4. TCHFN: Multimodal sentiment analysis based on Text-Centric Hierarchical Fusion Network;Knowledge-Based Systems;2024-09

5. TEMM: text-enhanced multi-interactive attention and multitask learning network for multimodal sentiment analysis;The Journal of Supercomputing;2024-08-12