Sparse Spatial-Temporal Emotion Graph Convolutional Network for Video Emotion Recognition

Published: 2022-09-28 Volume: 2022 Page: 1-10
ISSN: 1687-5273
Container-title: Computational Intelligence and Neuroscience
Language: en
Short-container-title: Computational Intelligence and Neuroscience

Author:

Liu Xiaodong¹, Xu Huating¹, Wang Miao¹

Affiliation:

1. School of Software, Henan University of Engineering, Zhengzhou, China

Abstract

Video emotion recognition has attracted increasing attention. Most existing approaches rely on spatial features extracted from video frames and often ignore the context information in videos and the relationships within it, which restricts their performance. In this study, we propose a video emotion recognition method based on a sparse spatial-temporal emotion graph convolutional network (SE-GCN). For the spatial graph, the emotional relationship between every pair of emotion proposal regions is first calculated, and the sparse spatial graph is constructed according to these relationships. For the temporal graph, the emotional information contained in each emotion proposal region is first analyzed, and the sparse temporal graph is constructed from the emotion proposal regions with rich emotional cues. The spatial-temporal GCN then produces reasoning features of the emotional relationships. Finally, the features of the emotion proposal regions and the spatial-temporal relationship features are fused to recognize the video emotion. Extensive experiments are conducted on four challenging benchmark datasets: MHED, HEIV, VideoEmotion-8, and Ekman-6. The experimental results demonstrate that the proposed method achieves state-of-the-art performance.
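The spatial part of the pipeline described in the abstract can be illustrated with a minimal sketch. The abstract does not say how the emotional relationship between regions is computed or how the graph is sparsified, so the code below makes two assumptions of its own: cosine similarity between region features stands in for the emotional relationship, and each region keeps only its k strongest neighbours. The function names, feature sizes, and random inputs are hypothetical, and the temporal graph is not covered.

```python
import numpy as np

def sparse_adjacency(features, k):
    """Build a sparse, symmetric adjacency matrix over region features.

    Assumption (not from the paper): cosine similarity approximates the
    'emotional relationship', and each node keeps its k strongest neighbours.
    """
    norms = np.linalg.norm(features, axis=1, keepdims=True)
    unit = features / np.maximum(norms, 1e-12)
    sim = unit @ unit.T
    np.fill_diagonal(sim, -np.inf)           # never pick a region as its own neighbour
    n = sim.shape[0]
    top = np.argsort(-sim, axis=1)[:, :k]    # indices of the k most similar regions
    adj = np.zeros((n, n))
    adj[np.repeat(np.arange(n), k), top.ravel()] = 1.0
    return np.maximum(adj, adj.T)            # symmetrise the neighbour relation

def gcn_layer(adj, features, weight):
    """One graph convolution: ReLU(D^-1/2 (A + I) D^-1/2 X W)."""
    a_hat = adj + np.eye(adj.shape[0])       # add self-loops
    d_inv_sqrt = 1.0 / np.sqrt(a_hat.sum(axis=1))
    norm_adj = a_hat * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]
    return np.maximum(norm_adj @ features @ weight, 0.0)

rng = np.random.default_rng(0)
regions = rng.normal(size=(6, 8))            # 6 hypothetical proposal regions, 8-dim features
weight = rng.normal(size=(8, 4))             # hypothetical (untrained) layer weights
adj = sparse_adjacency(regions, k=2)
relation = gcn_layer(adj, regions, weight)   # relationship reasoning features
fused = np.concatenate([regions, relation], axis=1)  # fuse region and relationship features
```

Concatenation is used here only as the simplest possible fusion step; the abstract states that the two feature sets are fused but does not specify how.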

Funder

Foundation of Henan Educational Committee

Publisher

Hindawi Limited

Subject

General Mathematics, General Medicine, General Neuroscience, General Computer Science

Link

http://downloads.hindawi.com/journals/cin/2022/3518879.pdf

Cited by 1 article.

1. Multimodal Emotion Recognition Based on Cascaded Multichannel and Hierarchical Fusion;Computational Intelligence and Neuroscience;2023-01-05