Human activity prediction using saliency-aware motion enhancement and weighted LSTM network

Published:2021-01-11 Issue:1 Volume:2021
ISSN:1687-5281
Container-title:EURASIP Journal on Image and Video Processing
language:en
Short-container-title:J Image Video Proc.

Author:

Weng Zhengkui, Li Wuzhao, Jin Zhipeng

Abstract

In recent years, great progress has been made in recognizing human activities in complete image sequences. However, predicting a human activity from the early part of a video remains a challenging task. In this paper, a novel framework named weighted long short-term memory network (WLSTM) with saliency-aware motion enhancement (SME) is proposed for video activity prediction. First, a boundary-prior based motion segmentation method is introduced that uses the shortest geodesic distance in an undirected weighted graph. Next, a dynamic contrast segmentation strategy is proposed to segment the moving object in a complex environment. Then, the SME is constructed to enhance the moving object by suppressing irrelevant background in each frame. Moreover, an effective long-range attention mechanism is designed to handle the long-term dependency of complex non-periodic activities by automatically focusing on the semantically critical frames instead of processing all sampled frames equally. Thus, the learned weights highlight the discriminative frames and reduce temporal redundancy. Finally, we evaluate our framework on the UT-Interaction and sub-JHMDB datasets. The experimental results show that WLSTM with SME statistically outperforms a number of state-of-the-art methods on both datasets.
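The boundary-prior step mentioned in the abstract can be illustrated with a small sketch. This is not the authors' implementation: it assumes a 4-connected pixel grid as the undirected weighted graph, uses the absolute difference between neighbouring values as the edge weight, and treats every border cell as a background seed. The function name `boundary_geodesic_distance` and the toy `motion` map are hypothetical. Under those assumptions, a cell's shortest geodesic distance to the boundary can serve as a rough saliency score.

```python
import heapq


def boundary_geodesic_distance(grid):
    """Shortest geodesic distance from every cell to the image boundary.

    Assumption: cells form an undirected 4-connected graph, and the edge
    weight between two neighbours is the absolute difference of their values
    (a stand-in for a motion or appearance difference).
    """
    rows, cols = len(grid), len(grid[0])
    dist = [[float("inf")] * cols for _ in range(rows)]
    heap = []
    # Boundary prior: every border cell is a background seed at distance 0.
    for r in range(rows):
        for c in range(cols):
            if r in (0, rows - 1) or c in (0, cols - 1):
                dist[r][c] = 0.0
                heapq.heappush(heap, (0.0, r, c))
    # Multi-source Dijkstra over the grid graph.
    while heap:
        d, r, c = heapq.heappop(heap)
        if d > dist[r][c]:
            continue  # stale queue entry
        for dr, dc in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            nr, nc = r + dr, c + dc
            if 0 <= nr < rows and 0 <= nc < cols:
                nd = d + abs(grid[nr][nc] - grid[r][c])
                if nd < dist[nr][nc]:
                    dist[nr][nc] = nd
                    heapq.heappush(heap, (nd, nr, nc))
    return dist


# Toy motion-magnitude map: one moving cell surrounded by static background.
motion = [
    [0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0],
    [0, 0, 9, 0, 0],
    [0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0],
]
saliency = boundary_geodesic_distance(motion)
```

In this toy case the static cells connect to the border through zero-weight edges and keep a distance of 0, while the moving cell can only be reached across a large difference and so stands out. The paper's actual graph nodes and edge weights may differ from this sketch.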

Funder

Natural Science Foundation of Zhejiang Province

Jiaxing Public Welfare Research Project

Publisher

Springer Science and Business Media LLC

Subject

Electrical and Electronic Engineering,Information Systems,Signal Processing

Link

http://link.springer.com/content/pdf/10.1186/s13640-020-00544-0.pdf

References: 34 articles.

1. L. Wang, Three-dimensional convolutional restricted Boltzmann machine for human behavior recognition from RGB-D video. EURASIP J. Image Video Process. 2018, 120 (2018)

2. X. Wang, L. Gao, J. Song, et al., Beyond frame-level CNN: saliency-aware 3D CNN with LSTM for video action recognition. IEEE Signal Process. Lett. 24(4), 510–514 (2017)

3. Z. Weng, Y. Guan, Trajectory-aware three-stream CNN for video action recognition. J. Electron. Imaging 28(2), 021004 (2018)

4. Z. Weng, Y. Guan, Action recognition using length-variable edge trajectory and spatio-temporal motion skeleton descriptor. EURASIP J. Image Video Process. 2018, 8 (2018)

5. H. Bilen, B. Fernando, E. Gavves, et al., Action recognition with dynamic image networks. IEEE Trans. Pattern Anal. Mach. Intell. 40(12), 2799–2813 (2018)

Cited by 9 articles.

1. Automatic Rifle and Sniper Detection Using Pose Estimation and CNN With BiLSTMs;Practice, Progress, and Proficiency in Sustainability;2024-02-09

2. TricP: A Novel Approach for Human Activity Recognition Using Tricky Predator Optimization Approach Based on Inception and LSTM;2024

3. Human Activity Recognition using ShuffleNetV2 Model;2023 Intelligent Computing and Control for Engineering and Business Systems (ICCEBS);2023-12-14

4. HRTransNet: HRFormer-Driven Two-Modality Salient Object Detection;IEEE Transactions on Circuits and Systems for Video Technology;2023-02

5. BGRDNet: RGB-D salient object detection with a bidirectional gated recurrent decoding network;Multimedia Tools and Applications;2022-03-23