Spatiotemporal Interaction Residual Networks with Pseudo3D for Video Action Recognition-Reference-Cited by-同舟云学术

Spatiotemporal Interaction Residual Networks with Pseudo3D for Video Action Recognition

Published:2020-06-01 Issue:11 Volume:20 Page:3126
ISSN:1424-8220
Container-title:Sensors
language:en
Short-container-title:Sensors

Author:

Chen Jianyu,Kong Jun,Sun Hui,Xu Hui,Liu Xiaoli,Lu Yinghua,Zheng Caixia^ORCID

Abstract

Action recognition is a significant and challenging topic in the field of sensor and computer vision. Two-stream convolutional neural networks (CNNs) and 3D CNNs are two mainstream deep learning architectures for video action recognition. To combine them into one framework to further improve performance, we proposed a novel deep network, named the spatiotemporal interaction residual network with pseudo3D (STINP). The STINP possesses three advantages. First, the STINP consists of two branches constructed based on residual networks (ResNets) to simultaneously learn the spatial and temporal information of the video. Second, the STINP integrates the pseudo3D block into residual units for building the spatial branch, which ensures that the spatial branch can not only learn the appearance feature of the objects and scene in the video, but also capture the potential interaction information among the consecutive frames. Finally, the STINP adopts a simple but effective multiplication operation to fuse the spatial branch and temporal branch, which guarantees that the learned spatial and temporal representation can interact with each other during the entire process of training the STINP. Experiments were implemented on two classic action recognition datasets, UCF101 and HMDB51. The experimental results show that our proposed STINP can provide better performance for video recognition than other state-of-the-art algorithms.

Funder

National Natural Science Foundation of China

Fundamental Research Funds for the Central Universities

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Biochemistry,Instrumentation,Atomic and Molecular Physics, and Optics,Analytical Chemistry

Link

https://www.mdpi.com/1424-8220/20/11/3126/pdf

Reference67 articles.

1. Rank Pooling for Action Recognition

2. Semantic human activity recognition: A literature review

3. Action Recognition and Prediction: A Survey Human;Kong;arXiv,2018

Cited by 9 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Detection method of limb movement in competitive sports training based on deep learning;Journal of Computational Methods in Sciences and Engineering;2023-05-30

2. Human Action Recognition Research Based on Fusion TS-CNN and LSTM Networks;Arabian Journal for Science and Engineering;2022-09-20

3. Detection Method of Limb Movement in Competitive Sports Training Based on Deep Learning;Journal of Mathematics;2022-02-18

4. A novel motion recognition method based on improved two-stream convolutional neural network and sparse feature fusion;Computer Science and Information Systems;2022

5. Research on automatic recognition method of basketball shooting action based on background subtraction method;International Journal of Biometrics;2022