Lite-3DCNN Combined with Attention Mechanism for Complex Human Movement Recognition

Published: 2022-09-09 Volume: 2022 Page: 1-9
ISSN:1687-5273
Container-title:Computational Intelligence and Neuroscience
language:en
Short-container-title:Computational Intelligence and Neuroscience

Author:

Zhu Maochang¹, Bin Sheng¹, Sun Gengxin¹

Affiliation:

1. College of Computer Science & Technology, Qingdao University, Qingdao 266071, China

Abstract

The three-dimensional convolutional network (3DCNN) is an essential topic in motion recognition research. This paper optimizes the traditional three-dimensional convolutional network, introduces a self-attention mechanism, and proposes a new network model for analyzing and processing complex human motion videos. For data pre-processing, the study uses average frame-skipping sampling, scaling, and one-hot encoding to retain more features from the limited data. The resulting model is a lightweight three-dimensional convolutional network combined with an attention mechanism; its number of parameters is reduced by more than 90%, to only about 1.7 million. A comparison of different models across different classification tasks shows that the proposed model performs well in classifying complex human motion videos, with a recognition rate 1%–8% higher than that of the C3D model.
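The abstract does not specify how the frame-skipping sampling or the one-hot encoding is implemented. As a rough illustration only, the Python sketch below shows one common way such pre-processing could look: it picks a fixed number of frames at evenly spaced positions from a clip and turns a class index into a one-hot vector. The function names `sample_frames_evenly` and `one_hot`, the choice of segment midpoints, and the handling of short clips are all assumptions made for this example, not details taken from the paper. Spatial scaling of each frame is left out because it would require an image library.

```python
from typing import List, Sequence, TypeVar

T = TypeVar("T")


def sample_frames_evenly(frames: Sequence[T], num_samples: int) -> List[T]:
    """Pick num_samples frames at evenly spaced positions (hypothetical helper).

    The clip is split into num_samples equal-width segments and the frame
    nearest the middle of each segment is kept. This is an assumed scheme;
    the paper's exact sampling rule is not given in the abstract.
    """
    if num_samples <= 0:
        raise ValueError("num_samples must be positive")
    if len(frames) == 0:
        raise ValueError("frames must not be empty")
    n = len(frames)
    # For clips shorter than num_samples, some indices repeat.
    indices = [min(n - 1, int((i + 0.5) * n / num_samples))
               for i in range(num_samples)]
    return [frames[i] for i in indices]


def one_hot(label_index: int, num_classes: int) -> List[float]:
    """Encode a class index as a one-hot vector (hypothetical helper)."""
    if not 0 <= label_index < num_classes:
        raise ValueError("label_index must be in [0, num_classes)")
    vector = [0.0] * num_classes
    vector[label_index] = 1.0
    return vector


if __name__ == "__main__":
    clip = list(range(100))  # stand-in for 100 decoded video frames
    print(sample_frames_evenly(clip, 10))
    print(one_hot(2, 4))
```

Sampling a fixed number of frames per clip gives every video the same temporal length, which is what a 3D convolutional network with a fixed input shape would typically need; whether the authors sample segment midpoints, segment starts, or something else is not stated.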

Funder

Natural Science Foundation of Shandong Province

Publisher

Hindawi Limited

Subject

General Mathematics, General Medicine, General Neuroscience, General Computer Science

Link

http://downloads.hindawi.com/journals/cin/2022/4816549.pdf

Reference: 26 articles.

1. Efficient multi-scale 3D CNN with fully connected CRF for accurate brain lesion segmentation

2. Action Recognition Based on Sequential 2D-CNN for Surveillance Systems

3. Bidirectional LSTM with saliency-aware 3D-CNN features for human action recognition