Affiliation:
1. Anqing Normal University
Abstract
In skeleton-based action recognition, treating skeleton data as pseudo-images and processing them with convolutional neural networks (CNNs) has proven effective. However, most existing CNN-based approaches model information only at the joint level and ignore the size and direction of the skeleton edges, which play an important role in action recognition; such approaches may therefore be suboptimal. In addition, existing approaches rarely use the directionality of human motion to describe the motion variations of an action, although doing so is a more natural and reasonable way to model action sequences. In this work, we propose a novel direction-guided two-stream convolutional neural network (DG-2sCNN) for skeleton-based action recognition. In the first stream, our model focuses on the directed edge-level information that we define in the skeleton data (edge and edge_motion information) to explore the spatio-temporal features of the action. In the second stream, since motion is directional, we define different skeleton edge directions and extract motion information (translation and rotation) along each direction to better exploit the motion features of the action. We further propose describing human motion as a combination of translation and rotation, and explore how the two should be integrated. We conducted extensive experiments on two challenging datasets, NTU-RGB+D 60 and NTU-RGB+D 120, to verify that the proposed method outperforms state-of-the-art methods. The experimental results demonstrate that the proposed direction-guided edge-level information and motion information complement each other for better action recognition.
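The abstract does not give formulas for the edge-level and motion quantities, so the following is only a minimal sketch of one plausible reading, not the authors' implementation. It assumes that an edge is the vector between two connected joints, that edge_motion is the frame-to-frame difference of that vector, and that rotation can be approximated by the angle between an edge at consecutive frames. The toy three-joint skeleton, the `EDGES` list, and the function names are all hypothetical.

```python
import numpy as np

# Hypothetical toy skeleton: three joints connected in a chain (0 -> 1 -> 2).
# Each pair is (source joint, target joint); this is not the NTU-RGB+D layout.
EDGES = [(0, 1), (1, 2)]


def edge_features(joints, edges=EDGES, reverse=False):
    """Compute directed edge vectors and their temporal differences.

    joints: array of shape (T, V, 3) with T frames and V joints.
    Returns (edge, edge_motion), each of shape (T, E, 3).
    """
    joints = np.asarray(joints, dtype=float)
    src = [e[0] for e in edges]
    dst = [e[1] for e in edges]
    if reverse:
        # Flipping the edge direction negates every edge vector.
        src, dst = dst, src
    edge = joints[:, dst] - joints[:, src]
    # Assumed definition: edge_motion is the change of each edge between
    # consecutive frames; the first frame has no predecessor and stays zero.
    edge_motion = np.zeros_like(edge)
    edge_motion[1:] = edge[1:] - edge[:-1]
    return edge, edge_motion


def rotation_angles(edge, eps=1e-8):
    """Angle in radians between each edge at frame t and frame t + 1.

    Returns an array of shape (T - 1, E). Assumed proxy for rotation.
    """
    a, b = edge[:-1], edge[1:]
    norms = np.linalg.norm(a, axis=-1) * np.linalg.norm(b, axis=-1)
    cos = (a * b).sum(axis=-1) / (norms + eps)
    return np.arccos(np.clip(cos, -1.0, 1.0))
```

Under these assumptions, calling `edge_features(joints, reverse=True)` yields the same edges with the opposite orientation, which is one simple way to obtain features "in different directions" from the same skeleton.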
Publisher
Research Square Platform LLC