Multi‐temporal scale aggregation refinement graph convolutional network for skeleton‐based action recognition-Reference-Cited by-同舟云学术

Multi‐temporal scale aggregation refinement graph convolutional network for skeleton‐based action recognition

Published:2023-09-25 Issue: Volume: Page:
ISSN:1546-4261
Container-title:Computer Animation and Virtual Worlds
language:en
Short-container-title:Computer Animation & Virtual

Author:

Li Xuanfeng¹,Lu Jian¹,Zhou Jian¹,Liu Wei¹,Zhang Kaibing¹

Affiliation:

1. School of Electronics and Information Xi'an Polytechnic University Xi'an China

Abstract

AbstractSkeleton‐based human action recognition is gaining significant attention and finding widespread application in various fields, such as virtual reality and human‐computer interaction systems. Recent studies have highlighted the effectiveness of graph convolutional network (GCN) based methods in this task, leading to a remarkable improvement in prediction accuracy. However, most GCN‐based methods overlook the varying contributions of self, centripetal and centrifugal subsets. Besides, only a single‐scale temporal feature is adopted, and the multi‐temporal scale information is ignored. To this end, firstly, in order to differentiate the importance of different skeleton subsets, we develop a refinement graph convolution, which can adaptively learn a weight for each subset feature. Secondly, a multi‐temporal scale aggregation module is proposed to extract more discriminative temporal dynamic information. Furthermore, a multi‐temporal scale aggregation refinement graph convolutional network (MTSA‐RGCN) is proposed, and four‐stream structure is also adopted in this paper, which can comprehensively model complementary features and eventually achieves a significant performance boost. In the empirical experiments, the performance of our approach has been greatly improved on both NTU‐RGB+D 60 and NTU‐RGB+D 120 datasets, compared to other state‐of‐the‐art methods.

Funder

National Natural Science Foundation of China

Publisher

Wiley

Subject

Computer Graphics and Computer-Aided Design,Software

Link

https://onlinelibrary.wiley.com/doi/pdf/10.1002/cav.2221

Reference35 articles.

1. A survey of vision-based methods for action representation, segmentation and recognition

2. Skeleton-Aided Articulated Motion Generation

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. TP-LSM: visual temporal pyramidal time modeling network to multi-label action detection in image-based AI;The Visual Computer;2024-08-30