Affiliation:
1. School of Automation Southeast University Nanjing China
2. Department of Computer and Information Science University of Massachusetts Dartmouth Massachusetts USA
Abstract
AbstractA novel counting model is presented by the authors to estimate the number of repetitive actions in temporal 3D skeleton data. As per the authors’ knowledge, this is the first work of this kind using skeleton data for high‐precision repetitive action counting. Different from existing works on RGB video data, the authors’ model follows a bottom‐up pipeline to clip the sub‐action first followed by robust aggregation in inference. First, novel counting loss functions and robust inference with backtracking is proposed to pursue precise per‐frame count as well as overall count with boundary frames. Second, an efficient synthetic approach is proposed to augment skeleton data in training and thus avoid time‐consuming repetitive action data collection work. Finally, a challenging human repetitive action counting dataset named VSRep is collected with various types of action to evaluate the proposed model. Experiments demonstrate that the proposed counting model outperforms existing video‐based methods by a large margin in terms of accuracy in real‐time inference.
Publisher
Institution of Engineering and Technology (IET)
Subject
Computer Vision and Pattern Recognition,Software
Reference33 articles.
1. Single person pose estimation: a survey;Zhang F.;arXiv preprint arXiv:210910056,2021
2. A review on human pose estimation;Josyula R.;arXiv preprint arXiv:211006877,2021
3. Skeleton-Based Action Recognition with Spatial Reasoning and Temporal Stack Learning
4. Two-Stream Adaptive Graph Convolutional Networks for Skeleton-Based Action Recognition
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献