Co-occurrence Feature Learning from Skeleton Data for Action Recognition and Detection with Hierarchical Aggregation-Reference-Cited by-同舟云学术

Co-occurrence Feature Learning from Skeleton Data for Action Recognition and Detection with Hierarchical Aggregation

Published:2018-07 Issue: Volume: Page:
ISSN:
Container-title:Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence
language:
Short-container-title:

Author:

Li Chao¹,Zhong Qiaoyong¹,Xie Di¹,Pu Shiliang¹

Affiliation:

1. Hikvision Research Institute

Abstract

Skeleton-based human action recognition has recently drawn increasing attentions with the availability of large-scale skeleton datasets. The most crucial factors for this task lie in two aspects: the intra-frame representation for joint co-occurrences and the inter-frame representation for skeletons' temporal evolutions. In this paper we propose an end-to-end convolutional co-occurrence feature learning framework. The co-occurrence features are learned with a hierarchical methodology, in which different levels of contextual information are aggregated gradually. Firstly point-level information of each joint is encoded independently. Then they are assembled into semantic representation in both spatial and temporal domains. Specifically, we introduce a global spatial aggregation scheme, which is able to learn superior joint co-occurrence features over local aggregation. Besides, raw skeleton coordinates as well as their temporal difference are integrated with a two-stream paradigm. Experiments show that our approach consistently outperforms other state-of-the-arts on action recognition and detection benchmarks like NTU RGB+D, SBU Kinect Interaction and PKU-MMD.

Publisher

International Joint Conferences on Artificial Intelligence Organization

Cited by 212 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Multi-Scale Structural Graph Convolutional Network for Skeleton-Based Action Recognition;IEEE Transactions on Circuits and Systems for Video Technology;2024-08

2. Linguistic-Driven Partial Semantic Relevance Learning for Skeleton-Based Action Recognition;Sensors;2024-07-26

3. A Novel Symmetric Fine-Coarse Neural Network for 3D Human Action Recognition Based on Point Cloud Sequences;Applied Sciences;2024-07-20

4. SkelVIT: consensus of vision transformers for a lightweight skeleton-based action recognition system;Signal, Image and Video Processing;2024-07-11

5. Temporal Receptive Field Graph Convolutional Network for Skeleton-Based Action Recognition;2024 International Technical Conference on Circuits/Systems, Computers, and Communications (ITC-CSCC);2024-07-02