Spatio–Temporal Image Representation of 3D Skeletal Movements for View-Invariant Action Recognition with Deep Convolutional Neural Networks

Published:2019-04-24 Issue:8 Volume:19 Page:1932
ISSN:1424-8220
Container-title:Sensors
language:en
Short-container-title:Sensors

Author:

Pham Huy Hieu, Salmane Houssam, Khoudour Louahdi, Crouzil Alain, Zegers Pablo, Velastin Sergio A.

Abstract

Designing motion representations for 3D human action recognition from skeleton sequences is an important yet challenging task. An effective representation should be robust to noise, invariant to viewpoint changes, and deliver good performance at low computational cost. The two main challenges are how to efficiently represent the spatio–temporal patterns of skeletal movements and how to learn their discriminative features for classification. This paper presents a novel skeleton-based representation and a deep learning framework for 3D action recognition using RGB-D sensors. We propose an action map called SPMF (Skeleton Posture-Motion Feature), a compact image representation built from skeleton poses and their motions. An Adaptive Histogram Equalization (AHE) algorithm is then applied to the SPMF to enhance its local patterns and form an enhanced action map, the Enhanced-SPMF. For learning and classification, we use Deep Convolutional Neural Networks based on the DenseNet architecture to directly learn an end-to-end mapping between input skeleton sequences and their action labels via the Enhanced-SPMFs. The proposed method is evaluated on four challenging benchmark datasets covering individual actions, interactions, multiview and large-scale data. The experimental results show that the proposed method outperforms previous state-of-the-art approaches on all benchmark tasks while requiring little computational time for training and inference.
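The general idea of turning a skeleton sequence into an image and then enhancing its contrast can be illustrated with a minimal sketch. This is not the paper's SPMF construction: the function names, the choice of raw joint coordinates and frame-to-frame differences as features, and the image layout are all assumptions made for illustration, and plain global histogram equalization is used here as a simpler stand-in for the adaptive (AHE) variant named in the abstract.

```python
import numpy as np


def to_uint8_image(values):
    """Min-max scale a (T, J, 3) array to 0..255 and lay it out as a (J, T, 3) image.

    Hypothetical helper: joints become rows, frames become columns, and the
    x/y/z coordinates become the three colour channels.
    """
    values = np.asarray(values, dtype=np.float64)
    lo, hi = values.min(), values.max()
    scale = hi - lo if hi > lo else 1.0
    img = np.round(255.0 * (values - lo) / scale).astype(np.uint8)
    return img.transpose(1, 0, 2)


def posture_motion_map(seq):
    """Build an illustrative posture+motion image from a (T, J, 3) skeleton sequence.

    Assumption: 'motion' is taken as the displacement of each joint between
    consecutive frames. The pose and motion images are placed side by side.
    """
    seq = np.asarray(seq, dtype=np.float64)
    motion = seq[1:] - seq[:-1]              # shape (T-1, J, 3)
    pose_img = to_uint8_image(seq)           # shape (J, T, 3)
    motion_img = to_uint8_image(motion)      # shape (J, T-1, 3)
    return np.concatenate([pose_img, motion_img], axis=1)


def equalize_channel(channel):
    """Global histogram equalization of one uint8 channel (stand-in for AHE)."""
    hist = np.bincount(channel.ravel(), minlength=256)
    cdf = hist.cumsum()
    cdf_min = cdf[cdf > 0][0]
    denom = cdf[-1] - cdf_min
    if denom == 0:                           # constant channel: nothing to spread
        return channel.copy()
    lut = np.round(255.0 * (cdf - cdf_min) / denom).clip(0, 255).astype(np.uint8)
    return lut[channel]


def enhance(image):
    """Equalize each colour channel of an (H, W, 3) uint8 image independently."""
    return np.stack([equalize_channel(image[..., c]) for c in range(3)], axis=-1)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    skeleton = rng.normal(size=(30, 20, 3))  # 30 frames, 20 joints (synthetic data)
    action_map = enhance(posture_motion_map(skeleton))
    print(action_map.shape, action_map.dtype)
```

The resulting array has one row per joint and one column per pose or motion frame, so it could be resized and passed to an image classifier; any such downstream step is outside this sketch.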

Funder

Seventh Framework Programme

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering; Biochemistry; Instrumentation; Atomic and Molecular Physics, and Optics; Analytical Chemistry

Link

https://www.mdpi.com/1424-8220/19/8/1932/pdf

References: 105 articles.

1. Human activity analysis

2. Detecting Irregularities in Images and in Video

3. Observing Human-Object Interactions: Using Spatial and Functional Compatibility for Recognition

4. Recognizing Human-Object Interactions in Still Images by Modeling the Mutual Context of Objects and Human Poses;Yao;IEEE Trans. Pattern Anal. Mach. Intell.,2012

Cited by 27 articles.

1. Efficient abnormal behavior detection with adaptive weight distribution;Neurocomputing;2024-10

2. Human Object Interactivity Detection using Graph Neural Networks;2024 International Conference on Electronics, Computing, Communication and Control Technology (ICECCC);2024-05-02

3. Strategic Pairwise Selection for Labeling High-Risk Action from Video-Based Data;Communications in Computer and Information Science;2024

4. ENGA: Elastic Net-Based Genetic Algorithm for human action recognition;Expert Systems with Applications;2023-10

5. A Multimodal Fusion Approach for Human Activity Recognition;International Journal of Neural Systems;2022-12-27