A Deep Sequence Learning Framework for Action Recognition in Small-Scale Depth Video Dataset

Published:2022-09-09 Issue:18 Volume:22 Page:6841
ISSN:1424-8220
Container-title:Sensors
language:en
Short-container-title:Sensors

Author:

Bulbul Mohammad Farhad, Ullah Amin, Ali Hazrat, Kim Daijin

Abstract

Deep models that recognize human actions from depth video sequences are scarce compared with models based on RGB and skeleton video sequences. This scarcity limits research progress on depth data, as training deep models with small-scale data is challenging. In this work, we propose a deep sequence classification model that uses depth video data for scenarios in which video data are limited. Rather than summarizing the contents of each frame into a single class, our method directly classifies a depth video, i.e., a sequence of depth frames. First, the proposed system transforms an input depth video into three sequences of multi-view temporal motion frames. Together with the three temporal motion sequences, the input depth frame sequence offers a four-stream representation of the input depth action video. Next, the DenseNet121 architecture, with ImageNet pre-trained weights, is employed to extract discriminative frame-level action features from the depth and temporal motion frames. The four resulting sets of frame-level feature vectors, one per stream, are fed into four bidirectional long short-term memory (BLSTM) networks. The temporal features are further analyzed through multi-head self-attention (MHSA) to capture multi-view sequence correlations. Finally, their outputs are concatenated and processed through dense layers to classify the input depth video. Experimental results on two small-scale benchmark depth datasets, MSRAction3D and DHA, demonstrate that the proposed framework is effective even with insufficient training samples and superior to existing depth data-based action recognition methods.
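
The sequence-modelling part of the pipeline described in the abstract can be illustrated with a minimal PyTorch sketch. It is not the authors' implementation. Several details are assumptions made only for illustration: the class name `FourStreamSequenceClassifier`, the hidden size, the number of attention heads, the dense-layer widths, applying self-attention separately to each stream, and averaging over time before concatenation. The DenseNet121 feature extraction step is omitted; the inputs are assumed to be precomputed 1024-dimensional frame features (the size of DenseNet121's pooled output), and 20 output classes are used to match MSRAction3D.

```python
import torch
import torch.nn as nn


class FourStreamSequenceClassifier(nn.Module):
    """Hypothetical sketch: one BLSTM + self-attention branch per stream."""

    def __init__(self, feat_dim=1024, hidden=128, heads=4,
                 num_classes=20, streams=4):
        super().__init__()
        # One bidirectional LSTM per stream (depth + three motion views).
        self.blstms = nn.ModuleList([
            nn.LSTM(feat_dim, hidden, batch_first=True, bidirectional=True)
            for _ in range(streams)
        ])
        # Assumption: multi-head self-attention is applied per stream.
        self.attns = nn.ModuleList([
            nn.MultiheadAttention(2 * hidden, heads, batch_first=True)
            for _ in range(streams)
        ])
        # Dense layers over the concatenated stream summaries (sizes assumed).
        self.head = nn.Sequential(
            nn.Linear(streams * 2 * hidden, 256),
            nn.ReLU(),
            nn.Dropout(0.5),
            nn.Linear(256, num_classes),
        )

    def forward(self, streams):
        # streams: list of tensors, each shaped (batch, frames, feat_dim).
        pooled = []
        for x, blstm, attn in zip(streams, self.blstms, self.attns):
            seq, _ = blstm(x)               # (batch, frames, 2 * hidden)
            ctx, _ = attn(seq, seq, seq)    # self-attention: query = key = value
            pooled.append(ctx.mean(dim=1))  # assumed pooling: average over time
        return self.head(torch.cat(pooled, dim=1))  # class logits


if __name__ == "__main__":
    model = FourStreamSequenceClassifier().eval()
    # Two videos, 16 frames each, four streams of stand-in frame features.
    video = [torch.randn(2, 16, 1024) for _ in range(4)]
    with torch.no_grad():
        logits = model(video)
    print(logits.shape)  # torch.Size([2, 20])
```

Keeping a separate BLSTM per stream mirrors the "four BLSTM networks" wording of the abstract; whether the paper's attention operates per stream or across streams is not stated there, so that choice should be treated as a placeholder.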

Funder

Institute of Information & communications Technology Planning & Evaluation, Korea government

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering; Biochemistry; Instrumentation; Atomic and Molecular Physics, and Optics; Analytical Chemistry

Link

https://www.mdpi.com/1424-8220/22/18/6841/pdf

Cited by 1 article.

1. Video-based automatic hand hygiene detection for operating rooms using 3D convolutional neural networks;Journal of Clinical Monitoring and Computing;2024-06-19