3DFCNN: real-time action recognition using 3D deep neural networks with raw depth information-Reference-Cited by-同舟云学术

3DFCNN: real-time action recognition using 3D deep neural networks with raw depth information

Published:2022-03-19 Issue:17 Volume:81 Page:24119-24143
ISSN:1380-7501
Container-title:Multimedia Tools and Applications
language:en
Short-container-title:Multimed Tools Appl

Author:

Sánchez-Caballero Adrián^ORCID,de López-Diz Sergio^ORCID,Fuentes-Jimenez David^ORCID,Losada-Gutiérrez Cristina^ORCID,Marrón-Romera Marta^ORCID,Casillas-Pérez David^ORCID,Sarker Mohammad Ibrahim^ORCID

Abstract

AbstractThis work describes an end-to-end approach for real-time human action recognition from raw depth image-sequences. The proposal is based on a 3D fully convolutional neural network, named 3DFCNN, which automatically encodes spatio-temporal patterns from raw depth sequences. The described 3D-CNN allows actions classification from the spatial and temporal encoded information of depth sequences. The use of depth data ensures that action recognition is carried out protecting people’s privacy, since their identities can not be recognized from these data. The proposed 3DFCNN has been optimized to reach a good performance in terms of accuracy while working in real-time. Then, it has been evaluated and compared with other state-of-the-art systems in three widely used public datasets with different characteristics, demonstrating that 3DFCNN outperforms all the non-DNN-based state-of-the-art methods with a maximum accuracy of 83.6% and obtains results that are comparable to the DNN-based approaches, while maintaining a much lower computational cost of 1.09 seconds, what significantly increases its applicability in real-world environments.

Funder

Ministerio de Economía y Competitividad

Universidad de Alcalá

Publisher

Springer Science and Business Media LLC

Subject

Computer Networks and Communications,Hardware and Architecture,Media Technology,Software

Link

https://link.springer.com/content/pdf/10.1007/s11042-022-12091-z.pdf

Reference100 articles.

1. Al-Akam R, Paulus D, Gharabaghi D (2018) Human action recognition based on 3d convolution neural networks from rgbd videos. In: WSCG 2018: Poster papers proceedings: 26th international conference in central europe on computer graphics, visualization and computer vision, pp 18–26

2. Ashraf N, Sun C, Foroosh H (2014) View invariant action recognition using projective depth. Comput Vis Image Underst 123:41–52