A Hybrid Network for Large-Scale Action Recognition from RGB and Depth Modalities

Published: 2020-06-10 Issue: 11 Volume: 20 Page: 3305
ISSN: 1424-8220
Container-title: Sensors
Language: en
Short-container-title: Sensors

Author:

Wang Huogen, Song Zhanjie, Li Wanqing, Wang Pichao

Abstract

The paper presents a novel hybrid network for large-scale action recognition from multiple modalities. The network is built upon the proposed weighted dynamic images. It effectively leverages the strengths of the emerging Convolutional Neural Network (CNN) and Recurrent Neural Network (RNN) based approaches to specifically address the challenges that occur in large-scale action recognition and are not fully dealt with by the state-of-the-art methods. Specifically, the proposed hybrid network consists of a CNN based component and an RNN based component. Features extracted by the two components are fused through canonical correlation analysis and then fed to a linear Support Vector Machine (SVM) for classification. The proposed network achieved state-of-the-art results on the ChaLearn LAP IsoGD, NTU RGB+D and Multi-modal & Multi-view & Interactive (M²I) datasets and outperformed existing methods by a large margin (over 10 percentage points in some cases).

Funder

National Natural Science Foundation of China

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering; Biochemistry; Instrumentation; Atomic and Molecular Physics, and Optics; Analytical Chemistry

Link

https://www.mdpi.com/1424-8220/20/11/3305/pdf

References: 82 articles.

1. RGB-D-based human motion recognition with deep learning: A survey

2. Skeleton Optical Spectra-Based Action Recognition Using Convolutional Neural Networks

Cited by 22 articles.

1. Multimodal vision-based human action recognition using deep learning: a review;Artificial Intelligence Review;2024-06-19

2. Comparison Analysis of Multimodal Fusion for Dangerous Action Recognition in Railway Construction Sites;Electronics;2024-06-12

3. Domain-Adaptive and Context-Aware Fall Detection Based on Coarse-Fine Network Learning;International Journal of Innovative Science and Research Technology (IJISRT);2024-05-23

4. SynthAct: Towards Generalizable Human Action Recognition based on Synthetic Data;2024 IEEE International Conference on Robotics and Automation (ICRA);2024-05-13

5. Multimodal action recognition: a comprehensive survey on temporal modeling;Multimedia Tools and Applications;2023-12-22