3D-CNN-Based Fused Feature Maps with LSTM Applied to Action Recognition-Reference-Cited by-同舟云学术

3D-CNN-Based Fused Feature Maps with LSTM Applied to Action Recognition

Published:2019-02-13 Issue:2 Volume:11 Page:42
ISSN:1999-5903
Container-title:Future Internet
language:en
Short-container-title:Future Internet

Author:

Arif Sheeraz,Wang Jing,Ul Hassan Tehseen,Fei Zesong

Abstract

Human activity recognition is an active field of research in computer vision with numerous applications. Recently, deep convolutional networks and recurrent neural networks (RNN) have received increasing attention in multimedia studies, and have yielded state-of-the-art results. In this research work, we propose a new framework which intelligently combines 3D-CNN and LSTM networks. First, we integrate discriminative information from a video into a map called a ‘motion map’ by using a deep 3-dimensional convolutional network (C3D). A motion map and the next video frame can be integrated into a new motion map, and this technique can be trained by increasing the training video length iteratively; then, the final acquired network can be used for generating the motion map of the whole video. Next, a linear weighted fusion scheme is used to fuse the network feature maps into spatio-temporal features. Finally, we use a Long-Short-Term-Memory (LSTM) encoder-decoder for final predictions. This method is simple to implement and retains discriminative and dynamic information. The improved results on benchmark public datasets prove the effectiveness and practicability of the proposed method.

Publisher

MDPI AG

Subject

Computer Networks and Communications

Link

http://www.mdpi.com/1999-5903/11/2/42/pdf

Reference49 articles.

1. A survey on vision-based human action recognition

2. On Space-Time Interest Points

Cited by 42 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Remote Sensing Image Change Detection Based on Deep Learning: Multi-Level Feature Cross-Fusion with 3D-Convolutional Neural Networks;Applied Sciences;2024-07-18

2. A novel deep learning method based on 2-D CNNs and GRUs for permeability prediction of tight sandstone;Geoenergy Science and Engineering;2024-07

3. A novel multi-scale violence and public gathering dataset for crowd behavior classification;Frontiers in Computer Science;2024-05-10

4. Real-Time Human Action Recognition by using R(2+1)D Convolutional Neural Network;2024 3rd International Conference on Artificial Intelligence For Internet of Things (AIIoT);2024-05-03

5. Enhanced Two-Stream Bayesian Hyper Parameter Optimized 3D-CNN Inception-v3 Based Drop-ConvLSTM2D Deep Learning Model for Human Action Recognition;Information Technology and Control;2024-03-22