A novel human activity recognition architecture: using residual inception ConvLSTM layer

Published:2022-05-21 Issue:1 Volume:69
ISSN:1110-1903
Container-title:Journal of Engineering and Applied Science
language:en
Short-container-title:J. Eng. Appl. Sci.

Author:

Khater Sarah, Hadhoud Mayada, Fayek Magda B.

Abstract

Human activity recognition (HAR) is a challenging problem: it requires identifying an activity performed by a single individual or a group of people from spatiotemporal data. Many computer vision applications require a solution to HAR, including surveillance systems, medical and health care monitoring applications, and smart home assistant devices. The rapid development of machine learning has led to great advances in HAR solutions. One such solution is the ConvLSTM architecture, which has recently been used in many spatiotemporal computer vision applications. In this paper, we introduce a new layer, the residual inception convolutional recurrent layer (ResIncConvLSTM), a variation of the ConvLSTM layer. We also propose a novel architecture that uses the introduced layer to solve HAR. The proposed architecture improves accuracy by 7% over the ConvLSTM baseline architecture. Comparisons are made in terms of classification accuracy. The architectures are trained on the KTH dataset and tested on both the KTH and Weizmann datasets. They are also trained and tested on a subset of the UCF Sports Action dataset. Experimental results also show the effectiveness of the proposed architecture compared to other state-of-the-art architectures.

Publisher

Springer Science and Business Media LLC

Subject

General Engineering

Link

https://link.springer.com/content/pdf/10.1186/s44147-022-00098-0.pdf

References: 53 articles.

1. Sebe N, Cohen I, Garg A, Huang TS (2005) Machine learning in computer vision, vol. 29. SSBM, Berlin.

2. Beddiar DR, Nini B, Sabokrou M, Hadid A (2020) Vision-based human activity recognition: a survey. Multimed Tools Appl 79(41):30509–30555.

3. Zheng W-S, Gong S, Xiang T (2011) Person re-identification by probabilistic relative distance comparison. In: CVPR 2011, 649–656. IEEE, New York.

4. Shi X, Chen Z, Wang H, Yeung D-Y, Wong W-K, Woo W-c (2015) Convolutional LSTM network: a machine learning approach for precipitation nowcasting. arXiv preprint arXiv:1506.04214.

5. Song Y, Li C, Wang Y (2017) Pixel-wise object tracking. arXiv preprint arXiv:1711.07377.

Cited by 12 articles.

1. Human activity recognition: A comprehensive review;Expert Systems;2024-07-27

2. Convolutional Long Short-Term Memory (ConvLSTM)-Based Prediction of Voltage Stability in a Microgrid;Energies;2024-04-23

3. A Multi-batch Differential Binary Motion Image and Deep Hashing Network for Human Action Recognition;Lecture Notes in Networks and Systems;2024

4. Group Activity Recognition in Visual Data Using Deep Learning Framework;2023 2nd International Conference on Futuristic Technologies (INCOFT);2023-11-24

5. Deep Learning Approach for Human Action Recognition Using a Time Saliency Map Based on Motion Features Considering Camera Movement and Shot in Video Image Sequences;Information;2023-11-15