A comparative study using improved LSTM /GRU for human action recognition-Reference-Cited by-同舟云学术

A comparative study using improved LSTM /GRU for human action recognition

Published:2022-12-21 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Muhamad Azhee Wria¹,Mohammed Aree Ali¹

Affiliation:

1. University of Sulaymaniyah

Abstract

Abstract One of the deep learning algorithms for sequence data analysis is a recurrent neural network (RNN). In a conventional neural network, the inputs and the outputs are independent of each other. At the same time, RNN is considered a type of Neural Network where the output from the previous step feeds information to the current phase. It has many applications, including video sentiment classification, speech tagging, and machine translation. Recurrent networks are also distributed parameters across each layer of the network. Several layers are stacked together to increase depth in forwarding and backward information of long short-term memory (LSTM) and Gated Recurrent Unit (GRU). This paper proposes two models for various action recognitions using LSTM and GRU, respectively. The first model was improved by increasing the LSTM layers to four and the number of units in each layer to 128 cells. While in the second model, GRU layers were extended to two layers with 128 cells, and the (update and reset) gates are modified based on the previous and current input. A comparative study was conducted during the experimental tests performed on the UCF101 action dataset regarding the accuracy rate for both models. Test results indicate that the accuracy has a significant improvement compared with other state-of-the-arts action recognitions, which are 95.19% and 92.9% for both improved LSTM and GRU, respectively.

Publisher

Research Square Platform LLC

Reference39 articles.

1. Illumination and scale invariant relevant visual features with hypergraph-based learning for multi-shot person re-identification,";Nanda A;Multimedia Tools Appl.,2017

2. "A neuromorphic person re-identification framework for video surveillance,";Nanda A;IEEE Access,2017

3. S. Herath, M. Harandi, and F. Porikli, "Going deeper into action recognition: A survey," Image Vis. Comput., vol. 60, pp. 4–21, Apr. 2017.

4. J. Y.-H. Ng, M. Hausknecht, S. Vijayanarasimhan, O. Vinyals, R. Monga, and G. Toderici, "Beyond short snippets: Deep networks for video classification," in Proc. IEEE Conf. Comput. Vis. Pattern Recognit., Jun. 2015, pp. 4694–4702.

5. "A critical review of recurrent neural networks for sequence learning.";Lipton ZC,2015