Authors:
Pandey Ritik, Chikhale Yadnesh, Verma Ritik, Patil Deepali
Abstract
Human action recognition has become an important research area in computer vision, image processing, and human-machine or human-object interaction because of its many real-time applications. Action recognition is the identification of actions in video clips (sequences of 2D frames) in which an action may be performed. It can be viewed as a generalization of image classification to multiple frames, in which the predictions from the individual frames are aggregated. Different approaches have been proposed in the literature to improve recognition accuracy. In this paper we propose a deep-learning-based model for action recognition, with the main focus on a CNN model for image classification. The action videos are converted into frames and pre-processed before being passed to the model, which then recognizes the different actions.
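The frame-wise pipeline described in the abstract (sample frames from a clip, classify each frame, combine the per-frame predictions) can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the abstract does not say how frames are sampled or how predictions are combined, so uniform sampling and probability averaging are assumed here, and the names `sample_frame_indices`, `average_probabilities`, `classify_video`, and `frame_classifier` are hypothetical. The CNN is represented by any callable that maps one pre-processed frame to a list of class probabilities.

```python
from typing import Callable, List, Sequence

# Hypothetical stand-in for a pre-processed frame (e.g. a flattened image).
Frame = Sequence[float]


def sample_frame_indices(num_frames: int, num_samples: int) -> List[int]:
    """Pick up to num_samples evenly spaced frame indices from a clip."""
    if num_frames <= 0 or num_samples <= 0:
        return []
    if num_samples >= num_frames:
        return list(range(num_frames))
    step = num_frames / num_samples
    return [int(i * step) for i in range(num_samples)]


def average_probabilities(per_frame: Sequence[Sequence[float]]) -> List[float]:
    """Average per-frame class probabilities into one video-level vector."""
    if not per_frame:
        raise ValueError("need at least one frame prediction")
    num_classes = len(per_frame[0])
    totals = [0.0] * num_classes
    for probs in per_frame:
        for c, p in enumerate(probs):
            totals[c] += p
    return [t / len(per_frame) for t in totals]


def classify_video(
    frames: Sequence[Frame],
    frame_classifier: Callable[[Frame], Sequence[float]],
    labels: Sequence[str],
    num_samples: int = 16,
) -> str:
    """Classify a clip by averaging the classifier output over sampled frames.

    frame_classifier is an assumed placeholder for the trained CNN.
    """
    indices = sample_frame_indices(len(frames), num_samples)
    per_frame = [frame_classifier(frames[i]) for i in indices]
    mean_probs = average_probabilities(per_frame)
    best = max(range(len(mean_probs)), key=mean_probs.__getitem__)
    return labels[best]
```

Averaging is only one possible aggregation rule; majority voting over per-frame labels is a common alternative, and neither captures the temporal order of the frames.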