Affiliation:
1. Ho Chi Minh City Open University, 35-37 Ho Hao Hon Street, Ward Co Giang, District 1, Ho Chi Minh City, Vietnam
Abstract
Human action recognition is an important field in computer vision that has attracted remarkable attention from researchers. This survey provides a comprehensive overview of recent deep learning-based approaches to human action recognition from RGB video data, dividing them into five categories for researchers interested in this field. Moreover, pure-transformer (convolution-free) architectures have recently outperformed their convolutional counterparts in many fields of computer vision. We therefore also cover recent convolution-free methods, which replace convolutional networks with transformer networks and have achieved state-of-the-art results on many human action recognition datasets. First, we discuss methods based on 2D convolutional neural networks. We then discuss methods based on recurrent neural networks, which are used to capture motion information. Next, we cover 3D convolutional neural network-based methods, which many recent approaches use to capture both spatial and temporal information in videos. For long action videos, we review multistream approaches, which use different streams to encode different features. We also compare the performance of recently proposed methods on four popular benchmark datasets and review 26 benchmark datasets for human action recognition. We conclude the survey by discussing some potential research directions.
Funder
Ho Chi Minh City Open University
Subject
General Mathematics, General Medicine, General Neuroscience, General Computer Science
Cited by
28 articles.
1. Understanding Plant Secondary Metabolism Using Bioinformatics Tools;Bioinformatics for Plant Research and Crop Breeding;2024-07-19
2. Pre-Hospital Stroke Care beyond the MSU;Current Neurology and Neuroscience Reports;2024-06-22
3. De-anonymizing VR Avatars using Non-VR Motion Side-channels;Proceedings of the 17th ACM Conference on Security and Privacy in Wireless and Mobile Networks;2024-05-27
4. Creating Patterns for Handicrafts and Embroidery;International Journal of Innovative Science and Research Technology (IJISRT);2024-05-16
5. Online human motion analysis in industrial context: A review;Engineering Applications of Artificial Intelligence;2024-05