Affiliation:
1. College of Computer and Information Systems, Umm Al-Qura University, Makkah 21955, Saudi Arabia
Abstract
Human Pose Estimation (HPE) is the task that aims to predict the location of human joints from images and videos. This task is used in many applications, such as sports analysis and surveillance systems. Recently, several studies have embraced deep learning to enhance the performance of HPE tasks. However, building an efficient HPE model is difficult; many challenges, like crowded scenes and occlusion, must be handled. This paper followed a systematic procedure to review different HPE models comprehensively. About 100 articles published since 2014 on HPE using deep learning were selected using several selection criteria. Both image and video data types of methods were investigated. Furthermore, both single and multiple HPE methods were reviewed. In addition, the available datasets, different loss functions used in HPE, and pretrained feature extraction models were all covered. Our analysis revealed that Convolutional Neural Networks (CNNs) and Recurrent Neural Networks (RNNs) are the most used in HPE. Moreover, occlusion and crowd scenes remain the main problems affecting models’ performance. Therefore, the paper presented various solutions to address these issues. Finally, this paper highlighted the potential opportunities for future work in this task.
Funder
Deputyship for Research & Innovation, Ministry of Education in Saudi Arabia
Subject
Artificial Intelligence,Engineering (miscellaneous)
Reference166 articles.
1. Sun, J., Chen, X., Lu, Y., and Cao, J. (2020, January 14–16). 2D Human Pose Estimation from Monocular Images: A Survey. Proceedings of the IEEE 3rd International Conference on Computer and Communication Engineering Technology, Beijing, China.
2. Gong, W., Zhang, X., Gonzàlez, J., Sobral, A., Bouwmans, T., Tu, C., and Zahzah, E.H. (2016). Human pose estimation from monocular images: A comprehensive survey. Sensors, 16.
3. Abnormal Behavior Learning Based on Edge Computing toward a Crowd Monitoring System;Miao;IEEE Netw.,2022
4. On unifying deep learning and edge computing for human motion analysis in exergames development;Pardos;Neural Comput. Appl.,2022
5. Animepose: Multi-person 3d pose estimation and animation;Kumarapu;Pattern Recognit. Lett.,2021
Cited by
13 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献