Affiliation:
1. Beijing Key Laboratory of Information Service Engineering, Beijing Union University, Beijing 100101, China
Abstract
Heatmap-based traditional approaches for estimating human pose usually suffer from drawbacks such as high network complexity or suboptimal accuracy. Focusing on the issue of multi-person pose estimation without heatmaps, this paper proposes an end-to-end, lightweight human pose estimation network using a multi-scale coordinate attention mechanism based on the Yolo-Pose network to improve the overall network performance while ensuring the network is lightweight. Specifically, the lightweight network GhostNet was first integrated into the backbone to alleviate the problem of model redundancy and produce a significant number of effective feature maps. Then, by combining the coordinate attention mechanism, the sensitivity of our proposed network to direction and location perception was enhanced. Finally, the BiFPN module was fused to balance the feature information of different scales and further improve the expression ability of convolutional features. Experiments on the COCO 2017 dataset showed that, compared with the baseline method YOLO-Pose, the average accuracy of the proposed network on the COCO 2017 validation dataset was improved by 4.8% while minimizing the amount of network parameters and calculations. The experimental results demonstrated that our proposed method can improve the detection accuracy of human pose estimation while ensuring that the model is lightweight.
Funder
Beijing Natural Science Foundation
National Natural Science Foundation of China
Key Project of Science and Technology Plan of Beijing Municipal Education Commission
Academic Research Projects of Beijing Union University
Premium Funding Project for Academic Human Resources Development in Beijing Union University
Subject
Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science
Reference50 articles.
1. An Improved Helmet Detection Algorithm Based on YOLO V4;Yang;Int. J. Found. Comput. Sci.,2022
2. Gong, W., Zhang, X., Gonzàlez, J., Sobral, A., Bouwmans, T., Tu, C., and Zahzah, E.-h. (2016). Human pose estimation from monocular images: A comprehensive survey. Sensors, 16.
3. Gadhiya, R., and Kalani, N. (2021, January 23–24). Analysis of deep learning based pose estimation techniques for locating landmarks on human body parts. Proceedings of the 2021 International Conference on Circuits, Controls and Communications (CCUBE), Bangalore, India.
4. Recent advances of monocular 2d and 3d human pose estimation: A deep learning perspective;Liu;ACM Comput. Surv.,2022
5. Nie, X., Feng, J., Zhang, J., and Yan, S. (2019–2, January 27). Single-stage multi-person pose machines. Proceedings of the IEEE/CVF International Conference on Computer Vision (CVPR), Seoul, Republic of Korea.
Cited by
4 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献