LidPose: Real-Time 3D Human Pose Estimation in Sparse Lidar Point Clouds with Non-Repetitive Circular Scanning Pattern
Author:
Kovács Lóránt12ORCID, Bódis Balázs M.12ORCID, Benedek Csaba12ORCID
Affiliation:
1. HUN-REN Institute for Computer Science and Control (SZTAKI), Kende utca 13-17, H-1111 Budapest, Hungary 2. Faculty of Information Technology and Bionics, Pázmány Péter Catholic University, Práter utca 50/A, H-1083 Budapest, Hungary
Abstract
In this paper, we propose a novel, vision-transformer-based end-to-end pose estimation method, LidPose, for real-time human skeleton estimation in non-repetitive circular scanning (NRCS) lidar point clouds. Building on the ViTPose architecture, we introduce novel adaptations to address the unique properties of NRCS lidars, namely, the sparsity and unusual rosetta-like scanning pattern. The proposed method addresses a common issue of NRCS lidar-based perception, namely, the sparsity of the measurement, which needs balancing between the spatial and temporal resolution of the recorded data for efficient analysis of various phenomena. LidPose utilizes foreground and background segmentation techniques for the NRCS lidar sensor to select a region of interest (RoI), making LidPose a complete end-to-end approach to moving pedestrian detection and skeleton fitting from raw NRCS lidar measurement sequences captured by a static sensor for surveillance scenarios. To evaluate the method, we have created a novel, real-world, multi-modal dataset, containing camera images and lidar point clouds from a Livox Avia sensor, with annotated 2D and 3D human skeleton ground truth.
Funder
National Laboratory for Autonomous Systems Artificial Intelligence National Laboratory
Reference55 articles.
1. Zimmermann, C., Welschehold, T., Dornhege, C., Burgard, W., and Brox, T. (2018, January 21–25). 3D Human Pose Estimation in RGBD Images for Robotic Task Learning. Proceedings of the 2018 IEEE International Conference on Robotics and Automation (ICRA), Brisbane, QLD, Australia. 2. Cormier, M., Clepe, A., Specker, A., and Beyerer, J. (2022, January 4–8). Where are we with Human Pose Estimation in Real-World Surveillance?. Proceedings of the 2022 IEEE/CVF Winter Conference on Applications of Computer Vision Workshops (WACVW), Waikoloa, HI, USA. 3. A survey on visual surveillance of object motion and behaviors;Hu;IEEE Trans. Syst. Man Cybern. Part C (Appl. Rev.),2004 4. Zanfir, A., Zanfir, M., Gorban, A., Ji, J., Zhou, Y., Anguelov, D., and Sminchisescu, C. (2022, January 14–18). HUM3DIL: Semi-supervised Multi-modal 3D Human Pose Estimation for Autonomous Driving. Proceedings of the 6th Conference on Robot Learning, Auckland, New Zealand. 5. A Multisensor Technique for Gesture Recognition Through Intelligent Skeletal Pose Analysis;Rossol;IEEE Trans.-Hum.-Mach. Syst.,2016
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|