Abstract
We present a simple yet effective pipeline for absolute three-dimensional (3D) human pose estimation from two-dimensional (2D) joint keypoints, namely, the 2D-to-3D human pose lifting problem. Our method comprises two simple baseline networks, a 3D conversion function, and a correction network. The former two networks predict the root distance and the root-relative joint distance simultaneously. Given the input and predicted distances, the 3D conversion function recovers the absolute 3D pose, and the correction network reduces 3D pose noise caused by input uncertainties. Furthermore, to cope with input noise implicitly, we adopt a Siamese architecture that enforces the consistency of features between two training inputs, i.e., ground truth 2D joint keypoints and detected 2D joint keypoints. Finally, we experimentally validate the advantages of the proposed method and demonstrate its competitive performance over state-of-the-art absolute 2D-to-3D pose-lifting methods.
Subject
Physics and Astronomy (miscellaneous),General Mathematics,Chemistry (miscellaneous),Computer Science (miscellaneous)
Reference48 articles.
1. (2022, January 01). Optitrack. Available online: https://www.optitrack.com/.
2. (2022, January 01). Qualisys. Available online: https://www.qualisys.com/.
3. Liu, W., Bao, Q., Sun, Y., and Mei, T. (2021). Recent Advances in Monocular 2D and 3D Human Pose Estimation: A Deep Learning Perspective. arXiv.
4. Wu, Y., Ma, S., Zhang, D., and Sun, J. (2020). 3D Capsule Hand Pose Estimation Network Based on Structural Relationship Information. Symmetry, 12.
5. Moon, G., Chang, J.Y., and Lee, K.M. (November, January 27). Camera distance-aware top-down approach for 3d multi-person pose estimation from a single rgb image. Proceedings of the ICCV, Seoul, Republic of Korea.