PosturePose: Optimized Posture Analysis for Semi-Supervised Monocular 3D Human Pose Estimation
Author:
Amadi Lawrence1ORCID, Agam Gady1
Affiliation:
1. Visual Computing Lab, Illinois Institute of Technology, Chicago, IL 60616, USA
Abstract
One motivation for studying semi-supervised techniques for human pose estimation is to compensate for the lack of variety in curated 3D human pose datasets by combining labeled 3D pose data with readily available unlabeled video data—effectively, leveraging the annotations of the former and the rich variety of the latter to train more robust pose estimators. In this paper, we propose a novel, fully differentiable posture consistency loss that is unaffected by camera orientation and improves monocular human pose estimators trained with limited labeled 3D pose data. Our semi-supervised monocular 3D pose framework combines biomechanical pose regularization with a multi-view posture (and pose) consistency objective function. We show that posture optimization was effective at decreasing pose estimation errors when applied to a 2D–3D lifting network (VPose3D) and two well-studied datasets (H36M and 3DHP). Specifically, the proposed semi-supervised framework with multi-view posture and pose loss lowered the mean per-joint position error (MPJPE) of leading semi-supervised methods by up to 15% (−7.6 mm) when camera parameters of unlabeled poses were provided. Without camera parameters, our semi-supervised framework with posture loss improved semi-supervised state-of-the-art methods by 17% (−15.6 mm decrease in MPJPE). Overall, our pose models compete favorably with other high-performing pose models trained under similar conditions with limited labeled data.
Funder
National Science Foundation
Subject
Electrical and Electronic Engineering,Biochemistry,Instrumentation,Atomic and Molecular Physics, and Optics,Analytical Chemistry
Reference55 articles.
1. Human3.6M: Large Scale Datasets and Predictive Methods for 3D Human Sensing in Natural Environments;Ionescu;IEEE Trans. Pattern Anal. Mach. Intell.,2014 2. Joo, H., Simon, T., Li, X., Liu, H., Tan, L., Gui, L., Banerjee, S., Godisart, T.S., Nabbe, B., and Matthews, I. (2015, January 7–13). Panoptic Studio: A Massively Multiview System for Social Interaction Capture. Proceedings of the 2015 IEEE International Conference on Computer Vision (ICCV), Santiago, Chile. 3. HumanEva: Synchronized Video and Motion Capture Dataset and Baseline Algorithm for Evaluation of Articulated Human Motion;Sigal;Int. J. Comput. Vis.,2009 4. Mehta, D., Rhodin, H., Casas, D., Fua, P., Sotnychenko, O., Xu, W., and Theobalt, C. (2017, January 10–12). Monocular 3D Human Pose Estimation in the Wild Using Improved CNN Supervision. Proceedings of the 3D Vision (3DV), 2017 Fifth International Conference, Qingdao, China. 5. Von Marcard, T., Henschel, R., Black, M.J., Rosenhahn, B., and Pons-Moll, G. (2018, January 8–14). Recovering Accurate 3D Human Pose in the Wild Using IMUs and a Moving Camera. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|