DUA: A Domain-Unified Approach for Cross-Dataset 3D Human Pose Estimation
Author:
Manesco João Renato Ribeiro1ORCID, Berretti Stefano2ORCID, Marana Aparecido Nilceu1ORCID
Affiliation:
1. Faculty of Sciences, UNESP—São Paulo State University, Bauru 17033-360, SP, Brazil 2. Media Integration and Communication Center (MICC), University of Florence, 50134 Florence, Italy
Abstract
Human pose estimation is an important Computer Vision problem, whose goal is to estimate the human body through joints. Currently, methods that employ deep learning techniques excel in the task of 2D human pose estimation. However, the use of 3D poses can bring more accurate and robust results. Since 3D pose labels can only be acquired in restricted scenarios, fully convolutional methods tend to perform poorly on the task. One strategy to solve this problem is to use 2D pose estimators, to estimate 3D poses in two steps using 2D pose inputs. Due to database acquisition constraints, the performance improvement of this strategy can only be observed in controlled environments, therefore domain adaptation techniques can be used to increase the generalization capability of the system by inserting information from synthetic domains. In this work, we propose a novel method called Domain Unified approach, aimed at solving pose misalignment problems on a cross-dataset scenario, through a combination of three modules on top of the pose estimator: pose converter, uncertainty estimator, and domain classifier. Our method led to a 44.1mm (29.24%) error reduction, when training with the SURREAL synthetic dataset and evaluating with Human3.6M over a no-adaption scenario, achieving state-of-the-art performance.
Funder
São Paulo Research Foundation Petrobras/Fundunesp
Subject
Electrical and Electronic Engineering,Biochemistry,Instrumentation,Atomic and Molecular Physics, and Optics,Analytical Chemistry
Reference43 articles.
1. Zheng, C., Wu, W., Yang, T., Zhu, S., Chen, C., Liu, R., Shen, J., Kehtarnavaz, N., and Shah, M. (2020). Deep Learning-Based Human Pose Estimation: A Survey. arXiv. 2. Monocular Human Pose Estimation: A Survey of Deep Learning-Based Methods;Chen;Comput. Vis. Image Underst.,2020 3. Doersch, C., and Zisserman, A. (2019). Sim2real Transfer Learning for 3D Human Pose Estimation: Motion to the Rescue. arXiv. 4. Bartol, K., Bojanic, D., Petkovic, T., D’Apuzzo, N., and Pribanic, T. (2020, January 17–18). A Review of 3D Human Pose Estimation from 2D Images. Proceedings of the 3DBODY.TECH 2020—11th International Conference and Exhibition on 3D Body Scanning and Processing Technologies, Virtual. 5. Fleet, D., Pajdla, T., Schiele, B., and Tuytelaars, T. (2014). Computer Vision—ECCV 2014, Springer International Publishing. Lecture Notes in Computer Science.
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|