Towards Single 2D Image-Level Self-Supervision for 3D Human Pose and Shape Estimation

Published:2021-10-18 Issue:20 Volume:11 Page:9724
ISSN:2076-3417
Container-title:Applied Sciences
language:en
Short-container-title:Applied Sciences

Author:

Cha Junuk,Saqlain Muhammad,Lee Changhwa,Lee Seongyeong,Lee Seungeun,Kim Donguk,Park Won-Hee,Baek Seungryul

Abstract

Three-dimensional human pose and shape estimation is an important problem in the computer vision community, with numerous applications such as augmented reality, virtual reality, human-computer interaction, and so on. However, training accurate 3D human pose and shape estimators based on deep learning approaches requires a large number of images and corresponding 3D ground-truth pose pairs, which are costly to collect. To relieve this constraint, various types of weakly or self-supervised pose estimation approaches have been proposed. Nevertheless, these methods still involve supervision signals, which require effort to collect, such as unpaired large-scale 3D ground-truth data, a small subset of 3D labeled data, video priors, and so on. Often, they require installing equipment such as a calibrated multi-camera system to acquire strong multi-view priors. In this paper, we propose a self-supervised learning framework for 3D human pose and shape estimation that does not require other forms of supervision signals while using only single 2D images. Our framework takes single 2D images as input, estimates human 3D meshes in the intermediate layers, and is trained to solve four types of self-supervision tasks (i.e., three image manipulation tasks and one neural rendering task) whose ground-truths are all based on the single 2D images themselves. Through experiments, we demonstrate the effectiveness of our approach on 3D human pose benchmark datasets (i.e., Human3.6M, 3DPW, and LSP), where we present the new state-of-the-art among weakly/self-supervised methods.

Funder

R&D Program of the Korea Railroad Research Institute, Republic of Korea.

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science

Link

https://www.mdpi.com/2076-3417/11/20/9724/pdf

Reference 66 articles.

Cited by 5 articles.

1. Estimation of 3D anatomically précised hand poses using single shot corrective CNN;Journal of Intelligent & Fuzzy Systems;2023-11-04

2. Image-free Domain Generalization via CLIP for 3D Hand Pose Estimation;2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV);2023-01

3. A Survey on Deep Learning-Based 2D Human Pose Estimation Models;Computers, Materials & Continua;2023

4. 3DMesh-GAR: 3D Human Body Mesh-Based Method for Group Activity Recognition;Sensors;2022-02-14

5. Multi-Person 3D Pose and Shape Estimation via Inverse Kinematics and Refinement;Lecture Notes in Computer Science;2022