CharacterGen: Efficient 3D Character Generation from Single Images with Multi-View Pose Canonicalization-Reference-Cited by-同舟云学术

CharacterGen: Efficient 3D Character Generation from Single Images with Multi-View Pose Canonicalization

Published:2024-07-19 Issue:4 Volume:43 Page:1-13
ISSN:0730-0301
Container-title:ACM Transactions on Graphics
language:en
Short-container-title:ACM Trans. Graph.

Author:

Peng Hao-Yang¹^ORCID,Zhang Jia-Peng²^ORCID,Guo Meng-Hao¹^ORCID,Cao Yan-Pei³^ORCID,Hu Shi-Min¹^ORCID

Affiliation:

1. BNRist, Department of Computer Science and Technology, Tsinghua University, Beijing, China

2. Zhili College, Tsinghua University, Beijing, China

3. VAST, Beijing, China

Abstract

BNRist, Department of Computer Science and Technology, Tsinghua University, China In the field of digital content creation, generating high-quality 3D characters from single images is challenging, especially given the complexities of various body poses and the issues of self-occlusion and pose ambiguity. In this paper, we present CharacterGen, a framework developed to efficiently generate 3D characters. CharacterGen introduces a streamlined generation pipeline along with an image-conditioned multi-view diffusion model. This model effectively calibrates input poses to a canonical form while retaining key attributes of the input image, thereby addressing the challenges posed by diverse poses. A transformer-based, generalizable sparse-view reconstruction model is the other core component of our approach, facilitating the creation of detailed 3D models from multi-view images. We also adopt a texture-back-projection strategy to produce high-quality texture maps. Additionally, we have curated a dataset of anime characters, rendered in multiple poses and views, to train and evaluate our model. Our approach has been thoroughly evaluated through quantitative and qualitative experiments, showing its proficiency in generating 3D characters with high-quality shapes and textures, ready for downstream applications such as rigging and animation.

Funder

National Science and Technology Major Project

National Natural Science Foundation of China

Publisher

Association for Computing Machinery (ACM)

Link

https://dl.acm.org/doi/pdf/10.1145/3658217

Reference76 articles.

1. actorcore. 2023. accurig a software for automatic character rigging. https://actorcore.reallusion.com/auto-rig/accurig

2. imGHUM: Implicit Generative Models of 3D Human Shape and Articulated Pose

3. Pretrain, Self-train, Distill: A simple recipe for Supersizing 3D Reconstruction

4. OpenPose: Realtime Multi-Person 2D Pose Estimation Using Part Affinity Fields