IDE-3D

Author:

Sun Jingxiang1,Wang Xuan2,Shi Yichun3,Wang Lizhen1,Wang Jue2,Liu Yebin1

Affiliation:

1. Tsinghua University, China

2. Tencent AI Lab, China

3. ByteDance Inc.

Abstract

Existing 3D-aware facial generation methods face a dilemma in quality versus editability: they either generate editable results in low resolution, or high-quality ones with no editing flexibility. In this work, we propose a new approach that brings the best of both worlds together. Our system consists of three major components: (1) a 3D-semantics-aware generative model that produces view-consistent, disentangled face images and semantic masks; (2) a hybrid GAN inversion approach that initializes the latent codes from the semantic and texture encoder, and further optimizes them for faithful reconstruction; and (3) a canonical editor that enables efficient manipulation of semantic masks in canonical view and produces high-quality editing results. Our approach is competent for many applications, e.g. free-view face drawing, editing and style control. Both quantitative and qualitative results show that our method reaches the state-of-the-art in terms of photorealism, faithfulness and efficiency.

Publisher

Association for Computing Machinery (ACM)

Subject

Computer Graphics and Computer-Aided Design

Reference69 articles.

1. Image2StyleGAN: How to Embed Images Into the StyleGAN Latent Space?

2. Image2StyleGAN++: How to Edit the Embedded Images?

3. Rameen Abdal , Peihao Zhu , Niloy J Mitra , and Peter Wonka . 2021 . Styleflow: Attribute-conditioned exploration of stylegan-generated images using conditional continuous normalizing flows. ACM Transactions on Graphics (TOG) (2021). Rameen Abdal, Peihao Zhu, Niloy J Mitra, and Peter Wonka. 2021. Styleflow: Attribute-conditioned exploration of stylegan-generated images using conditional continuous normalizing flows. ACM Transactions on Graphics (TOG) (2021).

4. ReStyle: A Residual-Based StyleGAN Encoder via Iterative Refinement

5. Sherwin Bahmani , Jeong Joon Park , Despoina Paschalidou, Hao Tang, Gordon Wetzstein, Leonidas Guibas, Luc Van Gool, and Radu Timofte. 2022 . 3D-Aware Video Generation . arXiv preprint arXiv:2206.14797 (2022). Sherwin Bahmani, Jeong Joon Park, Despoina Paschalidou, Hao Tang, Gordon Wetzstein, Leonidas Guibas, Luc Van Gool, and Radu Timofte. 2022. 3D-Aware Video Generation. arXiv preprint arXiv:2206.14797 (2022).

Cited by 48 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Directional Texture Editing for 3D Models;Computer Graphics Forum;2024-09-02

2. InvertAvatar: Incremental GAN Inversion for Generalized Head Avatars;Special Interest Group on Computer Graphics and Interactive Techniques Conference Conference Papers '24;2024-07-13

3. Editing Audio-Driven Talking Head Based on Audio Information;2024 International Joint Conference on Neural Networks (IJCNN);2024-06-30

4. DisCO: Portrait Distortion Correction with Perspective-Aware 3D GANs;International Journal of Computer Vision;2024-06-17

5. Point-StyleGAN: Multi-scale point cloud synthesis with style modulation;Computer Aided Geometric Design;2024-06

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3