A Perceptual Shape Loss for Monocular 3D Face Reconstruction

Published:2023-10 Issue:7 Volume:42
ISSN:0167-7055
Container-title:Computer Graphics Forum
Language:en
Short-container-title:Computer Graphics Forum

Author:

Otto C.¹², Chandran P.¹, Zoss G.¹, Gross M.¹², Gotardo P.¹, Bradley D.¹

Affiliation:

1. DisneyResearch|Studios, Switzerland

2. ETH Zürich, Switzerland

Abstract

Monocular 3D face reconstruction is a wide‐spread topic, and existing approaches tackle the problem either through fast neural network inference or offline iterative reconstruction of face geometry. In either case carefully‐designed energy functions are minimized, commonly including loss terms like a photometric loss, a landmark reprojection loss, and others. In this work we propose a new loss function for monocular face capture, inspired by how humans would perceive the quality of a 3D face reconstruction given a particular image. It is widely known that shading provides a strong indicator for 3D shape in the human visual system. As such, our new ‘perceptual’ shape loss aims to judge the quality of a 3D face estimate using only shading cues. Our loss is implemented as a discriminator‐style neural network that takes an input face image and a shaded render of the geometry estimate, and then predicts a score that perceptually evaluates how well the shaded render matches the given image. This ‘critic’ network operates on the RGB image and geometry render alone, without requiring an estimate of the albedo or illumination in the scene. Furthermore, our loss operates entirely in image space and is thus agnostic to mesh topology. We show how our new perceptual shape loss can be combined with traditional energy terms for monocular 3D face optimization and deep neural network regression, improving upon current state‐of‐the‐art results.

Publisher

Wiley

Subject

Computer Graphics and Computer-Aided Design

Link

https://onlinelibrary.wiley.com/doi/pdf/10.1111/cgf.14945

References: 70 articles.

1. Tran A. T., Hassner T., Masi I., Medioni G.: Regressing robust and discriminative 3D morphable models with a very deep neural network. arXiv preprint arXiv:1612.04904 (2016).

2. Beeler T. et al.: High-quality single-shot capture of facial geometry. ACM Trans. on Graphics (Proc. SIGGRAPH) (2010).

3. Bao L., Lin X., Chen Y., Zhang H., Wang S., Zhe X., Kang D., Huang H., Jiang X., Wang J., Yu D., Zhang Z.: High-fidelity 3D digital human head creation from RGB-D selfies. ACM Transactions on Graphics (2021).

4. Booth J., Roussos A., Zafeiriou S., Ponniah A., Dunaway D.: A 3D morphable model learnt from 10,000 faces. In 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2016), pp. 5543–5552.

Cited by 1 article.

1. Boosting fairness for 3D face reconstruction. In 2024 International Joint Conference on Neural Networks (IJCNN) (2024-06-30).