NeRSemble: Multi-view Radiance Field Reconstruction of Human Heads

Author:

Kirschstein Tobias1ORCID,Qian Shenhan1ORCID,Giebenhain Simon1ORCID,Walter Tim1ORCID,Nießner Matthias1ORCID

Affiliation:

1. Technical University of Munich, Munich, Germany

Abstract

We focus on reconstructing high-fidelity radiance fields of human heads, capturing their animations over time, and synthesizing re-renderings from novel viewpoints at arbitrary time steps. To this end, we propose a new multi-view capture setup composed of 16 calibrated machine vision cameras that record time-synchronized images at 7.1 MP resolution and 73 frames per second. With our setup, we collect a new dataset of over 4700 high-resolution, high-framerate sequences of more than 220 human heads, from which we introduce a new human head reconstruction benchmark 1 . The recorded sequences cover a wide range of facial dynamics, including head motions, natural expressions, emotions, and spoken language. In order to reconstruct high-fidelity human heads, we propose Dynamic Neural Radiance Fields using Hash Ensembles (NeRSemble). We represent scene dynamics by combining a deformation field and an ensemble of 3D multi-resolution hash encodings. The deformation field allows for precise modeling of simple scene movements, while the ensemble of hash encodings helps to represent complex dynamics. As a result, we obtain radiance field representations of human heads that capture motion over time and facilitate re-rendering of arbitrary novel viewpoints. In a series of experiments, we explore the design choices of our method and demonstrate that our approach outperforms state-of-the-art dynamic radiance field approaches by a significant margin.

Publisher

Association for Computing Machinery (ACM)

Subject

Computer Graphics and Computer-Aided Design

Reference57 articles.

1. RigNeRF: Fully Controllable Neural 3D Portraits

2. Benjamin Attal , Jia-Bin Huang , Christian Richardt , Michael Zollhoefer , Johannes Kopf , Matthew O'Toole , and Changil Kim . 2023. HyperReel: High-Fidelity 6-DoF Video with Ray-Conditioned Sampling. CVPR ( 2023 ). Benjamin Attal, Jia-Bin Huang, Christian Richardt, Michael Zollhoefer, Johannes Kopf, Matthew O'Toole, and Changil Kim. 2023. HyperReel: High-Fidelity 6-DoF Video with Ray-Conditioned Sampling. CVPR (2023).

3. Jonathan T. Barron , Ben Mildenhall , Dor Verbin , Pratul P. Srinivasan , and Peter Hedman . 2022. Mip-NeRF 360: Unbounded Anti-Aliased Neural Radiance Fields. CVPR ( 2022 ). V. Blanz, C. Basso, T. Poggio, and T. Vetter. 2003. Reanimating Faces in Images and Video . (2003), 641--650. Jonathan T. Barron, Ben Mildenhall, Dor Verbin, Pratul P. Srinivasan, and Peter Hedman. 2022. Mip-NeRF 360: Unbounded Anti-Aliased Neural Radiance Fields. CVPR (2022). V. Blanz, C. Basso, T. Poggio, and T. Vetter. 2003. Reanimating Faces in Images and Video. (2003), 641--650.

4. A morphable model for the synthesis of 3D faces

5. Immersive light field video with a layered mesh representation

Cited by 2 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. VSCHH 2023: A Benchmark for the View Synthesis Challenge of Human Heads;2023 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW);2023-10-02

2. ILSH: The Imperial Light-Stage Head Dataset for Human Head View Synthesis;2023 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW);2023-10-02

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3