Neural Radiance Field-Inspired Depth Map Refinement for Accurate Multi-View Stereo-Reference-Cited by-同舟云学术

Neural Radiance Field-Inspired Depth Map Refinement for Accurate Multi-View Stereo

Published:2024-03-08 Issue:3 Volume:10 Page:68
ISSN:2313-433X
Container-title:Journal of Imaging
language:en
Short-container-title:J. Imaging

Author:

Ito Shintaro¹^ORCID,Miura Kanta¹,Ito Koichi¹^ORCID,Aoki Takafumi¹^ORCID

Affiliation:

1. Graduate School of Information Sciences, Tohoku University, 6-6-05, Aramaki Aza Aoba, Sendai 9808579, Japan

Abstract

In this paper, we propose a method to refine the depth maps obtained by Multi-View Stereo (MVS) through iterative optimization of the Neural Radiance Field (NeRF). MVS accurately estimates the depths on object surfaces, and NeRF accurately estimates the depths at object boundaries. The key ideas of the proposed method are to combine MVS and NeRF to utilize the advantages of both in depth map estimation and to use NeRF for depth map refinement. We also introduce a Huber loss into the NeRF optimization to improve the accuracy of the depth map refinement, where the Huber loss reduces the estimation error in the radiance fields by placing constraints on errors larger than a threshold. Through a set of experiments using the Redwood-3dscan dataset and the DTU dataset, which are public datasets consisting of multi-view images, we demonstrate the effectiveness of the proposed method compared to conventional methods: COLMAP, NeRF, and DS-NeRF.

Funder

JSPS KAKENHI

Publisher

MDPI AG

Link

https://www.mdpi.com/2313-433X/10/3/68/pdf

Reference34 articles.

1. Szeliski, R. (2010). Computer Vision: Algorithms and Applications, Springer.

2. Seitz, S.M., Curless, B., Diebe, J., Scharstein, D., and Szeliski, R. (2006, January 17–22). A comparison and evaluation of Multi-View Stereo reconstruction algorithms. Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, New York, NY, USA.

3. Schönberger, J.L., Zheng, E., Pollefeys, M., and Frahm, J. (2016, January 11–14). Pixelwise view selection for unstructured Multi-View Stereo. Proceedings of the European Conference Computer Vision, Amsterdam, The Netherlands.

4. Collins, R.T. (1996, January 18–20). A space-sweep approach to true multi-image matching. Proceedings of the CVPR IEEE Computer Society Conference on Computer Vision and Pattern Recognition, San Francisco, CA, USA.

5. Yao, Y., Luo, Z., Li, S., Shen, T., Fang, T., and Quan, L. (2019, January 6–20). Recurrent MVSNet for high-resolution multi-view stereo depth inference. Proceedings of the 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA.