U-ETMVSNet: Uncertainty-Epipolar Transformer Multi-View Stereo Network for Object Stereo Reconstruction-Reference-Cited by-同舟云学术

U-ETMVSNet: Uncertainty-Epipolar Transformer Multi-View Stereo Network for Object Stereo Reconstruction

Published:2024-03-07 Issue:6 Volume:14 Page:2223
ISSN:2076-3417
Container-title:Applied Sciences
language:en
Short-container-title:Applied Sciences

Author:

Zhao Ning¹,Wang Heng¹,Cui Quanlong¹,Wu Lan¹

Affiliation:

1. School of Electrical Engineering, Henan University of Technology, Zhengzhou 450001, China

Abstract

The Multi-View Stereo model (MVS), which utilizes 2D images from multiple perspectives for 3D reconstruction, is a crucial technique in the field of 3D vision. To address the poor correlation between 2D features and 3D space in existing MVS models, as well as the high sampling rate required for static sampling, we proposeU-ETMVSNet in this paper. Initially, we employ an integrated epipolar transformer module (ET) to establish 3D spatial correlations along epipolar lines, thereby enhancing the reliability of aggregated cost volumes. Subsequently, we devise a sampling module based on probability volume uncertainty to dynamically adjust the depth sampling range for the next stage. Finally, we utilize a multi-stage joint learning method based on multi-depth value classification to evaluate and optimize the model. Experimental results demonstrate that on the DTU dataset, our method achieves a relative performance improvement of 27.01% and 11.27% in terms of completeness error and overall error, respectively, compared to CasMVSNet, even at lower depth sampling rates. Moreover, our method exhibits excellent performance with a score of 58.60 on the Tanks &Temples dataset, highlighting its robustness and generalization capability.

Funder

National Natural Science Foundation of China

Publisher

MDPI AG

Link

https://www.mdpi.com/2076-3417/14/6/2223/pdf

Reference48 articles.

1. Campbell, N.D., Vogiatzis, G., Hernández, C., and Cipolla, R. (2008, January 12–18). Using multiple hypotheses to improve depth-maps for multi-view stereo. Proceedings of the Computer Vision–ECCV 2008: 10th European Conference on Computer Vision, Marseille, France. Proceedings, Part I 10.

2. Gipuma: Massively parallel multi-view stereo reconstruction;Galliani;Publ. Dtsch. Ges. Photogramm. Fernerkund. Geoinf. E. V,2016

3. Efficient large-scale multi-view stereo for ultra high-resolution image sets;Tola;Mach. Vis. Appl.,2012

4. Galliani, S., Lasinger, K., and Schindler, K. (2015, January 7–13). Massively parallel multiview stereopsis by surface normal diffusion. Proceedings of the IEEE International Conference on Computer Vision, Santiago, Chile.

5. Yao, Y., Luo, Z., Li, S., Fang, T., and Quan, L. (2018, January 8–14). Mvsnet: Depth inference for unstructured multi-view stereo. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.