MVS-T: A Coarse-to-Fine Multi-View Stereo Network with Transformer for Low-Resolution Images 3D Reconstruction

Published:2022-10-09 Issue:19 Volume:22 Page:7659
ISSN:1424-8220
Container-title:Sensors
language:en
Short-container-title:Sensors

Author:

Jia Ruiming, Chen Xin, Cui Jiali, Hu Zhenghui

Abstract

A coarse-to-fine multi-view stereo network with Transformer (MVS-T) is proposed to address the sparse point clouds and low accuracy that arise when reconstructing 3D scenes from low-resolution multi-view images. The network uses a coarse-to-fine strategy to estimate image depth progressively and reconstruct the 3D point cloud. First, image feature pyramids are constructed to transfer semantic and spatial information among features at different scales. Then, a Transformer module aggregates the image's global context information and captures the internal correlation of the feature map. Finally, the image depth is inferred by constructing a cost volume and iterating through the stages. For 3D reconstruction from low-resolution images, experimental results show that the point cloud obtained by the network is more accurate and complete, and that the network outperforms other advanced algorithms in both objective metrics and subjective visualization.
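
The coarse-to-fine depth inference mentioned in the abstract can be illustrated with a minimal single-pixel sketch. This is not the MVS-T implementation: the feature pyramid and Transformer are omitted, and `cost_fn`, the per-stage hypothesis counts, the `shrink` factor, and the soft-argmin regression are all assumptions chosen for illustration. It only shows the general idea of sampling depth hypotheses, scoring them, and narrowing the search range at each stage.

```python
import math


def soft_argmin_depth(costs, depths):
    """Expected depth under softmax(-cost) over one pixel's depth hypotheses."""
    lowest = min(costs)  # subtract the minimum for numerical stability
    weights = [math.exp(-(c - lowest)) for c in costs]
    total = sum(weights)
    return sum(w * d for w, d in zip(weights, depths)) / total


def hypotheses(center, half_range, count):
    """Evenly spaced depths in [center - half_range, center + half_range]."""
    step = 2.0 * half_range / (count - 1)
    return [center - half_range + i * step for i in range(count)]


def coarse_to_fine_depth(cost_fn, d_min, d_max, counts=(48, 16, 8), shrink=0.25):
    """Refine one pixel's depth over several stages.

    cost_fn(depth) returns a matching cost (lower is better). It is a
    hypothetical stand-in for a lookup into a learned cost volume.
    """
    center = 0.5 * (d_min + d_max)
    half_range = 0.5 * (d_max - d_min)
    for count in counts:
        depths = hypotheses(center, half_range, count)
        costs = [cost_fn(d) for d in depths]
        center = soft_argmin_depth(costs, depths)  # depth estimate of this stage
        half_range *= shrink  # search a narrower interval at the next stage
    return center


if __name__ == "__main__":
    # Toy cost: absolute distance to an assumed true depth of 612.0.
    estimate = coarse_to_fine_depth(lambda d: abs(d - 612.0), 425.0, 935.0)
    print(round(estimate, 2))
```

In a real network the cost for each hypothesis would come from comparing warped multi-view features rather than from a known target depth, and every stage would operate on a full depth map at increasing resolution.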

Funder

National Natural Science Fund

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering; Biochemistry; Instrumentation; Atomic and Molecular Physics, and Optics; Analytical Chemistry

Link

https://www.mdpi.com/1424-8220/22/19/7659/pdf

Reference: 41 articles.

1. Research and Implementation of Autonomous Navigation for Mobile Robots Based on SLAM Algorithm under ROS

2. A robot hand-eye calibration method of line laser sensor based on 3D reconstruction

3. Live Semantic 3D Perception for Immersive Augmented Reality

4. ARACAM: A RGB-D Multi-View Photogrammetry System for Lower Limb 3D Reconstruction Applications

5. 3D MODELING OF GIRIFALCO FORTRESS

Cited by 5 articles.

1. A Super-Resolution and 3D Reconstruction Method Based on OmDF Endoscopic Images;Sensors;2024-07-27

2. A Coarse-to-Fine Transformer-Based Network for 3D Reconstruction from Non-Overlapping Multi-View Images;Remote Sensing;2024-03-03

3. Generating 3D Models for Prototyping of Virtual Environments using NeRF;2024 Second International Conference on Emerging Trends in Information Technology and Engineering (ICETITE);2024-02-22

4. Modeling Long-range Dependencies and Epipolar Geometry for Multi-view Stereo;ACM Transactions on Multimedia Computing, Communications, and Applications;2023-07-12

5. A-SATMVSNet: An attention-aware multi-view stereo matching network based on satellite imagery;Frontiers in Earth Science;2023-04-13