Adaptive region aggregation for multi‐view stereo matching using deformable convolutional networks-Reference-Cited by-同舟云学术

Adaptive region aggregation for multi‐view stereo matching using deformable convolutional networks

Published:2023-08-21 Issue:183 Volume:38 Page:430-449
ISSN:0031-868X
Container-title:The Photogrammetric Record
language:en
Short-container-title:The Photogrammetric Record

Author:

Hu Han¹^ORCID,Su Liupeng¹^ORCID,Mao Shunfu¹,Chen Min¹^ORCID,Pan Guoqiang²,Xu Bo¹^ORCID,Zhu Qing¹

Affiliation:

1. Faculty of Geosciences and Environmental Engineering Southwest Jiaotong University Chengdu China

2. Equipment Project Management Center Chinese People's Armed Police Force Beijing China

Abstract

AbstractDeep‐learning methods have demonstrated promising performance in multi‐view stereo (MVS) applications. However, it remains challenging to apply a geometrical prior on the adaptive matching windows to achieve efficient three‐dimensional reconstruction. To address this problem, this paper proposes a learnable adaptive region aggregation method based on deformable convolutional networks (DCNs), which is integrated into the feature extraction workflow for MVSNet method that uses coarse‐to‐fine structure. Following the conventional pipeline of MVSNet, a DCN is used to densely estimate and apply transformations in our feature extractor, which is composed of a deformable feature pyramid network (DFPN). Furthermore, we introduce a dedicated offset regulariser to promote the convergence of the learnable offsets of the DCN. The effectiveness of the proposed DFPN is validated through quantitative and qualitative evaluations on the BlendedMVS and Tanks and Temples benchmark datasets within a cross‐dataset evaluation setting.

Funder

National Basic Research Program of China

National Natural Science Foundation of China

Publisher

Wiley

Subject

Earth and Planetary Sciences (miscellaneous),Computers in Earth Sciences,Computer Science Applications,Engineering (miscellaneous)

Link

https://onlinelibrary.wiley.com/doi/pdf/10.1111/phor.12459

Reference71 articles.

1. MVSNet++: Learning Depth-Based Attention Pyramid Features for Multi-View Stereo

2. A space-sweep approach to true multi-image matching