Depth Edge and Structure Optimization-Based End-to-End Self-Supervised Stereo Matching-Reference-Cited by-同舟云学术

Depth Edge and Structure Optimization-Based End-to-End Self-Supervised Stereo Matching

Published:2023-10 Issue:13 Volume:37 Page:
ISSN:0218-0014
Container-title:International Journal of Pattern Recognition and Artificial Intelligence
language:en
Short-container-title:Int. J. Patt. Recogn. Artif. Intell.

Author:

Yang Wenbang¹²^ORCID,Cheng Xianjing³,Zhao Yong⁴,Qian Ren⁵,Li Jianhua⁵

Affiliation:

1. State Key Laboratory of Public Big Data, College of Computer Science and Technology, Guizhou University Guiyang 550025, P. R. China

2. School of Mathematical Sciences, Xingyi Normal University for Nationalities, Xingyi 562400, P. R. China

3. Harbin Institute of Technology, Shenzhen 518000, P. R. China

4. School of Electronic and Computer Engineering, Shenzhen Graduate School of Peking University, Shenzhen 518000, P. R. China

5. College of Computer Science and Technology, Guizhou University, Guiyang 550025, P. R. China

Abstract

This paper addresses the challenge of poor cross-domain generalization performance in deep learning methods for stereo matching, particularly when dealing with unseen scenes or disparity maps lacking ground-truth information. To overcome this issue, we propose a self-supervised network called SANet. The network integrates a lightweight algorithm, AANet_Edge, which is based on depth edges optimization. In SANet, we combine AANet_Edge with a novel algorithm called SDCO, which efficiently extracts depth edges using a segment structure and employs a two-layer optimization framework to generate accurate dense disparity maps. These maps are then utilized for pixel-by-pixel supervised training. Furthermore, SANet incorporates multi-scale reconstructed left maps and multi-scale edges-aware modules to learn the structural features of the input image. To evaluate the effectiveness of SANet, comprehensive experiments are conducted on two standard benchmark datasets, namely KITTI 2012 and KITTI 2015. The experimental results demonstrate that SANet produces accurate disparity maps for unseen scenes or limited images and achieves high cross-domain generalization performance.

Funder

Guizhou Provincial Department of Education Youth Science and Technology Talents Growth

Publisher

World Scientific Pub Co Pte Ltd

Subject

Artificial Intelligence,Computer Vision and Pattern Recognition,Software

Link

https://www.worldscientific.com/doi/pdf/10.1142/S0218001423500222

Reference31 articles.

1. SLIC Superpixels Compared to State-of-the-Art Superpixel Methods

2. PMBP: PatchMatch Belief Propagation for Correspondence Field Estimation

3. PatchMatch Stereo - Stereo Matching with Slanted Support Windows

4. Fast approximate energy minimization via graph cuts

5. Efficient Graph-Based Image Segmentation