Linear Time Non-Local Cost Aggregation on Complementary Spatial Tree Structures
-
Published:2023-10-30
Issue:21
Volume:13
Page:11892
-
ISSN:2076-3417
-
Container-title:Applied Sciences
-
language:en
-
Short-container-title:Applied Sciences
Author:
Bu Penghui12ORCID, Wang Hang1, Dou Yihua1, Zhao Hong3
Affiliation:
1. School of Mechanical Engineering, Xi’an Shiyou University, Xi’an 710065, China 2. Xi’an Chishine Optoelectronics Technology Co., Ltd., Xi’an 710076, China 3. State Key Laboratory for Manufacturing Systems Engineering, Xi’an Jiaotong University, Xi’an 710049, China
Abstract
Studies on many dense correspondence tasks in the field of computer vision attempt to find spatially smooth results. A typical way to solve these problems is by smoothing the matching costs using edge-preserving filters. However, local filters generate locally optimal results, in that they only take the costs over a small support window into account, and non-local filters based on a minimum spanning tree (MST) tend to overuse the piece-wise constant assumption. In this paper, we propose a linear time non-local cost aggregation method based on two complementary spatial tree structures. The geodesic distances in both the spatial and intensity spaces along the tree structures are used to evaluate the similarity of pixels, and the final aggregated cost is the sum of the outputs from these two trees. The filtering output of a pixel on each tree can be obtained by recursively aggregating the costs along eight sub-trees with linear time complexity. The only difference between the filtering procedures on these two spatial tree structures is the order of the filtering. Experimental results in optical flow estimation and stereo matching on the Middlebury and KITTI datasets demonstrate the effectiveness and efficiency of our method. It turns out that our method outperforms typical non-local filters based on the MST in cost aggregation. Moreover, a comparison of handcrafted features and deep features learned by convolutional neural networks (CNNs) in calculating the matching cost is also provided. The code will be available soon.
Funder
Research and Development Program of Shaanxi Province National Natural Science Foundation of China
Subject
Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science
Reference35 articles.
1. A hierarchical stereo matching algorithm based on adaptive support region aggregation method;Zeglazi;Pattern Recognit. Lett.,2018 2. Yao, Y., Luo, Z., Li, S., Fang, T., and Quan, L. (2018, January 8–14). Mvsnet: Depth inference for unstructured multi-view stereo. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany. 3. Data-driven visual similarity for cross-domain image matching;Shrivastava;ACM Trans. Graph.,2011 4. Truong, P., Apostolopoulos, S., Mosinska, A., Stucky, S., Ciller, C., and Zanet, S.D. (November, January 27). Glampoints: Greedily learned accurate match points. Proceedings of the IEEE/CVF International Conference on Computer Vision, Seoul, Republic of Korea. 5. Gaussian dynamic convolution for efficient single-image segmentation;Sun;IEEE Trans. Circuits Syst. Video Technol.,2021
|
|