Structure-Aware Residual Pyramid Network for Monocular Depth Estimation

Published: 2019-08
Container-title: Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence

Author:

Chen Xiaotian¹, Chen Xuejin¹, Zha Zheng-Jun¹

Affiliation:

1. National Engineering Laboratory for Brain-inspired Intelligence Technology and Application, University of Science and Technology of China

Abstract

Monocular depth estimation is an essential task for scene understanding. The underlying structure of objects and stuff in a complex scene is critical to recovering accurate and visually pleasing depth maps. Global structure conveys scene layouts, while local structure reflects shape details. Recently developed approaches based on convolutional neural networks (CNNs) significantly improve the performance of depth estimation. However, few of them take into account multi-scale structures in complex scenes. In this paper, we propose a Structure-Aware Residual Pyramid Network (SARPN) to exploit multi-scale structures for accurate depth prediction. We propose a Residual Pyramid Decoder (RPD) which expresses global scene structure in upper levels to represent layouts, and local structure in lower levels to represent shape details. At each level, we propose Residual Refinement Modules (RRM) that predict residual maps to progressively add finer structures to the coarser structure predicted at the upper level. In order to fully exploit multi-scale image features, an Adaptive Dense Feature Fusion (ADFF) module, which adaptively fuses effective features from all scales for inferring structures of each scale, is introduced. Experimental results on the challenging NYU-Depth v2 dataset demonstrate that our proposed approach achieves state-of-the-art performance in both qualitative and quantitative evaluation. The code is available at https://github.com/Xt-Chen/SARPN.
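
The coarse-to-fine idea in the abstract (a coarse depth map at the upper level, with residual maps adding finer structure at each lower level) can be illustrated with a minimal sketch. This is not the authors' implementation: in the paper the residual maps are predicted by the RRMs from fused image features, whereas here they are simply passed in as toy inputs. The function names, the nearest-neighbour upsampling, the fixed 2x scale factor, and all numeric values are assumptions made for illustration only.

```python
from typing import List

Grid = List[List[float]]


def upsample2x(depth: Grid) -> Grid:
    """Nearest-neighbour 2x upsampling (an assumed stand-in for the real upsampler)."""
    out: Grid = []
    for row in depth:
        wide = [v for v in row for _ in range(2)]  # repeat each value horizontally
        out.append(wide)
        out.append(list(wide))                     # repeat each row vertically
    return out


def refine(coarse: Grid, residual: Grid) -> Grid:
    """One pyramid step: upsample the coarser depth map, then add the residual map."""
    up = upsample2x(coarse)
    if len(up) != len(residual) or len(up[0]) != len(residual[0]):
        raise ValueError("residual must have twice the coarse resolution")
    return [[u + r for u, r in zip(up_row, res_row)]
            for up_row, res_row in zip(up, residual)]


def decode_pyramid(coarsest: Grid, residuals: List[Grid]) -> List[Grid]:
    """Return the depth map at every level, ordered from coarsest to finest."""
    levels = [coarsest]
    for residual in residuals:
        levels.append(refine(levels[-1], residual))
    return levels


# Hypothetical toy values: a 1x1 layout estimate refined to 2x2, then to 4x4.
coarsest = [[2.0]]
residuals = [
    [[0.5, -0.5], [0.0, 0.25]],      # adds structure at the 2x2 level
    [[0.0] * 4 for _ in range(4)],   # all-zero residual: the 4x4 level only upsamples
]
levels = decode_pyramid(coarsest, residuals)
finest = levels[-1]
```

Each level reuses the prediction from the level above and only has to account for what that coarser map is missing, which is the sense in which the decoder is "residual".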

Publisher

International Joint Conferences on Artificial Intelligence Organization

Cited by 46 articles.

1. Versatile depth estimator based on common relative depth estimation and camera-specific relative-to-metric depth conversion;Journal of Visual Communication and Image Representation;2024-08

2. Scale-Invariant Monocular Depth Estimation via SSI Depth;Special Interest Group on Computer Graphics and Interactive Techniques Conference Conference Papers '24;2024-07-13

3. EDFIDepth: enriched multi-path vision transformer feature interaction networks for monocular depth estimation;The Journal of Supercomputing;2024-06-05

4. Building Category Graphs Representation with Spatial and Temporal Attention for Visual Navigation;ACM Transactions on Multimedia Computing, Communications, and Applications;2024-05-16

5. Unsupervised Domain Adaptation Depth Estimation Based on Self-attention Mechanism and Edge Consistency Constraints;Neural Processing Letters;2024-05-09