Unsupervised Learning of Geometry From Videos With Edge-Aware Depth-Normal Consistency-Reference-Cited by-同舟云学术

Unsupervised Learning of Geometry From Videos With Edge-Aware Depth-Normal Consistency

Published:2018-04-27 Issue:1 Volume:32 Page:
ISSN:2374-3468
Container-title:Proceedings of the AAAI Conference on Artificial Intelligence
language:
Short-container-title:AAAI

Author:

Yang Zhenheng,Wang Peng,Xu Wei,Zhao Liang,Nevatia Ramakant

Abstract

Learning to reconstruct depths from a single image by watching unlabeled videos via deep convolutional network (DCN) is attracting significant attention in recent years, e.g. (Zhou et al. 2017). In this paper, we propose to use surface normal representation for unsupervised depth estimation framework. Our estimated depths are constrained to be compatible with predicted normals, yielding more robust geometry results. Specifically, we formulate an edge-aware depth-normal consistency term, and solve it by constructing a depth-to-normal layer and a normal-to-depth layer inside of the DCN. The depth-to-normal layer takes estimated depths as input, and computes normal directions using cross production based on neighboring pixels. Then given the estimated normals, the normal-to-depth layer outputs a regularized depth map through local planar smoothness. Both layers are computed with awareness of edges inside the image to help address the issue of depth/normal discontinuity and preserve sharp edges. Finally, to train the network, we apply the photometric error and gradient smoothness to supervise both depth and normal predictions. We conducted experiments on both outdoor (KITTI) and indoor (NYUv2) datasets, and showed that our algorithm vastly outperforms state-of-the-art, which demonstrates the benefits of our approach.

Publisher

Association for the Advancement of Artificial Intelligence (AAAI)

Subject

General Medicine

Cited by 40 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Adaptive Surface Normal Constraint for Geometric Estimation From Monocular Images;IEEE Transactions on Pattern Analysis and Machine Intelligence;2024-09

2. Learning Effective Geometry Representation from Videos for Self-Supervised Monocular Depth Estimation;ISPRS International Journal of Geo-Information;2024-06-11

3. CFDepthNet: Monocular Depth Estimation Introducing Coordinate Attention and Texture Features;Neural Processing Letters;2024-04-24

4. Joint self-supervised learning of interest point, descriptor, depth, and ego-motion from monocular video;Multimedia Tools and Applications;2024-02-26

5. Structure-Aware Cross-Modal Transformer for Depth Completion;IEEE Transactions on Image Processing;2024