Depth Prediction without the Sensors: Leveraging Structure for Unsupervised Learning from Monocular Videos-Reference-Cited by-同舟云学术

Depth Prediction without the Sensors: Leveraging Structure for Unsupervised Learning from Monocular Videos

Published:2019-07-17 Issue: Volume:33 Page:8001-8008
ISSN:2374-3468
Container-title:Proceedings of the AAAI Conference on Artificial Intelligence
language:
Short-container-title:AAAI

Author:

Casser Vincent,Pirk Soeren,Mahjourian Reza,Angelova Anelia

Abstract

Learning to predict scene depth from RGB inputs is a challenging task both for indoor and outdoor robot navigation. In this work we address unsupervised learning of scene depth and robot ego-motion where supervision is provided by monocular videos, as cameras are the cheapest, least restrictive and most ubiquitous sensor for robotics. Previous work in unsupervised image-to-depth learning has established strong baselines in the domain. We propose a novel approach which produces higher quality results, is able to model moving objects and is shown to transfer across data domains, e.g. from outdoors to indoor scenes. The main idea is to introduce geometric structure in the learning process, by modeling the scene and the individual objects; camera ego-motion and object motions are learned from monocular videos as input. Furthermore an online refinement method is introduced to adapt learning on the fly to unknown domains. The proposed approach outperforms all state-of-the-art approaches, including those that handle motion e.g. through learned flow. Our results are comparable in quality to the ones which used stereo as supervision and significantly improve depth prediction on scenes and datasets which contain a lot of object motion. The approach is of practical relevance, as it allows transfer across environments, by transferring models trained on data collected for robot navigation in urban scenes to indoor navigation settings. The code associated with this paper can be found at https://sites.google.com/view/struct2depth.

Publisher

Association for the Advancement of Artificial Intelligence (AAAI)

Subject

General Medicine

Cited by 203 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. TSUDepth: Exploring temporal symmetry-based uncertainty for unsupervised monocular depth estimation;Neurocomputing;2024-10

2. Deep learning-based 3D reconstruction from multiple images: A survey;Neurocomputing;2024-09

3. Self-supervised monocular depth estimation with self-distillation and dense skip connection;Computer Vision and Image Understanding;2024-09

4. Reconstructing the local structures of Chinese ancient architecture using unsupervised depth estimation;Heritage Science;2024-08-30

5. Repmono: a lightweight self-supervised monocular depth estimation architecture for high-speed inference;Complex & Intelligent Systems;2024-08-10