Abstract
Knowing the robot's pose is a crucial prerequisite for mobile robot tasks such as collision avoidance or autonomous navigation. Using powerful predictive models to estimate transformations for visual odometry via downward-facing cameras is an understudied area of research. This work proposes a novel deep-learning-based approach for estimating ego-motion with a downward-looking camera. The network can be trained completely unsupervised and is not restricted to a specific motion model. We propose two neural network architectures based on the Early Fusion and Slow Fusion design principles: “EarlyBird” and “SlowBird”. Both networks share a Spatial Transformer layer for image warping and are trained with a modified structural similarity index (SSIM) loss function. Experiments carried out in simulation and on a real-world differential drive robot show that our proposed deep-learning-based approaches achieve results comparable to, and in some cases better than, a state-of-the-art method based on the fast Fourier transform.
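The abstract's core training signal can be illustrated with a minimal sketch: warp the previous camera frame by a predicted motion using a Spatial Transformer (differentiable grid generation plus sampling) and score the warp against the current frame with an SSIM-based loss, so no ground-truth poses are needed. This is a hedged illustration in PyTorch, not the authors' implementation; the helper names (warp_affine, ssim_loss, photometric_loss) and the 2D affine motion parameterization theta are assumptions for the example.

import torch
import torch.nn.functional as F

def warp_affine(img, theta):
    # Spatial Transformer warp: theta is a batch of 2x3 affine matrices (N, 2, 3).
    grid = F.affine_grid(theta, img.size(), align_corners=False)
    return F.grid_sample(img, grid, align_corners=False)

def ssim_loss(x, y, c1=0.01 ** 2, c2=0.03 ** 2):
    # (1 - SSIM) / 2 with 3x3 average-pooled local statistics; lower = more similar.
    mu_x, mu_y = F.avg_pool2d(x, 3, 1), F.avg_pool2d(y, 3, 1)
    sigma_x = F.avg_pool2d(x * x, 3, 1) - mu_x ** 2
    sigma_y = F.avg_pool2d(y * y, 3, 1) - mu_y ** 2
    sigma_xy = F.avg_pool2d(x * y, 3, 1) - mu_x * mu_y
    ssim = ((2 * mu_x * mu_y + c1) * (2 * sigma_xy + c2)) / (
        (mu_x ** 2 + mu_y ** 2 + c1) * (sigma_x + sigma_y + c2)
    )
    return torch.clamp((1 - ssim) / 2, 0, 1).mean()

def photometric_loss(prev_img, curr_img, theta):
    # Unsupervised objective: the network's predicted motion theta should
    # warp prev_img onto curr_img; the SSIM loss penalizes the mismatch.
    return ssim_loss(warp_affine(prev_img, theta), curr_img)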
Funder
Karlsruher Institut für Technologie (KIT)
Publisher
Springer Science and Business Media LLC
Subject
Computer Graphics and Computer-Aided Design, Computer Vision and Pattern Recognition, Software
Cited by
5 articles.