Influence of Neural Network Receptive Field on Monocular Depth and Ego-Motion Estimation-Reference-Cited by-同舟云学术

Influence of Neural Network Receptive Field on Monocular Depth and Ego-Motion Estimation

Published:2023-11-28 Issue:S2 Volume:32 Page:S206-S213
ISSN:1060-992X
Container-title:Optical Memory and Neural Networks
language:en
Short-container-title:Opt. Mem. Neural Networks

Author:

Linok S. A.,Yudin D. A.

Abstract

Abstract We present an analysis of a self-supervised learning approach for monocular depth and ego-motion estimation. This is an important problem for computer vision systems of robots, autonomous vehicles and other intelligent agents, equipped only with monocular camera sensor. We have explored a number of neural network architectures that perform single-frame depth and multi-frame camera pose predictions to minimize photometric error between consecutive frames on a sequence of camera images. Unlike other existing works, our proposed approach called ERF-SfMLearner examines the influence of the deep neural network receptive field on the performance of depth and ego-motion estimation. To do this, we study the modification of network layers with two convolution operators with extended receptive field: dilated and deformable convolutions. We demonstrate on the KITTI dataset that increasing the receptive field leads to better metrics and lower errors both in terms of depth and ego-motion estimation. Code is publicly available at github.com/linukc/ERF-SfMLearner.

Publisher

Allerton Press

Subject

Electrical and Electronic Engineering,General Computer Science,Electronic, Optical and Magnetic Materials

Link

https://link.springer.com/content/pdf/10.3103/S1060992X23060103.pdf

Reference29 articles.

1. Qusay Sellat and Kanagachidambaresan Ramasubramanian, Advanced techniques for perception and localization in autonomous driving systems: A survey, Opt. Mem. Neural Networks, 2022, vol. 31, no. 2, pp. 123–144.

2. Shepel, I., Adeshkin, V., Belkin, I., and Yudin, D.A., Occupancy grid generation with dynamic obstacle segmentation in stereo images, IEEE Trans. Intell. Transp. Syst., 2021, vol. 23, no. 9, pp. 14779–14789.

3. Bokovoy, A., Muraviev, K., and Yakovlev, K., Map-merging algorithms for visual slam: Feasibility study and empirical evaluation, in Russian Conference on Artificial Intelligence, Springer, 2020, pp. 46–60.

4. Angermann, Ch., Schwab, M., and Haltmeier, M., Laubichler, Ch., and J’onsson, S., Unsupervised single-shot depth estimation using perceptual reconstruction, Mach. Vision Appl., 2023, vol. 34, no. 5, p. 82.

5. Goshin, Y., Coplanarity-based approach for camera motion estimation invariant to the scene depth, Opt. Mem. Neural Networks, 2022, vol. 31 (Suppl. 1), pp. 22–30.

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Machine Learning Algorithms for Autonomous Vehicles;Handbook of Formal Optimization;2024

2. Machine Learning Algorithms for Autonomous Vehicles;Handbook of Formal Optimization;2024