Self-Supervised Steering and Path Labeling for Autonomous Driving

Published: 2023-10-15
Volume: 23, Issue: 20, Page: 8473
ISSN: 1424-8220
Container title: Sensors
Language: en

Author:

Mihalea Andrei¹, Samoilescu Robert-Florian¹, Florea Adina Magda¹

Affiliation:

1. Department of Computer Science, Faculty of Automatic Control and Computers, University Politehnica of Bucharest, 060042 Bucharest, Romania

Abstract

Autonomous driving is a complex task that requires high-level hierarchical reasoning. Various solutions based on hand-crafted rules, multi-modal systems, or end-to-end learning have been proposed over time but are not quite ready to deliver the accuracy and safety necessary for real-world urban autonomous driving. Those methods require expensive hardware for data collection or environmental perception and are sensitive to distribution shifts, making large-scale adoption impractical. We present an approach that uses only monocular camera inputs to generate valuable data without any supervision. Our main contributions involve a mechanism that can provide steering data annotations starting from unlabeled data, alongside a separate pipeline that generates path labels in a completely self-supervised manner. Thus, our method represents a natural step towards leveraging the large amounts of available online data, ensuring the complexity and the diversity required to learn a robust autonomous driving policy.

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering; Biochemistry; Instrumentation; Atomic and Molecular Physics, and Optics; Analytical Chemistry

Link

https://www.mdpi.com/1424-8220/23/20/8473/pdf

References (50 articles)

1. Pomerleau, D.A. (1989, January 27–30). Alvinn: An autonomous land vehicle in a neural network. Proceedings of the Advances in Neural Information Processing Systems, Denver, CO, USA.

2. Hawke, J., Shen, R., Gurau, C., Sharma, S., Reda, D., Nikolov, N., Mazur, P., Micklethwaite, S., Griffiths, N., and Shah, A. (2019). Urban Driving with Conditional Imitation Learning. arXiv.

3. Badrinarayanan, V., Kendall, A., and Cipolla, R. (2017). Segnet: A deep convolutional encoder-decoder architecture for image segmentation. IEEE Trans. Pattern Anal. Mach. Intell.

4. Redmon, J., Divvala, S., Girshick, R., and Farhadi, A. (2016, January 27–30). You only look once: Unified, real-time object detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.

5. Zhou, T., Brown, M., Snavely, N., and Lowe, D.G. (2017, January 21–26). Unsupervised learning of depth and ego-motion from video. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.