Self-Learning Robot Autonomous Navigation with Deep Reinforcement Learning Techniques-Reference-Cited by-同舟云学术

Self-Learning Robot Autonomous Navigation with Deep Reinforcement Learning Techniques

Published:2023-12-30 Issue:1 Volume:14 Page:366
ISSN:2076-3417
Container-title:Applied Sciences
language:en
Short-container-title:Applied Sciences

Author:

Pintos Gómez de las Heras Borja¹^ORCID,Martínez-Tomás Rafael¹^ORCID,Cuadra Troncoso José Manuel¹^ORCID

Affiliation:

1. Department of Artificial Intelligence, National Distance Education University, Juan del Rosal 16, 28040 Madrid, Spain

Abstract

Complex and high-computational-cost algorithms are usually the state-of-the-art solution for autonomous driving cases in which non-holonomic robots must be controlled in scenarios with spatial restrictions and interaction with dynamic obstacles while fulfilling at all times safety, comfort, and legal requirements. These highly complex software solutions must cover the high variability of use cases that might appear in traffic conditions, especially when involving scenarios with dynamic obstacles. Reinforcement learning algorithms are seen as a powerful tool in autonomous driving scenarios since the complexity of the algorithm is automatically learned by trial and error with the help of simple reward functions. This paper proposes a methodology to properly define simple reward functions and come up automatically with a complex and successful autonomous driving policy. The proposed methodology has no motion planning module so that the computational power can be limited like in the reactive robotic paradigm. Reactions are learned based on the maximization of the cumulative reward obtained during the learning process. Since the motion is based on the cumulative reward, the proposed algorithm is not bound to any embedded model of the robot and is not being affected by uncertainties of these models or estimators, making it possible to generate trajectories with the consideration of non-holonomic constrains. This paper explains the proposed methodology and discusses the setup of experiments and the results for the validation of the methodology in scenarios with dynamic obstacles. A comparison between the reinforcement learning algorithm and state-of-the-art approaches is also carried out to highlight how the methodology proposed outperforms state-of-the-art algorithms.

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science

Link

https://www.mdpi.com/2076-3417/14/1/366/pdf

Reference28 articles.

1. Warren, C.W. (1990, January 13–18). Multiple robot path coordination using artificial potential fields. Proceedings of the IEEE International Conference on Robotics and Automation, Cincinnati, OH, USA.

2. Motion Planning in Dynamic Environments Using Velocity Obstacles;Fiorini;Int. J. Robot. Res.,1998

3. The dynamic window approach to collision avoidance;Fox;IEEE Robot. Autom. Mag.,1997

4. Reactive navigation in real environments using partial center of area method;Troncoso;Robot. Auton. Syst.,2010

5. Tobaruela, J.A., and Rodríguez, A.O. (2017). Rodríguez Reactive navigation in extremely dense and highly intricate environments. PLoS ONE, 12.