Multiagent Manuvering with the Use of Reinforcement Learning

Published: 2023-04-17 Issue: 8 Volume: 12 Page: 1894
ISSN: 2079-9292
Container-title: Electronics
Language: en
Short-container-title: Electronics

Author:

Orłowski Mateusz¹², Skruch Paweł¹²

Affiliation:

1. Aptiv Services Poland S.A., ul. Podgórki Tynieckie 2, 30-399 Cracow, Poland

2. Department of Automatic Control and Robotics, AGH University of Science and Technology, Adam Mickiewicz Avenue 30/B1, 30-059 Krakow, Poland

Abstract

This paper presents an approach for defining, solving, and implementing dynamic cooperative maneuver problems in autonomous driving applications. The formulation of these problems treats a set of cooperating cars as a multiagent system. A reinforcement learning technique is applied to find a suboptimal policy. The key element of the presented approach is a multiagent maneuvering environment that allows for the simulation of car-like agents within an obstacle-constrained space. Each agent is tasked with reaching an individual goal, defined as a specific location in space, and the policy is trained through reinforcement learning so that every simulated car reaches its goal position. Three road scenarios (zipper, bottleneck, and crossroads) were used in the experiments. The trained policy solved the cooperation problem in all scenarios, and the positive effects of sharing rewards between agents are presented and studied. The results obtained in this work open opportunities for various automotive applications.
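
The ideas of car-like agents, individual goal positions, and rewards shared between agents can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper: the kinematic bicycle model, the progress-based reward, the `share` mixing weight, and all names (`Car`, `step`, `individual_reward`, `shared_rewards`) are hypothetical.

```python
import math
from dataclasses import dataclass


@dataclass
class Car:
    """State of one car-like agent plus its individual goal position."""
    x: float
    y: float
    heading: float  # radians
    goal: tuple[float, float]


def step(car: Car, speed: float, steering: float,
         dt: float = 0.1, wheelbase: float = 2.5) -> None:
    """Advance one car in place using a kinematic bicycle model (assumed)."""
    car.x += speed * math.cos(car.heading) * dt
    car.y += speed * math.sin(car.heading) * dt
    car.heading += speed / wheelbase * math.tan(steering) * dt


def individual_reward(car: Car, prev_distance: float) -> tuple[float, float]:
    """Return (reward, new_distance); reward is the progress made toward the goal."""
    distance = math.dist((car.x, car.y), car.goal)
    return prev_distance - distance, distance


def shared_rewards(individual: list[float], share: float = 0.5) -> list[float]:
    """Blend each agent's own reward with the team mean (hypothetical scheme).

    share = 0 keeps rewards fully individual; share = 1 gives every agent
    the same team-average reward.
    """
    team_mean = sum(individual) / len(individual)
    return [(1 - share) * r + share * team_mean for r in individual]
```

In this sketch, raising `share` makes an agent that yields to a neighbour benefit from the neighbour's progress, which is one simple way to encourage cooperation in scenarios such as a zipper merge or a bottleneck.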

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering, Computer Networks and Communications, Hardware and Architecture, Signal Processing, Control and Systems Engineering

Link

https://www.mdpi.com/2079-9292/12/8/1894/pdf
