Affiliation:
1. Department of Control for Transportation and Vehicle Systems, Budapest University of Technology and Economics, H-1111 Budapest, Hungary
Abstract
This paper presents a cooperative highway platooning strategy that integrates Multi-Agent Reinforcement Learning (MARL) with Proximal Policy Optimization (PPO) to effectively manage the complex task of merging. In modern transportation systems, platooning—where multiple vehicles travel closely together under coordinated control—promises significant improvements in traffic flow and fuel efficiency. However, the challenge of merging, which involves dynamically adjusting the formation to incorporate new vehicles, remains challenging. Our approach leverages the strengths of MARL to enable individual vehicles within a platoon to learn optimal behaviors through interactions. PPO ensures stable and efficient learning by optimizing policies balancing exploration and exploitation. Simulation results show that our method achieves merging with safety and operational efficiency.
Funder
European Union
Ministry of Culture and Innovation of Hungary
National Research, Development and Innovation Fund
Hungarian Academy of Sciences
Reference39 articles.
1. Demba, A., and Möller, D.P.F. (2018, January 3–5). Vehicle-to-Vehicle Communication Technology. Proceedings of the 2018 IEEE International Conference on Electro/Information Technology (EIT), Rochester, MI, USA.
2. Van Phu, C.N., Farhi, N., Haj-Salem, H., and Lebacque, J.P. (2017, January 26–28). A vehicle-to-infrastructure communication based algorithm for urban traffic control. Proceedings of the 2017 5th IEEE International Conference on Models and Technologies for Intelligent Transportation Systems (MT-ITS), Naples, Italy.
3. Securing vehicle-to-everything (V2X) communication platforms;Hasan;IEEE Trans. Intell. Veh.,2020
4. Social Perception of Autonomous Vehicles;Krizsik;Period. Polytech. Transp. Eng.,2023
5. A conceptualization of vehicle platoons and platoon operations;Maiti;Transp. Res. Part Emerg. Technol.,2017