Research on Behavioral Decision at an Unsignalized Roundabout for Automatic Driving Based on Proximal Policy Optimization Algorithm

Published:2024-03-29 Issue:7 Volume:14 Page:2889
ISSN:2076-3417
Container-title:Applied Sciences
language:en
Short-container-title:Applied Sciences

Author:

Gan Jingpeng¹, Zhang Jiancheng², Liu Yuansheng²

Affiliation:

1. College of Urban Rail Transit and Logistics, Beijing Union University, Beijing 100101, China

2. College of Robotics, Beijing Union University, Beijing 100101, China

Abstract

Unsignalized roundabouts have a significant impact on traffic flow and vehicle safety. To address the challenge of autonomous vehicles passing through roundabouts under low penetration rates, improve their efficiency, and ensure safety and stability, we propose using the proximal policy optimization (PPO) algorithm to improve decision-making behavior. We develop an optimization-based behavioral choice model for autonomous vehicles that combines gap acceptance theory with deep reinforcement learning based on the PPO algorithm. We also employ the CoordConv network to build an aerial view for gathering spatial perception information. Furthermore, a dynamic multi-objective reward mechanism is introduced to maximize the PPO algorithm’s reward pool function while quantifying environmental factors. Simulation experiments show that, compared with the PPO+CCMR algorithm, our optimized PPO algorithm significantly improves training efficiency, raising the reward value function by 2.85%, 7.17%, and 19.58% in scenarios with 20, 100, and 200 social vehicles, respectively. The effectiveness of simulation training also increases by 11.1%, 13.8%, and 7.4%, and crossing time is reduced by 2.37%, 2.62%, and 13.96%. The optimized PPO algorithm improves path selection during simulation training: autonomous vehicles tend to drive in the inner ring over time, although the influence of social vehicles on path selection diminishes as their number increases. The safety of autonomous vehicles remains largely unaffected by the optimized PPO algorithm.
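
For readers unfamiliar with PPO, the following is a minimal sketch of the clipped surrogate objective from the standard PPO formulation. It is not the authors' optimized variant, and it does not model the gap acceptance, CoordConv, or reward mechanism described in the abstract. The function name `ppo_clip_objective` and the default `epsilon=0.2` are illustrative assumptions, not values taken from the paper.

```python
def ppo_clip_objective(ratios, advantages, epsilon=0.2):
    """Mean clipped surrogate objective of standard PPO (to be maximized).

    ratios     -- probability ratios pi_new(a|s) / pi_old(a|s), one per sample
    advantages -- advantage estimates for the same samples
    epsilon    -- clip range (0.2 is a common choice, assumed here)
    """
    if len(ratios) != len(advantages):
        raise ValueError("ratios and advantages must have the same length")
    if not ratios:
        raise ValueError("at least one sample is required")

    total = 0.0
    for ratio, advantage in zip(ratios, advantages):
        # Restrict the ratio to [1 - epsilon, 1 + epsilon].
        clipped_ratio = max(1.0 - epsilon, min(ratio, 1.0 + epsilon))
        # Take the more pessimistic of the unclipped and clipped terms.
        total += min(ratio * advantage, clipped_ratio * advantage)
    return total / len(ratios)


if __name__ == "__main__":
    # Hypothetical values for illustration only.
    print(ppo_clip_objective([1.5, 0.5], [1.0, -1.0]))
```

Taking the minimum of the two terms removes the incentive to move the new policy far from the old one in a single update, which is the property that makes PPO training comparatively stable.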

Funder

National Key R&D Program

National Natural Science Foundation of China

National Natural Science Foundation of China Key Project Collaboration

Academic Research Projects of Beijing Union University

Publisher

MDPI AG

Link

https://www.mdpi.com/2076-3417/14/7/2889/pdf

Reference: 30 articles.

1. Samizadeh, S., Nikoofard, A., and Yektamoghadam, H. (2022, January 2–3). Decision Making for Autonomous Vehicles’ Strategy in Triple-Lane Roundabout Intersections. Proceedings of the 2022 8th International Conference on Control, Instrumentation and Automation (ICCIA), Tehran, Iran.

2. Mohebifard, R., and Hajbabaie, A. (2020, January 20–23). Effects of Automated Vehicles on Traffic Operations at Roundabouts. Proceedings of the IEEE International Conference on Intelligent Transportation Systems, Rhodes, Greece.

3. Naderi, M., Papageorgiou, M., Karafyllis, I., and Papamichail, I. (2022, January 8–12). Automated vehicle driving on large lane-free roundabouts. Proceedings of the 2022 IEEE 25th International Conference on Intelligent Transportation Systems (ITSC), Macau, China.

4. Zhang, Y., Zhang, J., and Dong, B. (2022, January 28–30). An optimal management scheme for connected vehicles merging at a roundabout. Proceedings of the 2022 6th CAA International Conference on Vehicular Control and Intelligence (CVCI), Nanjing, China.

5. Qian, D., Qi, H., Liu, Z., Zhou, Z., and Yi, J. (2023). Research on Autonomous Decision-Making in Air-Combat Based on Improved Proximal Policy Optimization. J. Syst. Simul., 1–11.

Cited by 1 article.

1. Enhancing Autonomous Driving Navigation Using Soft Actor-Critic;Future Internet;2024-07-04