Multi-Agent Decision-Making Modes in Uncertain Interactive Traffic Scenarios via Graph Convolution-Based Deep Reinforcement Learning-Reference-Cited by-同舟云学术

Multi-Agent Decision-Making Modes in Uncertain Interactive Traffic Scenarios via Graph Convolution-Based Deep Reinforcement Learning

Published:2022-06-17 Issue:12 Volume:22 Page:4586
ISSN:1424-8220
Container-title:Sensors
language:en
Short-container-title:Sensors

Author:

Gao Xin^ORCID,Li Xueyuan,Liu Qi,Li Zirui^ORCID,Yang Fan^ORCID,Luan Tian

Abstract

As one of the main elements of reinforcement learning, the design of the reward function is often not given enough attention when reinforcement learning is used in concrete applications, which leads to unsatisfactory performances. In this study, a reward function matrix is proposed for training various decision-making modes with emphasis on decision-making styles and further emphasis on incentives and punishments. Additionally, we model a traffic scene via graph model to better represent the interaction between vehicles, and adopt the graph convolutional network (GCN) to extract the features of the graph structure to help the connected autonomous vehicles perform decision-making directly. Furthermore, we combine GCN with deep Q-learning and multi-step double deep Q-learning to train four decision-making modes, which are named the graph convolutional deep Q-network (GQN) and the multi-step double graph convolutional deep Q-network (MDGQN). In the simulation, the superiority of the reward function matrix is proved by comparing it with the baseline, and evaluation metrics are proposed to verify the performance differences among decision-making modes. Results show that the trained decision-making modes can satisfy various driving requirements, including task completion rate, safety requirements, comfort level, and completion efficiency, by adjusting the weight values in the reward function matrix. Finally, the decision-making modes trained by MDGQN had better performance in an uncertain highway exit scene than those trained by GQN.

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Biochemistry,Instrumentation,Atomic and Molecular Physics, and Optics,Analytical Chemistry

Link

https://www.mdpi.com/1424-8220/22/12/4586/pdf

Reference30 articles.

1. The Future of Autonomous Vehicles in the Opinion of Automotive Market Users

2. Decision-Making Technology for Autonomous Vehicles Learning-Based Methods, Applications and Future Outlook;Liu;Proceedings of the IEEE International Intelligent Transportation Systems Conference,2021

3. Joint Optimization of Sensing, Decision-Making and Motion-Controlling for Autonomous Vehicles: A Deep Reinforcement Learning Approach

4. Past, Present, and Future of Simultaneous Localization and Mapping: Toward the Robust-Perception Age

5. A Survey on Sensor Technologies for Unmanned Ground Vehicles

Cited by 13 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. AF-DQN: A Large-Scale Decision-Making Method at Unsignalized Intersections with Safe Action Filter and Efficient Exploratory Training Strategy;2024 IEEE Intelligent Vehicles Symposium (IV);2024-06-02

2. Optimized TOPSIS technique for trajectory selection of self-driving vehicles on highways;Journal of Intelligent & Fuzzy Systems;2024-03-21

3. Rate GQN: A Deviations-Reduced Decision-Making Strategy for Connected and Automated Vehicles in Mixed Autonomy;IEEE Transactions on Intelligent Transportation Systems;2024-01

4. A Control Method for Initiating and Maintaining Formation of Wheeled Skid-Steering Vehicles based on Distributed Model Predictive Control;2023 6th International Conference on Robotics, Control and Automation Engineering (RCAE);2023-11-03

5. A homologous and heterogeneous multi-view inter-patient adaptive network for arrhythmia detection;Computer Methods and Programs in Biomedicine;2023-11