Mean Field Multi-Agent Reinforcement Learning Method for Area Traffic Signal Control-Reference-Cited by-同舟云学术

Mean Field Multi-Agent Reinforcement Learning Method for Area Traffic Signal Control

Published:2023-11-17 Issue:22 Volume:12 Page:4686
ISSN:2079-9292
Container-title:Electronics
language:en
Short-container-title:Electronics

Author:

Zhang Zundong¹^ORCID,Zhang Wei¹,Liu Yuke¹,Xiong Gang²^ORCID

Affiliation:

1. School of Electrical and Control Engineering, North China University of Technology, Beijing 100144, China

2. State Key Laboratory for Multimodal Artificial Intelligence Systems, Chinese Academy of Sciences, Beijing 100190, China

Abstract

Reinforcement learning is an effective method for adaptive traffic signal control in urban transportation networks. As the number of training rounds increases, the optimal control strategy is learned, and the learning capabilities of deep neural networks are further enhanced, thereby avoiding the limitations of traditional signal control methods. However, when faced with the sequential decision tasks of regional signal control, it encounters issues such as the curse of dimensionality and environmental non-stationarity. To address the limitations of traditional reinforcement learning algorithms applied to multiple intersections, the mean field theory is applied. This models the traffic signal control problem at multiple intersections within a region as interactions between individual intersections and the average effects of neighboring intersections. By decomposing the Q-function through bilateral estimation between the agent and its neighbors, this method reduces the complexity of interactions between agents while preserving global interactions between the agents. A traffic signal control model based on Mean Field Multi-Agent Reinforcement Learning (MFMARL) was constructed, containing two algorithms: Mean Field Q-Network Area Traffic Signal Control (MFQ-ATSC) and Mean Field Actor-Critic Network Area Traffic Signal Control (MFAC-ATSC). The model was validated using the SUMO simulation platform. The experimental results indicate that across different metrics, such as average speed, the mean field reinforcement learning method outperforms classical signal control methods and several existing approaches.

Funder

National Natural Science Foundation Project

China National Railway Group Co., Ltd. Science and Technology Research and Development Program Project

Open Topic of National Railway Intelligent Transportation System Engineering Technology Research Center

Guangdong Provincial Key Area Research and Development Program Project

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Computer Networks and Communications,Hardware and Architecture,Signal Processing,Control and Systems Engineering

Link

https://www.mdpi.com/2079-9292/12/22/4686/pdf

Reference22 articles.

1. Recent Advances in Reinforcement Learning for Traffic Signal Control;Hua;ACM SIGKDD Explor. Newsl.,2020

2. Mikami, S., and Kakazu, Y. (1994, January 27–29). Genetic reinforcement learning for cooperative traffic signal control. Proceedings of the First IEEE Conference on Evolutionary Computation, IEEE World Congress on Computational Intelligence, Orlando, FL, USA.

3. Human-level control through deep reinforcement learning;Mnih;Nature,2015

4. Priority of Dedicated Bus Arterial Control Based on Deep Reinforcement Learning;Shang;J. Transp. Syst. Eng. Inf. Technol.,2021

5. Traffic signal timing via deep reinforcement learning;Li;IEEE/CAA J. Autom. Sin.,2016

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. CVDMARL: A Communication-Enhanced Value Decomposition Multi-Agent Reinforcement Learning Traffic Signal Control Method;Sustainability;2024-03-05