Effects analysis of reward functions on reinforcement learning for traffic signal control

Published:2022-11-21 Issue:11 Volume:17 Page:e0277813
ISSN:1932-6203
Container-title:PLOS ONE
language:en
Short-container-title:PLoS ONE

Author:

Lee Hyosun, Han Yohee, Kim Youngchan, Kim Yong Hoon

Abstract

The increasing traffic demand in urban areas frequently causes traffic congestion, which can be managed only through intelligent traffic signal controls. Although many recent studies have addressed reinforcement learning for traffic signal control (RL-TSC), most have focused on improving performance from an intersection perspective, targeting virtual simulation. Intersection-level performance indexes are averages weighted by traffic flow; therefore, if the balance of each movement is not considered, the green time may be overly concentrated on movements with heavy flow rates. Furthermore, as the ultimate purpose of traffic signal control research is to apply these controls to real-world intersections, it is necessary to consider real-world constraints. Hence, this study aims to design RL-TSC considering real-world applicability and to confirm the appropriate design of the reward function. The limitations of real-world detectors and the dual-ring traffic signal system are taken into account in the model design to facilitate real-world application. To design a reward that balances traffic movements, we define the average delay weighted by traffic volume per lane and the entropy of delay in the reward function. Model training is performed at a prototype intersection to ensure scalability to multiple intersections. The model pre-trained on the prototype is then evaluated by applying it to a network with two intersections without additional training. As a result, the reward function considering the equality of traffic movements shows the best performance. The proposed model reduces the average delay by more than 7.4% and 15.0% at the two intersections, respectively, compared to the existing real-time adaptive signal control.
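
The abstract names two ingredients of the reward, an average delay weighted by traffic volume per lane and an entropy of delay, but does not give the formula. The following Python is only an illustrative sketch of how such a reward might be assembled. The function names, the normalized Shannon entropy, the additive combination, and the `balance_weight` parameter are all assumptions made here, not the authors' definition.

```python
import math


def weighted_average_delay(delays, volumes, lanes):
    """Average delay over movements, weighted by volume per lane (assumed form)."""
    weights = [v / n for v, n in zip(volumes, lanes)]
    total = sum(weights)
    if total == 0:
        return 0.0
    return sum(d * w for d, w in zip(delays, weights)) / total


def delay_entropy(delays):
    """Normalized Shannon entropy of each movement's share of total delay.

    Returns 1.0 when delay is spread evenly across movements and
    approaches 0.0 when one movement carries almost all of the delay.
    """
    total = sum(delays)
    if total == 0 or len(delays) < 2:
        return 1.0
    shares = [d / total for d in delays]
    entropy = -sum(p * math.log(p) for p in shares if p > 0)
    return entropy / math.log(len(delays))


def reward(delays, volumes, lanes, balance_weight=10.0):
    """Hypothetical reward: penalize delay, reward balance across movements."""
    avg = weighted_average_delay(delays, volumes, lanes)
    return -avg + balance_weight * delay_entropy(delays)


# Two scenarios with the same weighted average delay (20 s) but different balance.
volumes = [400, 400, 400, 400]   # vehicles per hour per movement (made-up values)
lanes = [2, 2, 2, 2]             # lanes per movement (made-up values)
balanced = reward([20, 20, 20, 20], volumes, lanes)
skewed = reward([5, 5, 5, 65], volumes, lanes)
print(balanced > skewed)
```

In this sketch the two scenarios have the same weighted average delay, so only the entropy term separates them, which is the kind of movement-level balance the abstract argues a plain intersection-level average cannot capture.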

Funder

Korean National Police Agency

Publisher

Public Library of Science (PLoS)

Subject

Multidisciplinary
