Adaptive Deep Q-Network Algorithm with Exponential Reward Mechanism for Traffic Control in Urban Intersection Networks-Reference-Cited by-同舟云学术

Adaptive Deep Q-Network Algorithm with Exponential Reward Mechanism for Traffic Control in Urban Intersection Networks

Published:2022-11-06 Issue:21 Volume:14 Page:14590
ISSN:2071-1050
Container-title:Sustainability
language:en
Short-container-title:Sustainability

Author:

Fuad Muhammad Riza Tanwirul^ORCID,Fernandez Eric Okto,Mukhlish Faqihza^ORCID,Putri Adiyana^ORCID,Sutarto Herman Yoseph^ORCID,Hidayat Yosi Agustina^ORCID,Joelianto Endra^ORCID

Abstract

The demand for transportation has increased significantly in recent decades in line with the increasing demand for passenger and freight mobility, especially in urban areas. One of the most negative impacts is the increasing level of traffic congestion. A possible short-term solution to solve this problem is to utilize a traffic control system. However, most traffic control systems still use classical control algorithms with the green phase sequence determined, based on a specific strategy. Studies have proven that this approach does not provide the expected congestion solution. In this paper, an adaptive traffic controller was developed that uses a reinforcement learning algorithm called deep Q-network (DQN). Since the DQN performance is determined by reward selection, an exponential reward function, based on the macroscopic fundamental diagram (MFD) of the distribution of vehicle density at intersections was considered. The action taken by the DQN is determining traffic phases, based on various rewards, ranging from pressure to adaptive loading of pressure and queue length. The reinforcement learning algorithm was then applied to the SUMO traffic simulation software to assess the effectiveness of the proposed strategy. The DQN-based control algorithm with the adaptive reward mechanism achieved the best performance with a vehicle throughput of 56,384 vehicles, followed by the classical and conventional control methods, such as Webster (50,366 vehicles), max-pressure (50,541 vehicles) and uniform (46,241 vehicles) traffic control. The significant increase in vehicle throughput achieved by the adaptive DQN-based control algorithm with an exponential reward mechanism means that the proposed traffic control could increase the area productivity, implying that the intersections could accommodate more vehicles so that the possibility of congestion was reduced. The algorithm performed remarkably in preventing congestion in a traffic network model of Central Jakarta as one of the world’s most congested cities. This result indicates that traffic control design using MFD as a performance measure can be a successful future direction in the development of reinforcement learning for traffic control systems.

Funder

Ministry of Education, Culture, Research, and Technology of the Republic of Indonesia

Publisher

MDPI AG

Subject

Management, Monitoring, Policy and Law,Renewable Energy, Sustainability and the Environment,Geography, Planning and Development,Building and Construction

Link

https://www.mdpi.com/2071-1050/14/21/14590/pdf

Reference47 articles.

1. A distributed control method for urban networks using multi-agent reinforcement learning based on regional mixed strategy Nash-equilibrium;IEEE Access,2020

2. Reinforcement learning in urban network traffic signal control: A systematic literature review;Expert Syst. Appl.,2022

3. Varaiya, P. (2013). Advances in Dynamic Network Modeling in Complex Transportation Systems, Springer.

4. Maximum Pressure Controller for Stabilizing Queues in Signalized Arterial Networks;Transp. Res. Rec.,2014

5. Webster, F.V. (1957). Traffic Signal Settings, Department of Scientific and Industrial Research. Road Research Technique Paper.

Cited by 5 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Uniformity of markov elements in deep reinforcement learning for traffic signal control;Electronic Research Archive;2024

2. An efficient algorithm for optimal route node sensing in smart tourism Urban traffic based on priority constraints;Wireless Networks;2023-12-08

3. On Max Pressure Urban Traffic Control with Learning;2023 IEEE 9th Information Technology International Seminar (ITIS);2023-10-18

4. Normalized Traffic Features Using Graph Signal Processing for Traffic Flow Prediction;2023 IEEE 9th Information Technology International Seminar (ITIS);2023-10-18

5. A Resilient Intelligent Traffic Signal Control Scheme for Accident Scenario at Intersections via Deep Reinforcement Learning;Sustainability;2023-01-10