Affiliation:
1. Beijing Key Laboratory for Network System Architecture and Convergence, Beijing University of Posts and Telecommunications, Beijing, China
Abstract
Multi‐agent reinforcement learning (MARL) has played an increasingly important role in intelligent traffic signal control due to its self‐learning ability. However, existing algorithms focus only on the design of the signal timing mechanism and ignore the exponential growth of the joint action dimension as the number of intersections increases, which ultimately makes learning difficult. In this paper, traditional traffic methods are introduced into MARL to flexibly determine the phase and duration at each intersection. The proposed MARL algorithm, based on mean field theory, approximates the interactions among a large number of agents as pairwise interactions between each agent and the mean effect of its neighbours, which effectively reduces the dimension of the joint action space in a multi‐agent environment and makes learning more robust. In addition, to improve the performance of traditional traffic methods, a recurrent neural network (RNN) is combined with an improved Webster's formula with revised parameters to dynamically determine the phase duration from the historical traffic flow volume. The simulation results indicate that the proposed algorithm scales better than the baseline methods and has great potential for application in large‐scale road‐network scenarios.
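For readers unfamiliar with the timing rule the abstract refers to, the following is a minimal sketch of the classic Webster formula, not the improved version with revised parameters proposed in the paper. It assumes the textbook form C0 = (1.5 L + 5) / (1 - Y), where L is the total lost time per cycle in seconds and Y is the sum of the critical flow ratios of the phases. The function names and the sample values are hypothetical and chosen only for illustration.

```python
def webster_cycle_length(lost_time_s, flow_ratios):
    """Classic Webster optimal cycle length in seconds (textbook form).

    lost_time_s: total lost time per cycle, in seconds.
    flow_ratios: critical flow ratio (flow / saturation flow) per phase.
    """
    total_ratio = sum(flow_ratios)
    # The formula is only defined for an undersaturated intersection.
    if not 0.0 < total_ratio < 1.0:
        raise ValueError("sum of flow ratios must lie strictly between 0 and 1")
    return (1.5 * lost_time_s + 5.0) / (1.0 - total_ratio)


def green_splits(cycle_s, lost_time_s, flow_ratios):
    """Split the effective green time across phases in proportion to flow ratio."""
    total_ratio = sum(flow_ratios)
    if total_ratio <= 0.0:
        raise ValueError("sum of flow ratios must be positive")
    effective_green = cycle_s - lost_time_s
    return [effective_green * ratio / total_ratio for ratio in flow_ratios]


if __name__ == "__main__":
    # Hypothetical two-phase intersection: 12 s lost time, flow ratios 0.3 and 0.2.
    cycle = webster_cycle_length(12.0, [0.3, 0.2])
    greens = green_splits(cycle, 12.0, [0.3, 0.2])
    print(cycle, greens)
```

In the setting described by the abstract, the flow ratios would come from a predictor of traffic volume (an RNN in the paper) rather than from fixed values as in this sketch.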
Publisher
Institution of Engineering and Technology (IET)
Subject
Law, Mechanical Engineering, General Environmental Science, Transportation
Cited by
4 articles.