Continuous residual reinforcement learning for traffic signal control optimization

Author:

Aslani Mohammad1,Seipel Stefan12,Wiering Marco3

Affiliation:

1. Department of Industrial Development, IT and Land Management, University of Gävle, Gävle, Sweden.

2. Division of Visual Information and Interaction, Department of Information Technology, Uppsala University, Uppsala, Sweden.

3. Institute of Artificial Intelligence and Cognitive Engineering, University of Groningen, Groningen, the Netherlands.

Abstract

Traffic signal control can be naturally regarded as a reinforcement learning problem. Unfortunately, it is one of the most difficult classes of reinforcement learning problems owing to its large state space. A straightforward approach to address this challenge is to control traffic signals based on continuous reinforcement learning. Although they have been successful in traffic signal control, they may become unstable and fail to converge to near-optimal solutions. We develop adaptive traffic signal controllers based on continuous residual reinforcement learning (CRL-TSC) that is more stable. The effect of three feature functions is empirically investigated in a microscopic traffic simulation. Furthermore, the effects of departing streets, more actions, and the use of the spatial distribution of the vehicles on the performance of CRL-TSCs are assessed. The results show that the best setup of the CRL-TSC leads to saving average travel time by 15% in comparison to an optimized fixed-time controller.

Publisher

Canadian Science Publishing

Subject

General Environmental Science,Civil and Structural Engineering

Reference40 articles.

1. Abdoos, M., Mozayani, N., and Bazzan, A.L.C. 2011. Traffic light control in non-stationary environments based on multi agent Q-learning. In Proceedings of the 14th International IEEE Conference on Intelligent Transportation Systems (ITSC), Washington, DC, pp. 1580–1585. 10.1109/ITSC.2011.6083114.

2. Holonic multi-agent system for traffic signals control

3. Hierarchical control of traffic signals using Q-learning with tile coding

4. A New Approach to Manipulator Control: The Cerebellar Model Articulation Controller (CMAC)

5. AQCC. 2014. The coefficient of emissions in the warm state for gasoline light duty vehicles of Iran. Air Quality Control Company of Tehran, Municipality Tehran.

Cited by 16 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3