Affiliation:
1. Transport Systems Planning and Transport Telematics, Technische Universität Berlin, Germany. E-mail: tziemke@vsp.tu-berlin.de
2. Instituto da Informática, Universidade Federal do Rio Grande do Sul (UFRGS), Brazil. E-mails: lnalegre@inf.ufrgs.br, bazzan@inf.ufrgs.br
Abstract
Reinforcement learning is an efficient, widely used machine learning technique that performs well when the state and action spaces have a reasonable size. This is rarely the case regarding control-related problems, as for instance controlling traffic signals. Here, the state space can be very large. In order to deal with the curse of dimensionality, a rough discretization of such space can be employed. However, this is effective just up to a certain point. A way to mitigate this is to use techniques that generalize the state space such as function approximation. In this paper, a linear function approximation is used. Specifically, SARSA ( λ ) with Fourier basis features is implemented to control traffic signals in the agent-based transport simulation MATSim. The results are compared not only to trivial controllers such as fixed-time, but also to state-of-the-art rule-based adaptive methods. It is concluded that SARSA ( λ ) with Fourier basis features is able to outperform such methods, especially in scenarios with varying traffic demands or unexpected events.
Reference34 articles.
1. Hierarchical control of traffic signals using Q-learning with tile coding;Abdoos;Appl. Intell.,2014
2. Residual Algorithms: Reinforcement Learning with Function Approximation
3. Opportunities for multiagent systems and multiagent reinforcement learning in traffic control;Bazzan;Autonomous Agents and Multiagent Systems,2009
4. Multi-agent model predictive control of signaling split in urban traffic networks;de Oliveira;Transportation Research Part C: Emerging Technologies,2010
5. M. Di Taranto, UTOPIA, in: Proc. of the IFAC-IFIP-IFORS Conference on Control, Computers, Communication in Transportation, International Federation of Automatic Control, Paris, 1989, pp. 245–252.
Cited by
7 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献