1. Coordinated deep reinforcement learners for traffic light control;van der Pol;Proc Learn Inference Control Multi-Agent Syst (NIPS),2016
2. The concrete distribution: A continuous relaxation of discrete random variables;Maddison;arXiv:1611.00712,2016
3. Using a deep reinforcement learning agent for traffic signal control;Genders;arXiv:1611.01142,2016
4. Categorical reparameterization with Gumbel-Softmax;Jang;arXiv:1611.01144,2016
5. Deep reinforcement learning for traffic light control in vehicular networks;Liang;arXiv:1803.11115,2018