1. Rishabh Agarwal, Dale Schuurmans, and Mohammad Norouzi. 2020. An optimistic perspective on offline reinforcement learning. In International Conference on Machine Learning. PMLR, 104--114.
2. Arthur Argenson and Gabriel Dulac-Arnold. 2020. Model-based offline planning. arXiv preprint arXiv:2008.05556 (2020).
3. James Ault and Guni Sharon. 2021. Reinforcement Learning Benchmarks for Traffic Signal Control. In Proceedings of the Thirty-fifth Conference on Neural Information Processing Systems (NeurIPS 2021) Datasets and Benchmarks Track.
4. Erik Bernhardsson. 2020. Annoy: Approximate Nearest Neighbors in C/Python. https://pypi.org/project/annoy/
5. Toward A Thousand Lights: Decentralized Deep Reinforcement Learning for Large-Scale Traffic Signal Control