1. Implementing the Deep Q-Network;Roderick;ArXiv,2017
2. Safe, multi-agent, reinforcement learning for autonomous driving;Shalev-Shwartz;ArXiv
3. The Surprising Effectiveness of MAPPO in Cooperative, Multi-Agent Games;Yu;ArXiv,2021
4. The Hanabi challenge: A new frontier for AI research