1. Reducing overestimation bias in multi-agent domains using double centralized critics;Ackermann,2019
2. Averaged-DQN: Variance reduction and stabilization for deep reinforcement learning;Anschel,2017
3. Emergent tool use from multi-agent autocurricula;Baker,2019
4. Dota 2 with large scale deep reinforcement learning;Berner,2019
5. The complexity of decentralized control of Markov decision processes;Bernstein,2000