1. Stabilising Experience Replay for Deep Multi-Agent Reinforcement Learning;foerster;International Conference on Machine Learning,2017
2. Counterfactual Multi-Agent Policy Gradients;foerster;Thirty-Second AAAI Conference on Artificial Intelligence,2018
3. Multiagent Bidirectionally-Coordinated Nets: Emergence of Human-level Coordination in Learning to Play StarCraft Combat Games;peng,2017
4. Semicentralized Deep Deterministic Policy Gradient in Cooperative StarCraft Games
5. Episodic Exploration for Deep Deterministic Policies: An Application to StarCraft Micromanagement Tasks;usunier,2016