1. T. Bansal, J. W. Pachocki, S. Sidor, I. Sutskever, I. Mordatch, Emergent complexity via multi-agent competition, arXiv abs/1710.03748(2018).
2. M. Brittain, J. Bertram, X. Yang, P. Wei, Prioritized sequence experience replay, arXiv preprint arXiv:1905.12726(2019).
3. Technical Report 10,003 Multi-agent Reinforcement Learning : An Overview;Busoniu,2012
4. Shared experience actor-critic for multi-agent reinforcement learning;Christianos,2020
5. GEP-PG: decoupling exploration and exploitation in deep reinforcement learning algorithms;Colas,2018