1. Foerster, J., Farquhar, G., Afouras, T., Nardelli, N., Whiteson, S.: Counterfactual multiagent policy gradients. arXiv preprint
arXiv:1705.08926
(2017)
2. Lowe, R.: Multi-agent actor-critic for mixed cooperative-competitive environments. In: Neural Information Processing Systems (NIPS) (2017)
3. Shihui, L., Yi, W., Xinyue, C., Honghua, D., Fei, F., Stuart, R.: Robust multi-agent reinforcement learning via minimax deep deterministic policy gradient. In: AAAI Conference on Artificial Intelligence (AAAI) (2019)
4. Yeung, S., Russakovsky, O., Mori, G., Fei-Fei, L.: End-to-end learning of action detection from frame glimpses in videos. In: The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2678–2687 (2016)
5. Mnih, V., Kavukcuoglu, K., Silver, D.: Playing atari with deep reinforcement learning:
arXiv:1312.5602
[cs.LG]