1. Multi-agent actor-critic for mixed cooperative-competitive environments;Lowe;Advances in Neural Information Processing Systems,2017
2. Deep recurrent q-learning for partially observable mdps;Hausknecht,2015
3. Learning to Collaborate
4. Learning multiagent communication with backpropagation;Sukhbaatar;Advances in neural information processing systems,2016