1. Bernstein, D.S., Givan, R., Immerman, N., Zilberstein, S.: The complexity of decentralized control of Markov decision processes. Math. Oper. Res. 27(4), 819–840 (2002). https://doi.org/10.1287/moor.27.4.819.297
2. Diallo, E.A.O.: Lecture Notes in Computer Science (Lecture Notes in Artificial Intelligence) (2018)
3. Henderson, P., Islam, R., Bachman, P., Pineau, J., Precup, D., Meger, D.: Deep reinforcement learning that matters. CoRR abs/1709.06560 (2017). http://arxiv.org/abs/1709.06560
4. Huang, K., Chen, X., Yu, Z., Yang, C., Gui, W.: Heterogeneous cooperative belief for social dilemma in multi-agent system. Appl. Math. Comput. 320, 572–579 (2018)
5. Kim, D., et al.: Learning to schedule communication in multi-agent reinforcement learning. arXiv preprint arXiv:1902.01554 (2019)