1. A novel modular Q-learning architecture to improve performance under incomplete learning in a grid soccer game;Araghi;Eng. Appl. Artif. Intell.,2013
2. Aştefănoaei, L., De Boer, F.S., Dastani, M., 2010. Strategic executions of choreographed timed normative multi-agent systems. In: Proceedings of the 9th International Conference on Autonomous Agents and Multi-agent Systems, Toronto, Canada, May, pp. 965–972.
3. Analyzing myopic approaches for multi-agent communication;Becker;Comput. Intell.,2009
4. Benda, M., Jagannathan, V., Dodhiawalla, R., 1986. On optimal cooperation of knowledge sources, Technical Report No. BCS-G2010-28, Boeing Advanced Technology Center, Boeing Computer Services, Seattle, WA.
5. A comprehensive survey of multi-agent reinforcement learning;Busoniu;IEEE Trans. Syst. Man Cybern. Part C: Appl. Rev.,2008