1. Boutilier, C.: Sequential Optimality and Coordination in Multi agent Systems, In Proc. of IJCAI-99, Stockholm, Sweden, 1999.
2. Claus, C., Boutilier, C.: The Dynamics of Reinforcement Learning in Cooperative Multiagents Systems, In Proc. of AAAI-97 Multiagent Learning Workshop, pg. 13–18, Providence, 1997.
3. Dorigo, M.: Optimization, Learning, and Natural Algorithms, PhD thesis, Politecnico da Milano, Italy, 1992.
4. Gambardella, L., M., Dorigo, M.: Ant-Q: A reinforcement Learning Approach to the Traveling Salesman Problem, In Proceedings of the 12th International Conference on Machine Learning, pp. 252–260, Morgan Kaufmann, 1995.
5. Hu, J., Wellman, M.: Multiagent Reinforcement Learning: Theoretical Framework and an Algorithm, In Proc. 15th Int. Conf. on Machine Learning, pp. 242–250, Morgan Kaufmann, 1998.