1. R. H. Crites and A. G. Barto, (1996). “Improving elevator performance using reinforcement learning.” In: D. Touretzky et al., eds., Advances in Neural Information Processing Systems 8, 1017–1023, MIT Press.
2. A. Greenwald and J. O. Kephart, (1999). “Shopbots and pricebots.” Proceedings of IJCAI-99, 506–511.
3. J. Hu and M. P. Wellman, (1996). “Self-fulfilling bias in multiagent learning.” Proceedings of ICMAS-96, AAAI Press.
4. J. Hu and M. P. Wellman, (1998). “Multiagent reinforcement learning: theoretical framework and an algorithm.” Proceedings of ICML-98, Morgan Kaufmann.
5. J. O. Kephart, J. E. Hanson and J. Sairamesh, (1998). “Price-war dynamics in a free-market economy of software agents.” Proceedings of ALIFE-VI, Los Angeles.