1. A.W. Moore. Variable Resolution Dynamic Programming: Efficiently Learning Action Maps in Multivariate Real-valued State-spaces. In Proceedings of the Eighth International Workshop on Machine Learning, pages 333–337, 1991.
2. C. Watkins and P. Dayan. Technical Note: Q-Learning. Machine Learning, 8:279–292, 1992.
3. D. Chapman and L. Kaelbling. Input Generalization in Delayed Reinforcement Learning: An Algorithm and Performance Comparisons, 1991.
4. M. Dorigo and H. Bersini. A Comparison of Q-Learning and Classifier Systems. In Proceedings of SAB III, pages 248–255. MIT Press, 1994.
5. N. Chatenet. Adaptation of Turnpike Theorem in a Variant of Q-Learning. Technical Report 96016, University of Bordeaux I, 1996.