1. Kaelbling, Reinforcement learning: a survey, Journal of Artificial Intelligence Research, 1996.
2. C. Watkins, Learning from delayed rewards, Ph.D. Thesis, Cambridge University, Cambridge, England, 1989.
3. Tesauro, Practical issues in temporal difference learning, Machine Learning, 1992.
4. P. Stone, R.S. Sutton, G. Kuhlmann, Reinforcement learning for RoboCup-soccer Keepaway, Adaptive Behavior 13 (3).
5. M.E. Taylor, P. Stone, Y. Liu, Value functions for RL-based behavior transfer: a comparative study, in: Proceedings of the Twentieth National Conference on Artificial Intelligence, 2005.