1. Operant behavior;Skinner;American Psychologist,1963
2. Reinforcement learning: A survey;Kaelbling;Journal of Artificial Intelligence Research,1996
3. Unifying temporal and structural credit assignment problems;Agogino;Proceedings of the third International Joint Conference on Autonomous Agents and Multiagent Systems-Volume,2004
4. Sutton R.S. , Temporal credit assignment in reinforcement learning, Ph.D. dissertation, University of Massachusetts –Amherst, 1984.
5. Q-learning;Watkins;Machine Learning,1992