1. Barto, A. G., Sutton, R. S. & Anderson, C. W. (1983). Neuronlike elements that can solve difficult learning control problems.IEEE Transactions on Systems, Man. and Cybernetics 13: 835?846.
2. Cichosz, P. & Mulawka, J. J. (1995). Fast and efficient reinforcement learning with truncated temporal differences.Proceedings of the Twelfth International Conference on Machine Learning. 99?107.
3. Lin, L. J. (1992).Reinforcement learning for robots using neural networks. Ph.D. Dissertation, Carnegic Mellon University, PA.
4. Moore, A. W. & Atkeson, C. G. (1994). Prioritized sweeping: reinforcement learning with less data and less time.Machine Learning 13(1): 103?130.
5. UNSW-CSE-TR-9410;M. Pendrith,1994