1. Albus, J. S., (1981). Brain, Behavior, and Robotics, chapter 6, pages 139–179, Byte Books.
2. Baase, S., (1988). Computer Algorithms: Introduction to design and analysis. Reading, MA: Addison-Wesley.
3. Barnard, E., (1993). Temporal-difference methods and Markov models. IEEE Transactions on Systems, Man, and Cybernetics, 23(2), 357–365.
4. Barto, A. G. & Duff, M., (1994). Monte Carlo matrix inversion and reinforcement learning. In Advances in Neural Information Processing Systems 6, pages 687–694, San Mateo, CA, Morgan Kaufmann.
5. Barto, A. G., Sutton, R. S., & Anderson, C. W., (1983). Neuronlike elements that can solve difficult learning control problems. IEEE Trans. on Systems, Man, and Cybernetics, 13, 835–846.