1. Albus, J. S. (1981). Brains, behavior, and robotics. Peterborough, NH: Byte Books.
2. Anderson, C. W. (1986). Learning and problem solving with multilayer connectionist systems. Ph.D. thesis, University of Massachusetts, Amherst, MA.
3. Baird, L., & Moore, A. (1999). Gradient descent for general reinforcement learning. In Advances in Neural Information Processing Systems (Vol. 11). Cambridge, MA: MIT Press.
4. Bakker, B. (2002). Reinforcement learning with long short-term memory. In Advances in Neural Information Processing Systems (Vol. 14, pp. 1475–1482).
5. Barto, A., & Duff, M. (1994). Monte Carlo matrix inversion and reinforcement learning. In Advances in Neural Information Processing Systems (Vol. 6, pp. 687–694).