1. Albus, J.S.: A theory of cerebellar function. Mathematical Biosciences 10, 25–61 (1975)
2. Baird, L.: Residual algorithms: Reinforcement learning with function approximation. In: Prieditis, A., Russell, S. (eds.) Machine Learning: Proceedings of the Twelfth International Conference, pp. 30–37. Morgan Kaufmann Publishers, San Francisco (1995)
3. Bellman, R.: Dynamic Programming. Princeton University Press, Princeton (1957)
4. Boyan, J.A., Moore, A.W.: Generalization in reinforcement learning: Safely approximating the value function. In: Tesauro, G., Touretzky, D.S., Leen, T.K. (eds.) Advances in Neural Information Processing Systems, vol. 7, pp. 369–376. MIT Press, Cambridge (1995)
5. Gordon, G.J.: Stable function approximation in dynamic programming. Technical Report CMU-CS-95-103, Carnegie Mellon University (1995)