1. Sutton, R.S.: Learning to predict by the methods of temporal differences. Machine Learning 3, 9–44 (1988)
2. Crites, R.H., Barto, A.G.: Improving Elevator Performance Using Reinforcement Learning, NIPS-8 (1996)
3. Tesauro, G.J.: Temporal difference learning and TD-Gammon. Communications of the ACM 38(3), 58–68 (1995)
4. Sutton, R.S.: Generalisation in reinforcement learning: Successful examples using sparse coarse coding. In: Touretzky, D.S., Mozer, M.C., Hasselmo, M.E. (eds.) Advances in Neural Information Processing Systems: Proceedings of the 1995 Conference, pp. 1038–1044. The MIT Press, Cambridge (1996)
5. Kretchmar, R.M., Anderson, C.W.: Comparison of CMACs and RBFs for local function approximators in reinforcement learning. IEEE International Conference on Neural Networks (1997)