1. Schaal, S., Atkeson, C.G.: Robot juggling: Implementation of memory-based learning. IEEE Control Systems 14, 57–71 (1994)
2. Wyatt, J.: Issues in putting reinforcement learning onto robots. In: 10th Biennial Conference of the AISB, Sheffield, UK (1995)
3. Iglesias, R.: Lecture Notes in Computer Science (1998)
4. Watkins, C.: Learning from Delayed Rewards. PhD thesis, Cambridge University (1989)
5. Bridle, J.S.: Training stochastic model recognition algorithms as networks can lead to maximum mutual information estimation of parameters. In: Touretzky, D. (ed.) Advances in Neural Information Processing Systems: Proc. 1989 Conf., pp. 211–217. Morgan Kaufmann, San Francisco (1990)