Author:
Peters Jan,Vijayakumar Sethu,Schaal Stefan
Publisher
Springer Berlin Heidelberg
Reference15 articles.
1. Amari, S.: Natural gradient works efficiently in learning. Neural Computation 10, 251–276 (1998)
2. Bagnell, J., Schneider, J.: Covariant policy search. In: International Joint Conference on Artificial Intelligence (2003)
3. Baird, L.C.: Advantage Updating. Wright Lab. Tech. Rep. WL-TR-93-1146 (1993)
4. Baird, L.C., Moore, A.W.: Gradient descent for general reinforcement learning. In: Advances in Neural Information Processing Systems 11 (1999)
5. Lecture Notes in Artificial Intelligence;P. Bartlett,2003
Cited by
104 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献