Author:
Jain Rahul, Varaiya Pravin
Subject
Electrical and Electronic Engineering, Control and Systems Engineering
Reference
25 articles.
1. Scale-sensitive dimension, uniform convergence and learnability;Alon;Journal of the ACM,1997
2. Rate of convergence of empirical measures and costs in controlled Markov chains and transient optimality;Altman;Mathematics of Operations Research,1994
3. Neural network learning: theoretical foundations;Anthony,1999
4. Sample complexity of policy search with known dynamics;Bartlett,2007
5. Infinite-horizon policy-gradient estimation;Baxter;Journal of Artificial Intelligence Research,2001
Cited by
7 articles.