Subject
Electrical and Electronic Engineering, Artificial Intelligence, Control and Systems Engineering
Cited by
9 articles.