Publisher
Springer Science and Business Media LLC
Subject
Computer Science Applications, Control and Systems Engineering
Reference
45 articles.
1. J. Abounadi, D. Bertsekas, and V. Borkar, “Learning algorithms for Markov decision processes with average cost,” SIAM Journal on Control and Optimization, vol. 40, pp. 681–698, 2001.
2. E. Altman, Constrained Markov Decision Processes, CRC Press, Boca Raton, 1998.
3. J. Baxter and P. Bartlett, “Infinite-horizon policy-gradient estimation,” Journal of Artificial Intelligence Research, vol. 15, pp. 319–350, 2001.
4. D. P. Bertsekas and J. Tsitsiklis, Neuro-Dynamic Programming, Athena Scientific, Belmont, 1996.
5. D. P. Bertsekas, Dynamic Programming and Optimal Control, 2nd edition, Athena Scientific, Belmont, 2000.
Cited by
11 articles.