Author:
Khajuria Rishi,Quyoom Abdul,Sarwar Abid
Publisher
Korea Multimedia Society - English Version Journal
Reference52 articles.
1. Kaelbling, Leslie Pack, Michael L. Littman, and Andrew W. Moore.
“Reinforcement learning: A survey,” Journal of artificial
intelligence research, vol. 4, pp. 237-285, 1996. 10.1613/jair.301
2. Saunders, William, et al. “Trial without error: Towards safe
reinforcement learning via human intervention,” in Proceedings of
the 17th International Conference on Autonomous Agents and MultiAgent
Systems. International Foundation for Autonomous Agents and
Multiagent Systems, pp. 2067-2069, 2018.
3. Bellman, Richard. “A Markovian decision process.”
Journal of mathematics and mechanics, pp. 679-684, 1957.
10.1512/iumj.1957.6.56038
4. Beard, Randal W., George N. Saridis, and John T. Wen.
“Galerkin approximations of the generalized Hamilton-Jacobi-Bellman
equation,” Automatica, vol. 33, no. 12, pp. 2159-2177,
1997. 10.1016/S0005-1098(97)00128-3
5. Busoniu, Lucian et al., Reinforcement learning and dynamic
programming using function approximators, CRC press, 2017. 10.1201/9781439821091
Cited by
4 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献