Publisher
Springer Nature Singapore
Reference31 articles.
1. Szepesvt’ari, C.: Reinforcement Learning Algorithms for MDPs. Wiley Encyclopedia of Operations Research and Management Science (2011)
2. Sutton, R.S., Barto, A.G.: Reinforcement learning: an introduction (2018)
3. Watkins, C.J., Dayan, P.: Q-learning. Mach. Learn. 8(3–4), 279–292 (1992)
4. Tsitsiklis, J.N.: Asynchronous stochastic approximation and Q-learning. Mach. Learn. 16(3), 185–202 (1994)
5. Jaakkola, T., Jordan, M.I., Singh, S.P.: On the convergence of stochastic iterative dynamic programming algorithms. Neural Comput. 6(6), 1185–1201 (2014)