Publisher
Springer International Publishing
Reference19 articles.
1. Sutton, R.S., Barto, A.G., Williams, R.J.: Reinforcement learning is direct adaptive optimal control. IEEE Control Syst. Mag. 12(2), 19–22
2. Sutton, R.S., Barto, A.G.: Reinforcement learning: An introduction. MIT Press, Cambridge
3. Watkins CJCHLearning with delayed rewards. Ph. D. Thesis, University of Cambridge (1989)
4. Singh, S., Jaakkola, T., Littman, M., Szpesvari, C.: Convergence results for single step on-policy reinforcement learning algorithms. Machine Learning 38, 287–308 (2000)
5. Hagen, S.T., Kröse, B.: Neural Q-learning. Neural Comput. & Applic. 12, 81–88 (2003)