Author:
Čunko Krešimir,Vuković Marin,Jevtić Dragan
Publisher
Springer International Publishing
Reference7 articles.
1. Sutton, R.S., Barto, A.G.: Reinforcement Learning – An Introduction. MIT Press, Cambridge (1998)
2. Watkins, C.J.C.H., Dayan, P.: Q-learning. Mach. Learn. 8, 55–68 (1992)
3. Bertsekas, D.P., Tsitsiklis, J.N.: Neuro-Dynamic Programming. Athena Scientific, MIT, Belmont (1996)
4. Takadama, K., Fujita, H.: Toward guidelines for modeling learning agents in multiagent-based simulation: implications from Q-learning and Sarsa agents. In: MABS 2004, Conference Proceedings, pp. 159–172 (2004)
5. Farahnakian, F., Ebrahimi, M., Daneshtalab, M., Liljeberg, P., Plosila, J.: Q-learning based congestion-aware routing algorithm for on-chip network. IEEE Xplore (2011)