1. Watkins, C.J.C.H., Dayan, P.: Q-learning. Machine Learning 8, 55–68 (1992)
2. Sutton, R.S., Barto, A.G.: Reinforcement Learning – An Introduction. MIT Press, Cambridge (1998)
3. Bertsekas, D.P., Tsitsiklis, J.N.: Neuro-Dynamic Programming. Athena Scientific, MIT, Belmont, Massachusetts (1996)
4. Atlasis, A.F., Vasilakos, A.V.: The Use of Reinforcement Learning Algorithms in Traffic Control of High Speed Networks. In: Proceedings European Symposium on Intelligent Techniques, Aachen, Germany, pp. 283–288 (2000)
5. Marbach, P., Mihatsch, O., Tsitsiklis, J.N.: Call Admission Control and Routing in Integrated Service Networks Using Neuro-Dynamic Programming. IEEE Journal on Selected Areas in Communications 18(2), 197–208 (2000)