1. Sutton, R., Barto, A.G.: Reinforcement Learning: An Introduction. MIT Press, Cambridge (1998)
2. Bertsekas, D.P., Tsitsiklis, J.N.: Neuro-Dynamic Programming. Athena Scientific, Belmont (1996)
3. Sutton, R.: Learning to Predict by the Methods of Temporal Difference. Mach. Learn. 3, 9–44 (1988)
4. Watkins, C., Dayan, P.: Q-learning. Mach. Learn. 8, 279–292 (1992)
5. Beom, H.R., Cho, H.S.: A Sensor-based Navigation for a Mobile Robot Using Fuzzy Logic and Reinforcement Learning. IEEE Trans. Syst. Man. Cyc. 25, 464–477 (1995)