1. . Reinforcement learning: An introduction. Cambridge, MA: MIT Press, 1998.
2. Reinforcement Learning: A Survey
3. Learning from delayed rewards. PhD thesis, Cambridge University, Cambridge, England, 1989.
4. Dynamic programming. Princeton, NJ. Princeton University Press; 1957.