1. R. Bellman. Dynamic Programming. Princeton University Press, New Jersey, 1957.
2. D. P. Bertsekas. Dynamic Programming and Optimal Control, Vol. (I) and (II). Athena Scientific, Belmont, Massachusetts, 1995. Volumes 1 and 2.
3. R. I. Brafman and M. Tennenholtz. A near-optimal polynomial time algorithm for learning in certain classes of stochastic games. Artificial Intelligence, 121(1–2):31–47, 2000.
4. J. L. Doob. Stochastic Processes. John Wiley & Sons, New York, 1953.
5. M. Hutter. A theory of universal artificial intelligence based on algorithmic complexity. Technical Report cs.AI/0004001, 62 pages, 2000.
http://arxiv.org/abs/cs.AI/0004001
.