1. Andre, D., Friedman, N., Parr, R.: Generalized prioritized sweeping. Advances in Neural Information Processing Systems (1998)
2. Andre, D., Russell, S.J.: State abstraction for programmable reinforcement learning agents. In: AAAI/IAAI, pp. 119–125 (2002)
3. Assaf, D., Shared, M., Shanthikumar, J.G.: First-passage times with PFr densities. J. Appl. Probab. 22(1), 185–196 (1985)
4. Barto, A.G., Bradtke, S.J., Singh, S.P.: Learning to act using real-time dynamic programming. Artif. Intell. 72(1–2), 81–138 (1995)
5. Bertsekas, D.P.: Dynamic Programming: Deterministic and Stochastic Models. Prentice-Hall, Englewood Cliffs (1987)