1. Ai, H., & Litman, D. J. (2007). Knowledge consistent user simulations for dialog systems. In Proceedings of the 8th Annual Conference of the International Speech Communication Association (INTERSPEECH’07), Antwerp.
2. Atrash, A., & Pineau, J. (2010). A Bayesian method for learning POMDP observation parameters for robot interaction management systems. In The POMDP Practitioners Workshop.
3. Bellman, R. (1957a). Dynamic programming. Princeton: Princeton University Press.
4. Bellman, R. (1957b). A Markovian decision process. Journal of Mathematics and Mechanics, 6(6), 679–684.
5. Bonet, B., & Geffner, H. (2003). Faster heuristic search algorithms for planning with uncertainty and full feedback. In Proceedings of the 18th International Joint Conference on Artificial Intelligence (IJCAI’03), Acapulco, Mexico.