1. Attias, H. (2003). Planning by probabilistic inference. In Proceedings of the International Conference on Artificial Intelligence and Statistics.
2. Infinite-horizon policy-gradient estimation.;J.Baxter;Journal of Artificial Intelligence Research,2001
3. Cooper, G. (1988). A method for using belief networks as influence diagrams. In Proceedings of the Conference on Uncertainty in Artificial Intelligence.
4. Solving Semi-Markov Decision Problems Using Average Reward Reinforcement Learning