1. Aberdeen, D., Buffet, O., & Thomas, O. (2007). Policy-gradients for psrs and pomdps. In Artificial intelligence and statistics (pp. 3–10).
2. An introduction to MCMC for machine learning;Andrieu;Machine Learning,2003
3. Barto, A.G., Singh, S., & Chentanez, N. (2004). Intrinsically motivated learning of hierarchical collections of skills. In Proceedings of the 3rd international conference on development and learning (pp. 112–19).
4. Bassino, F., David, J., & Nicaud, C. (2009). On the average complexity of moore’s state minimization algorithm. In 26th International symposium on theoretical aspects of computer science STACS 2009 (pp. 123–134). IBFI Schloss Dagstuhl.
5. A model of inductive bias learning;Baxter;Journal of Artificial Intelligence Research (JAIR),2000