1. Bai, A., Wu, F., Chen, X.: Online planning for large MDPs with MAXQ decomposition (extended abstract). In: Proc. of 11th Int. Conf. on Autonomous Agents and Multiagent Systems, Valencia, Spain (June 2012)
2. Barry, J.: Fast Approximate Hierarchical Solution of MDPs. Ph.D. thesis, Massachusetts Institute of Technology (2009)
3. Barry, J., Kaelbling, L., Lozano-Perez, T.: DetH*: Approximate hierarchical solution of large Markov decision processes. In: International Joint Conference on Artificial Intelligence, pp. 1928–1935 (2011)
4. Dietterich, T.G.: Hierarchical reinforcement learning with the MAXQ value function decomposition. Journal of Artificial Intelligence Research 13, 227–303 (2000)
5. LNAI;T. Gabel,2011