1. Using inaccurate models in reinforcement learning;Abbeel,2006
2. Recent advances in hierarchical reinforcement learning;Barto;Discrete Event Dyn. Syst.,2003
3. Robustness in the strategy of scientific model building;Box,1979
4. R-max – a general polynomial time algorithm for near-optimal reinforcement learning;Brafman;J. Mach. Learn. Res.,2003
5. Continual planning and acting in dynamic multiagent environments;Brenner;Auton. Agents Multi-Agent Syst.,2009