1. Using inaccurate models in reinforcement learning;Abbeel,2006
2. A comparison of direct and model-based reinforcement learning;Atkeson,1997
3. Covariant policy search;Bagnell,2003
4. Autonomous helicopter control using reinforcement learning policy search methods;Bagnell,2001
5. Reinforcement learning in POMDP's via direct gradient ascent;Baxter,2000