1. C.G. Atkeson, Using local trajectory optimizers to speed up global optimization in dynamic programming, in: J.E. Hanson, S.J. Moody, R.P. Lippmann (Eds.), Advances in Neural Information Processing Systems, vol. 6, Morgan Kaufmann, Los Altos, CA, 1994, pp. 503–521.
2. C.G. Atkeson, J.C. Santamaría, A comparison of direct and model-based reinforcement learning, in: Proceedings of the International Conference on Robotics and Automation, 1997.
3. Robot learning from demonstration;Atkeson,1997
4. Dynamic Programming;Bellman,1957
5. D.P. Bertsekas, Dynamic Programming and Optimal Control, Optimization and Computation Series, vol. 1, third ed., Athena Scientific, Belmont, MA, USA, 2005.