1. Continuous control with deep rein-forcement learning;Lillicrap;4th International Conference on Learning Representations, ICLR 2016, San Juan, Puerto Rico, May 2–4, 2016, Conference Track Proceedings (Y. Bengio and Y. LeCun
2. Deterministic Policy Gradient Algorithms;Silver;Proceedings of the 31st International Conference on Machine Learning (E. P. Xing and T. Jebara
3. Approximate Newton Methods for Policy Search in Markov Decision Processes;Furmston;Journal of Machine Learning Research,2016