1. On Markovian decision models with a finite skeleton
2. Playing Atari with deep reinforcement learning;mnih,2013
3. Deterministic policy gradient algorithms;silver;Proceedings of the 31st International Conference on Machine Learning,2014
4. Continuous control with deep reinforcement learning;lillicrap,2015
5. Actor-critic algorithms[C]//Advances in neural information processing systems;r,2000