1. Reinforcement Learning: An Introduction
2. An Introduction to Deep
Reinforcement Learning
3. Policy gradient methods for reinforcement learning with function approximation;sutton;Ad- vances in neural information processing systems,2000
4. Sample Efficient Actor-Critic with Experience Replay;wang;ArXiv,2017
5. Trust R egion Policy Optimization;schulman;ArXiv,2017