1. Policy Gradient Methods for Robotics
2. Asynchronous Methods for Deep Reinforcement Learning;Mnih,2016
3. Hindsight Experience Replay;Andrychowicz;Advances in Neural Information Processing Systems,2017
4. Continuous Control with Deep Reinforcement Learning;Lillicrap,2016
5. Addressing Function Approximation Error in Actor-critic Methods;Fujimoto,2018