1. Continuous control with deep reinforce-ment learning;lillicrap;ArXiv Preprint,2019
2. Hindsight experience replay;andrychowicz;In Proceedings of the 2017 Advances in Neural Information Processing Systems,0
3. Proximal policy optimization algorithms;schulman;CoRR,2017
4. Deep learning using rectified linear units (relu);agarap;CoRR,2018