1. Trust region policy optimization;schulman;Int Conference on Machine Learning,2015
2. Improved deep reinforcement learning for indoor mobile robot path planning;cheng;Computer Engineering and Applications,2021
3. Learning Mobile Manipulation through Deep Reinforcement Learning
4. Asynchronous methods for deep reinforcement learning;mnih;International Conference on Machine Learning,2016