1. Policy invariance under reward transformations: Theory and application to reward shaping;ng;Proceedings of the Sixteenth International Conference on Machine Learning,1999
2. Dueling network architectures for deep reinforcement learning;wang;International Conference on Machine Learning,2016
3. Prioritized experience replay;schaul;ArXiv Preprint,2015
4. New potential functions for mobile robot path planning
5. Adam: A method for stochastic optimization;kingma;ArXiv Preprint,2014