1. A survey of inverse reinforcement learning: Challenges, methods and progress;Arora;Artificial Intelligence,2021
2. The arcade learning environment: An evaluation platform for general agents;Bellemare;Journal of Artificial Intelligence Research,2013
3. Berner C. , Brockman G. , Chan B. , Cheung V. , Dębiak P. , Dennison C., Farhi D., Fischer Q., Hashme S., Hesse C., et al., Dota 2 with large scale deep reinforcement learning, arXivpreprint arXiv:1912.06680, 2019.
4. Improved heuristics for optimal pathfinding on game maps;Bjornsson;Proceedings of the AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment,2006
5. Brockman G. , Cheung V. , Pettersson L. , Schneider J. , Schulman J. , Tang J. , Zaremba W. , Openai gym, 2016.