1. Survey of research on deep reinforcement learning;yang;Computer Engineering,2021
2. Proximal policy optimization algorithms;schulman;CoRR,2017
3. Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation;wu;International Conference on Neural Information Processing Systems,0