1. Human-level control through deep reinforcement learning;Mnih;Nature,2015
2. Mastering the game of go with deep neural networks and tree search;Silver;Nature,2016
3. T.P. Lillicrap, J.J. Hunt, A. Pritzel, N. Heess, T. Erez, Y. Tassa, D. Silver, D. Wierstra, Continuous control with deep reinforcement learning, arXiv preprint arXiv:1509.02971 (2015).
4. A. Ilyas, L. Engstrom, S. Santurkar, D. Tsipras, F. Janoos, L. Rudolph, A. Madry, A closer look at deep policy gradients, arXiv preprint arXiv:1811.02553 (2018).
5. Parallel exploration via negatively correlated search;Yang;Frontiers of Computer Science,2021