1. V. Mnih, et al., Human-level control through deep reinforcement learning, Nature, 2015.
2. M. Hessel, J. Modayil, H. van Hasselt, T. Schaul, G. Ostrovski, W. Dabney, D. Horgan, B. Piot, M.G. Azar, D. Silver, Rainbow: Combining improvements in deep reinforcement learning, in: Thirty-Second AAAI Conference on Artificial Intelligence, 2018, pp. 3215–3222.
3. M.G. Bellemare, W. Dabney, R. Munos, A distributional perspective on reinforcement learning, in: 34th International Conference on Machine Learning, 2017, pp. 449–458.
4. W. Dabney, M. Rowland, M.G. Bellemare, R. Munos, Distributional reinforcement learning with quantile regression, in: Thirty-Second AAAI Conference on Artificial Intelligence, 2018, pp. 2892–2901.
5. T.D. Kulkarni, et al., Hierarchical deep reinforcement learning: Integrating temporal abstraction and intrinsic motivation, 2016.