1. M. Hessel, J. Modayil, H. van Hasselt, T. Schaul, G. Ostrovski, W. Dabney, D. Horgan, B. Piot, M. G. Azar, D. Silver, Rainbow: Combining improvements in deep reinforcement learning, in: the 32nd AAAI Conference on Artificial Intelligence (AAAI 2018), Louisiana, USA, 2018, pp. 3215–3222.
2. W. Dabney, G. Ostrovski, D. Silver, R. Munos, Implicit quantile networks for distributional reinforcement learning, in: the 35th International Conference on Machine Learning (ICML 2018), Stockholm, Sweden, 2018, pp. 1104–1113.
3. S. Han, Y. Sung, Dimension-wise importance sampling weight clipping for sample-efficient reinforcement learning, in: the 36th International Conference on Machine Learning (ICML 2019), California, USA, 2019, pp. 2586–2595.
4. T. P. Lillicrap, J. J. Hunt, A. Pritzel, N. Heess, T. Erez, Y. Tassa, D. Silver, D. Wierstra, Continuous control with deep reinforcement learning, in: the 4th International Conference on Learning Representations (ICLR 2016), San Juan, Puerto Rico, 2016.
5. S. Fujimoto, H. van Hoof, D. Meger, Addressing function approximation error in actor-critic methods, in: the 35th International Conference on Machine Learning (ICML 2018), Stockholm, Sweden, 2018, pp. 1582–1591.