1. On certain integrals of Lipschitz-Hankel type involving products of Bessel functions
2. T. P. Lillicrap, J. J. Hunt, A. Pritzel et al., “Continuous control with deep reinforcement learning,” arXiv:1509.02971, 2015.
3. Hierarchical automatic curriculum learning: Converting a sparse reward navigation task into dense reward
4. A data-efficient goal-directed deep reinforcement learning method for robot visuomotor skill
5. M. Fortunato, M. G. Azar, B. Piot, J. Menick, I. Osband, A. Graves, V. Mnih, R. Munos, D. Hassabis, O. Pietquin, C. Blundell, S. Legg, “Noisy networks for exploration,” arXiv:1706.10295, 2018.