1. Mnih, V., et al.: Playing Atari with deep reinforcement learning, December 2013.
http://arxiv.org/abs/1312.5602
2. van Hasselt, H., Guez, A., Silver, D.: Deep reinforcement learning with double Q-learning, September 2015.
http://arxiv.org/abs/1509.06461
3. Bellemare, M.G., Dabney, W., Munos, R.: A distributional perspective on reinforcement learning, July 2017.
http://arxiv.org/abs/1707.06887
4. Fortunato, M., et al.: Noisy networks for exploration, June 2017.
http://arxiv.org/abs/1706.10295
5. Amari, S.I.: Natural gradient works efficiently in learning. Neural Comput. 10(2), 251–276 (1998).
https://doi.org/10.1162/089976698300017746