1. A Markovian decision process. Indiana University Mathematics Journal, 1957.
2. Recurrent attentional reinforcement learning for multi-label image recognition. Proceedings of the AAAI Conference on Artificial Intelligence, 2018.
3. A study on overfitting in deep reinforcement learning, 2018.
4. Haarnoja, T., et al. (2018), “Soft actor-critic algorithms and applications”, CoRR, arXiv:1812.05905 [cs, stat], available at: http://arxiv.org/abs/1812.05905 (accessed 3 September 2019).
5. Deep reinforcement learning with double Q-learning, 2016.