1. Ferran Alet, Martin F. Schneider, Tomás Lozano-Pérez, and Leslie Pack Kaelbling. 2020. Meta-learning curiosity algorithms.
2. Marcin Andrychowicz et al. 2017. Hindsight experience replay. In Advances in Neural Information Processing Systems.
3. Arthur Aubret, Laetitia Matignon, and Salima Hassas. 2019. A survey on intrinsic motivation in reinforcement learning. arXiv preprint arXiv:1908.06976.
4. Adrià Puigdomènech Badia, Pablo Sprechmann, Alex Vitvitskyi, Daniel Guo, Bilal Piot, Steven Kapturowski, Olivier Tieleman, Martin Arjovsky, Alexander Pritzel, Andrew Bolt, and Charles Blundell. 2020. Never give up: Learning directed exploration strategies. In International Conference on Learning Representations. https://openreview.net/forum?id=Sye57xStvB.
5. Ronen I. Brafman and Moshe Tennenholtz. 2002. R-max - a general polynomial time algorithm for near-optimal reinforcement learning. Journal of Machine Learning Research.