1. Abbeel, P., & Ng, A. Y. (2004). Apprenticeship learning via inverse reinforcement learning. In Proceedings of the twenty-first international conference on machine learning (pp. 1–8).
2. Aytar, Y., Pfaff, T., Budden, D., Paine, T. L., Wang, Z., & Freitas, N. d. (2018). Playing hard exploration games by watching YouTube. In Proceedings of the 32nd international conference on neural information processing systems (pp. 2935–2945).
3. Never give up: Learning directed exploration strategies;Badia,2020
4. Badia, A. P., Sprechmann, P., Vitvitskyi, A., Guo, D., Piot, B., Kapturowski, S., et al. (2019). Never Give Up: Learning Directed Exploration Strategies. In International conference on learning representations.
5. Bellemare, M. G., Srinivasan, S., Ostrovski, G., Schaul, T., Saxton, D., & Munos, R. (2016). Unifying count-based exploration and intrinsic motivation. In Proceedings of the 30th international conference on neural information processing systems (pp. 1479–1487).