1. Afsar, M.M., Crump, T., Far, B.: Reinforcement learning based recommender systems: a survey. ACM Comput. Surv. 55(7) (2022)
2. Bangari, S., et al.: A review on reinforcement learning based news recommendation systems and its challenges. In: Proceedings of 2021 International Conference on Artificial Intelligence and Smart Systems, pp. 260–265 (2021)
3. Barakat, A., Bianchi, P., Lehmann, J.: Analysis of a target-based actor-critic algorithm with linear function approximation. In: Proceedings of the 25th International Conference on Artificial Intelligence and Statistics, pp. 991–1040. PMLR (2022)
4. Fujimoto, S., et al.: Addressing function approximation error in actor-critic methods. In: Proceedings of the 35th International Conference on Machine Learning, vol. 80 (2018)
5. Haarnoja, T., et al.: Soft actor-critic: off-policy maximum entropy deep reinforcement learning with a stochastic actor. In: Proceedings of the 35th International Conference on Machine Learning, vol. 80 (2018)