1. Reinforcement Learning based Recommender Systems: A Survey
2. Top-K Off-Policy Correction for a REINFORCE Recommender System
3. Xinshi Chen , Shuang Li , Hui Li , Shaohua Jiang , Yuan Qi , and Le Song . 2019 . Generative adversarial user model for reinforcement learning based recommendation system . In International Conference on Machine Learning. PMLR, 1052–1061 . Xinshi Chen, Shuang Li, Hui Li, Shaohua Jiang, Yuan Qi, and Le Song. 2019. Generative adversarial user model for reinforcement learning based recommendation system. In International Conference on Machine Learning. PMLR, 1052–1061.
4. Romain Deffayet Thibaut Thonet Jean-Michel Renders and Maarten de Rijke. 2023. Generative Slate Recommendation with Reinforcement Learning. (2023). Romain Deffayet Thibaut Thonet Jean-Michel Renders and Maarten de Rijke. 2023. Generative Slate Recommendation with Reinforcement Learning. (2023).
5. Gabriel Dulac-Arnold , Richard Evans , Hado van Hasselt , Peter Sunehag , Timothy Lillicrap , Jonathan Hunt , Timothy Mann , Theophane Weber , Thomas Degris , and Ben Coppin . 2015. Deep reinforcement learning in large discrete action spaces. arXiv preprint arXiv:1512.07679 ( 2015 ). Gabriel Dulac-Arnold, Richard Evans, Hado van Hasselt, Peter Sunehag, Timothy Lillicrap, Jonathan Hunt, Timothy Mann, Theophane Weber, Thomas Degris, and Ben Coppin. 2015. Deep reinforcement learning in large discrete action spaces. arXiv preprint arXiv:1512.07679 (2015).