1. Multi-sided exposure bias in recommendation;Abdollahpouri,2020
2. Reinforcement learning based recommender systems: A survey;Afsar,2021
3. Using confidence bounds for exploitation-exploration trade-offs;Auer;J. Mach. Learn. Res.,2002
4. Finite-time analysis of the multiarmed bandit problem;Auer;Mach. Learn.,2002
5. Model-based reinforcement learning with adversarial training for online recommendation;Bai,2019