1. Shipra Agrawal and Navin Goyal. 2013. Thompson sampling for contextual bandits with linear payoffs. In International conference on machine learning. PMLR, 127–135.
2. The Exploration-Exploitation Trade-off in Interactive Recommender Systems
3. How algorithmic confounding in recommendation systems increases homogeneity and decreases utility
4. Using Exploration to Alleviate Closed Loop Effects in Recommender Systems
5. Addressing Cold Start in Recommender Systems with Hierarchical Graph Neural Networks