1. Improving aggregate recommendation diversity using ranking-based techniques;Adomavicius;IEEE Transactions on Knowledge and Data Engineering,2011
2. Thompson sampling for contextual bandits with linear payoffs;Agrawal,2013
3. UCB revisited: Improved regret bounds for the stochastic multi-armed bandit problem;Auer;Periodica Mathematica Hungarica,2010
4. Using contextual bandits with behavioral constraints for constrained online movie recommendation;Balakrishnan,2018
5. Counterfactual reasoning and learning systems: The example of computational advertising;Bottou;Journal of Machine Learning Research,2013