1. Shipra Agrawal and Navin Goyal . 2012 . Analysis of Thompson Sampling for the Multi-armed Bandit Problem . In Proceedings of the 25th Annual Conference on Learning Theory. 39 .1--39.26. Shipra Agrawal and Navin Goyal. 2012. Analysis of Thompson Sampling for the Multi-armed Bandit Problem. In Proceedings of the 25th Annual Conference on Learning Theory. 39.1--39.26.
2. Shipra Agrawal and Navin Goyal . 2013 a. Further Optimal Regret Bounds for Thompson Sampling . In Proceedings of the Sixteenth International Conference on Artificial Intelligence and Statistics. 99--107 . Shipra Agrawal and Navin Goyal. 2013a. Further Optimal Regret Bounds for Thompson Sampling. In Proceedings of the Sixteenth International Conference on Artificial Intelligence and Statistics. 99--107.
3. Shipra Agrawal and Navin Goyal . 2013 b. Thompson Sampling for Contextual Bandits with Linear Payoffs . In Proceedings of the 30th International Conference on Machine Learning. 127--135 . Shipra Agrawal and Navin Goyal. 2013b. Thompson Sampling for Contextual Bandits with Linear Payoffs. In Proceedings of the 30th International Conference on Machine Learning. 127--135.
4. Near-Optimal Regret Bounds for Thompson Sampling