1. Yasin Abbasi-Yadkori David Pal and Csaba Szepesvari. 2011. Improved Algorithms for Linear Stochastic Bandits. In Advances in Neural Information Processing Systems 24. 2312--2320. Yasin Abbasi-Yadkori David Pal and Csaba Szepesvari. 2011. Improved Algorithms for Linear Stochastic Bandits. In Advances in Neural Information Processing Systems 24. 2312--2320.
2. Linear Thompson sampling revisited
3. Shipra Agrawal and Navin Goyal . 2012 . Analysis of Thompson Sampling for the Multi-Armed Bandit Problem . In Proceeding of the 25th Annual Conference on Learning Theory. 39 .1--39.26. Shipra Agrawal and Navin Goyal. 2012. Analysis of Thompson Sampling for the Multi-Armed Bandit Problem. In Proceeding of the 25th Annual Conference on Learning Theory. 39.1--39.26.
4. Shipra Agrawal and Navin Goyal . 2013 . Thompson Sampling for Contextual Bandits with Linear Payoffs . In Proceedings of the 30th International Conference on Machine Learning. 127--135 . Shipra Agrawal and Navin Goyal. 2013. Thompson Sampling for Contextual Bandits with Linear Payoffs. In Proceedings of the 30th International Conference on Machine Learning. 127--135.