1. Yasin Abbasi-Yadkori, Dávid Pál, and Csaba Szepesvári. 2011. Improved algorithms for linear stochastic bandits. In Advances in Neural Information Processing Systems. 2312–2320.
2. Peter Auer. 2002. Using confidence bounds for exploitation-exploration trade-offs. Journal of Machine Learning Research 3.
3. Generic Outlier Detection in Multi-Armed Bandit