1. Improved algorithms for linear stochastic bandits;Y Abbasi-Yadkori;25th Advances in Neural Information Processing Systems (NIPS),2011
2. Analysis of Thompson Sampling for the multi-armed bandit problem;S Agrawal;25th Conf. on Learning Theory (COLT),2012
3. Thompson sampling for contextual bandits with linear payoffs;S Agrawal;30th Intl. Conf. on Machine Learning (ICML),2013
4. Majorization relations for Hadamard products;T Ando;Linear Algebra and its Applications,1995