1. Analysis of thompson sampling for the multi-armed bandit problem;Shipra Agrawal;COLT,2012
2. Thompson sampling for contextual bandits with linear payoffs;Shipra Agrawal;30th International Conference on Machine Learning, ICML,2013
3. Thompson sampling for contextual bandits with linear payoffs;Shipra Agrawal;International Conference on Machine Learning,2013
4. Asymptotically efficient allocation rules for the multiarmed bandit problem with multiple plays-part i: I.i.d. rewards;Pravin Venkatachalam Anantharam;IEEE Transactions on Automatic Control,1987