1. Improved algorithms for linear stochastic bandits;Yasin Abbasi-Yadkori;Advances in Neural Information Processing Systems,2011
2. Repeated contextual auctions with strategic buyers;Kareem Amin;Advances in Neural Information Processing Systems,2014
3. Dynamic pricing for nonperishable products with demand learning;Victor F Araman;Operations Research,2009
4. Using confidence bounds for exploitation-exploration trade-offs;Peter Auer;Journal of Machine Learning Research,2003
5. The nonstochastic multiarmed bandit problem;Peter Auer;SIAM Journal on Computing,2002