Author:
Bortko Kamil,Bartków Piotr,Jankowski Jarosław
Reference25 articles.
1. Best arm identification in multi-armed bandits;Audibert,2010
2. Exploration–exploitation tradeof using variance estimates in multi-armed bandits;Audibert;Theoretical Computer Science,2009
3. Using confidence bounds for exploitation-exploration trade-ofs;Auer;Journal of Machine Learning Research,2002
4. Finite-time analysis of the multiarmed bandit problem;Auer;Machine learning,2002
5. The nonstochastic multiarmed bandit problem;Auer;SIAM journal on computing,2002