Author:
Bubeck Sébastien, Munos Rémi, Stoltz Gilles
Subject
General Computer Science, Theoretical Computer Science
References: 20 articles.
1. Audibert, Exploration–exploitation trade-off using variance estimates in multi-armed bandits, Theoretical Computer Science, 2009.
2. J.-Y. Audibert, S. Bubeck, R. Munos, Best arm identification in multi-armed bandits, in: Proceedings of the 23rd Annual Conference on Learning Theory, COLT, 2010.
3. Auer, Finite-time analysis of the multiarmed bandit problem, Machine Learning Journal, 2002.
4. Auer, The non-stochastic multi-armed bandit problem, SIAM Journal on Computing, 2002.
5. Billingsley, Convergence of Probability Measures, 1968.
Cited by
83 articles.