Author:
Li Xiao,Li Yuqiang,Wu Xianyi
Funder
National Natural Science Foundation of China
National Key Research and Development Program of China
Subject
Applied Mathematics,Computational Theory and Mathematics,Computational Mathematics,Statistics and Probability
Reference67 articles.
1. Forced-exploration based algorithms for playing in bandits with large action sets;Abbasi-Yadkori,2009
2. Forced-exploration based algorithms for playing in bandits with large action sets;Abbasi-Yadkori,2009
3. Sample mean based index policies with O(log n) regret for the multi-armed bandit problem;Agrawal;Adv. Appl. Probab.,1995
4. Sequential medical trials;Anscombe;J. Am. Stat. Assoc.,1963
5. Finite-time analysis of the multi-armed bandit problem;Auer;Mach. Learn.,2002