Subject
Artificial Intelligence,Cognitive Neuroscience,Computer Science Applications
Reference27 articles.
1. Sample mean based index policies with O(log n) regret for the multi-armed bandit problem;Agrawal;Adv. Appl. Probab.,1995
2. Proceedings of the 24th International Conference on Neural Information Processing Systems;Agarwal,2011
3. Analysis of thompson sampling for the multi-armed bandit problem;Agrawal,2012
4. Bandit-based local feature subset selection;Ashtiani;Neurocomputing,2014
5. Finite-time analysis of the multiarmed bandit problem;Auer;Mach. Learn.,2002
Cited by
6 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献