Author:
Chen Zengjing,Feng Xinwei,Liu Shuhui,Yan Xiaodong
Subject
Artificial Intelligence,Cognitive Neuroscience,Computer Science Applications
Reference
24 articles.
1. Sample mean based index policies by O(log n) regret for the multi-armed bandit problem;Agrawal;Adv. Appl. Prob.,1995
2. Finite-time analysis of the multiarmed bandit problem;Auer;Mach. Learn.,2002
3. A Problem in the Sequential Design of Experiments;Bellman;Sankhyā,1956
4. A Bernoulli two-armed bandit;Berry;Ann. Math. Stat.,1972
5. Handbook of Brownian motion - facts and formulae;Borodin,2015
Cited by
1 article.
1. Strategic two-sample test via the two-armed bandit process;Journal of the Royal Statistical Society Series B: Statistical Methodology;2023-06-14