Author:
Zhong Yueyang,Birge John R.,Ward Amy
Subject
General Earth and Planetary Sciences,General Environmental Science
Reference64 articles.
1. Sample mean based index policies by o(log n) regret for the multi-armed bandit problem;R Agrawal;Advances in Applied Probability,1995
2. Analysis of Thompson sampling for the multi-armed bandit problem;S Agrawal;Conference on Learning Theory,2012
3. A survey of parameter and state estimation in queues;A Asanjarani;Queueing Systems,2021
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献