Author:
Lee Donghun, Powell Warren B.
Publisher
Springer International Publishing
References: 22 articles.
1. Besson, L.: SMPyBandits: an open-source research framework for single and multi-players multi-arms bandits (MAB) algorithms in python. GitHub.com/SMPyBandits/SMPyBandits (2018)
2. Cesa-Bianchi, N., Gentile, C., Lugosi, G., Neu, G.: Boltzmann exploration done right. Adv. Neural Inf. Process. Syst. 30 (2017)
3. Chen, S., Reyes, K.R.G., Gupta, M.K., McAlpine, M.C., Powell, W.B.: Optimal learning in experimental design using the knowledge gradient policy with application to characterizing nanoemulsion stability. SIAM/ASA J. Uncertain. Quant. 3(1), 320–345 (2015)
4. Frazier, P., Powell, W.: The Knowledge Gradient Policy for Offline Learning with Independent Normal Rewards (2007)
5. Frazier, P., Powell, W., Dayanik, S.: The knowledge-gradient policy for correlated normal beliefs. INFORMS J. Comput. 21(4), 599–613 (2009)
Cited by
1 article.