Author:
Baardman Lennart,Fata Elaheh,Pani Abhishek,Perakis Georgia
Reference31 articles.
1. Bandits with concave rewards and convex knapsacks;Shipra Agrawal;Proceedings of the fifteenth ACM conference on Economics and computation,2014
2. Using confidence bounds for exploitation-exploration trade-offs;Peter Auer;Journal of Machine Learning Research,2002
3. Finite-time analysis of the multiarmed bandit problem;Peter Auer;Machine learning,2002
4. Bandits with knapsacks;Ashwinkumar Badanidiyuru;Journal of the ACM,2018
Cited by
8 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献