Routine Bandits: Minimizing Regret on Recurring Problems-Reference-Cited by-同舟云学术

Routine Bandits: Minimizing Regret on Recurring Problems

Published:2021 Issue: Volume: Page:3-18
ISSN:0302-9743
Container-title:Machine Learning and Knowledge Discovery in Databases. Research Track
language:
Short-container-title:

Author:

Saber Hassan,Saci Léo,Maillard Odalric-Ambrym,Durand Audrey

Publisher

Springer International Publishing

Link

https://link.springer.com/content/pdf/10.1007/978-3-030-86486-6_1

Reference24 articles.

1. Abbasi-Yadkori, Y., Pál, D., Szepesvári, C.: Improved algorithms for linear stochastic bandits. In: Advances in Neural Information Processing Systems, pp. 2312–2320 (2011)

2. Agrawal, R., Teneketzis, D., Anantharam, V.: Asymptotically efficient adaptive allocation schemes for controlled IID processes: finite parameter space. IEEE Trans. Autom. Control 34(3) (1989)

3. Anandkumar, A., Ge, R., Hsu, D., Kakade, S.: A tensor spectral approach to learning mixed membership community models. In: Conference on Learning Theory, pp. 867–881. PMLR (2013)

4. Anandkumar, A., Ge, R., Hsu, D.J., Kakade, S.M., Telgarsky, M.: Tensor decompositions for learning latent variable models. J. Mach. Learn. Res. 15(1), 2773–2832 (2014)

5. Bubeck, S., Cesa-Bianchi, N.: Regret analysis of stochastic and nonstochastic multi-armed bandit problems. Foundations and Trends in Machine Learning abs/1204.5721 (2012). http://arxiv.org/abs/1204.5721

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Differentially Private Federated Combinatorial Bandits with Constraints;Machine Learning and Knowledge Discovery in Databases;2023