Publisher: Springer Science and Business Media LLC
References (8 articles)
1. N. Alon and J. Spencer. (2008). The Probabilistic Method. Wiley-Interscience.
2. P. Auer, N. Cesa-Bianchi and P. Fischer. (2002). Finite-Time Analysis of the Multiarmed Bandit Problem. Machine Learning, 47, 235–256.
3. P. Auer, P. Gajane and R. Ortner. (2018). Adaptively Tracking the Best Arm with an Unknown Number of Distribution Changes. European Workshop on Reinforcement Learning, 14, 1–8.
4. O. Besbes, Y. Gur and A. Zeevi. (2014). Stochastic Multi-Armed-Bandit Problem with Nonstationary Rewards. Advances in Neural Information Processing Systems, 27, 199–207.
5. A. N. Burnetas and M. N. Katehakis. (1996). Optimal Adaptive Policies for Sequential Allocation Problems. Advances in Applied Mathematics, 17, 122–142.