Authors:
Ronald C. van den Broek, Rik Litjens, Tobias Sagis, Nina Verbeeke, Pratik Gajane
Publisher:
Springer Nature Switzerland
References: 15 articles.
1. Arya, S., Yang, Y.: Randomized allocation with nonparametric estimation for contextual multi-armed bandits with delayed rewards. Stat. Probab. Lett. 164, 108818 (2020). https://doi.org/10.1016/j.spl.2020.108818
2. Auer, P., Cesa-Bianchi, N., Fischer, P.: Finite-time analysis of the multiarmed bandit problem. Mach. Learn. 47(2), 235–256 (2002). https://doi.org/10.1023/A:1013689704352
3. van den Broek, R.C., Litjens, R., Sagis, T., Siecker, L., Verbeeke, N., Gajane, P.: Multi-armed bandits with generalized temporally-partitioned rewards (2023). https://arxiv.org/abs/2303.00620
4. Brost, B., Mehrotra, R., Jehan, T.: The music streaming sessions dataset. CoRR abs/1901.09851 (2019). http://arxiv.org/abs/1901.09851
5. Bubeck, S., Cesa-Bianchi, N.: Regret analysis of stochastic and nonstochastic multi-armed bandit problems. Found. Trends Mach. Learn. 5(1), 1–122 (2012). https://doi.org/10.1561/2200000024