1. Agrawal, S., & Devanur, N. (2016). Linear contextual bandits with knapsacks. Advances in Neural Information Processing Systems, 29.
2. Amani, S., Alizadeh, M., & Thrampoulidis, C. (2019). Linear stochastic bandits under safety constraints. Advances in Neural Information Processing Systems, 32.
3. Badanidiyuru, A., Kleinberg, R., & Slivkins, A. (2018). Bandits with knapsacks. Journal of the ACM (JACM), 65(3), 1–55.
4. Bhat, S. P., & Prashanth, L. A. (2019). Concentration of risk measures: A wasserstein distance approach. In Advances in neural information processing systems (pp. 11739–11748).
5. Bouneffouf, D., & Rish, I. (2019). A survey on practical applications of multi-armed and contextual bandits. CoRR arXiv:1904.10040.