1. Improved algorithms for linear stochastic bandits;Abbasi-Yadkori;Advances in Neural Information Processing Systems,2011
2. Abeille, M., Faury, L., & Calauzènes, C. (2021). Instance-wise minimax-optimal algorithms for logistic bandits. PMLR. International conference on artificial intelligence and statistics, 3691–3699
3. Agrawal, S., Avadhanula, V., Goyal, V., & Zeevi, A. (2017). Thompson sampling for the MNL-bandit. PMLR. Conference on learning theory, 76–78,
4. MNL-bandit: A dynamic learning approach to assortment selection;Agrawal;Operations Research,2019
5. An exact method for assortment optimization under the nested logit model;Alfandari;European Journal of Operational Research,2021