1. Thompson sampling algorithms for mean-variance bandits;zhu;International Conference on Machine Learning,2020
2. Introduction to Multi-Armed Bandits
3. Risk-Sensitive Online Learning
4. Sample complexity of risk-averse bandit-arm selection;yu;IJCAI,2013
5. Best arm identification: A unified approach to fixed budget and fixed confidence;gabillon;NIPS,2012