1. Yasin Abbasi-Yadkori, Peter Bartlett, Victor Gabillon, Alan Malek, and Michal Valko. 2018. Best of both worlds: Stochastic & adversarial best-arm identification. In Conference on Learning Theory. 918--949.
2. Jean-Yves Audibert, Sébastien Bubeck, and Rémi Munos. 2010. Best arm identification in multi-armed bandits. In Conference on Learning Theory. 41--53.
3. Rémy Degenne, Thomas Nedelec, Clement Calauzenes, and Vianney Perchet. 2019. Bridging the gap between regret minimization and best arm identification, with application to A/B tests. In International Conference on Artificial Intelligence and Statistics. 1988--1996.
4. Nina Deliu, Joseph J Williams, and Sofia S Villar. 2021. Efficient inference without trading-off regret in bandits: An allocation probability test for Thompson sampling. arXiv preprint arXiv:2111.00137 (2021).
5. Yash Deshpande, Lester Mackey, Vasilis Syrgkanis, and Matt Taddy. 2018. Accurate inference for adaptive linear models. In International Conference on Machine Learning. 1194--1203.