1. Agrawal S, Goyal N (2012) Analysis of Thompson sampling for the multi-armed bandit problem. Mannor S, Srebro N, Williamson RC, eds. Proc. 21st Annual Conf. Learning Theory, Proceedings of Machine Learning Research, vol. 23 (PMLR), 39.1–39.26.
2. The Sequential Design of Experiments for Infinitely Many States of Nature
3. Audibert JY, Bubeck S, Munos R (2010) Best arm identification in multi-armed bandits. Kalai AT, Mohri M, eds. COLT 23rd Conf. Learning Theory (Omnipress, Madison, WI), 41–53.
4. The consistency of posterior distributions in nonparametric problems
5. A Single-Sample Multiple Decision Procedure for Ranking Means of Normal Populations with known Variances