1. Shipra Agrawal and Navin Goyal. 2012. Analysis of Thompson Sampling for the Multi-armed Bandit Problem. In Proceedings of the 25th Annual Conference on Learning Theory (Proceedings of Machine Learning Research, Vol. 23), Shie Mannor, Nathan Srebro, and Robert C. Williamson (Eds.). PMLR, Edinburgh, Scotland, 39.1--39.26. https://proceedings.mlr.press/v23/agrawal12.html
2. Shipra Agrawal and Navin Goyal. 2013. Further Optimal Regret Bounds for Thompson Sampling. In Proceedings of the Sixteenth International Conference on Artificial Intelligence and Statistics (Proceedings of Machine Learning Research, Vol. 31), Carlos M. Carvalho and Pradeep Ravikumar (Eds.). PMLR, Scottsdale, Arizona, USA, 99--107. https://proceedings.mlr.press/v31/agrawal13a.html
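3. Alex Deng, Ulf Knoblich, and Jiannan Lu. 2018. Applying the Delta Method in Metric Analytics: A Practical Guide with Novel Ideas. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (KDD '18). ACM, London, United Kingdom.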
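4. Alex Deng, Ya Xu, Ron Kohavi, and Toby Walker. 2013. Improving the sensitivity of online controlled experiments by utilizing pre-experiment data. In Proceedings of the Sixth ACM International Conference on Web Search and Data Mining (WSDM '13). ACM, Rome, Italy.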