1. Yasin Abbasi-yadkori Dávid Pál and Csaba Szepesvári. 2011. Improved Algorithms for Linear Stochastic Bandits. In NIPS. Yasin Abbasi-yadkori Dávid Pál and Csaba Szepesvári. 2011. Improved Algorithms for Linear Stochastic Bandits. In NIPS.
2. Jacob Abernethy Chansoo Lee Audra McMillan and Ambuj Tewari. 2017. Online learning via differential privacy. arXiv preprint arXiv:1711.10019(2017). Jacob Abernethy Chansoo Lee Audra McMillan and Ambuj Tewari. 2017. Online learning via differential privacy. arXiv preprint arXiv:1711.10019(2017).
3. Using Confidence Bounds for Exploitation-Exploration Trade-offs;Auer Peter;Journal of Machine Learning Research,2002