1. Alekh Agarwal, Sarah Bird, Markus Cozowicz, Luong Hoang, John Langford, Stephen Lee, Jiaji Li, Dan Melamed, Gal Oshri, Oswaldo Ribas, Siddhartha Sen, and Alex Slivkins. 2017. Making Contextual Decisions with Low Technical Debt. arXiv:1606.03966 [cs.LG]
2. Shipra Agrawal and Navin Goyal. 2012. Analysis of Thompson sampling for the multi-armed bandit problem. In Conference on Learning Theory. JMLR Workshop and Conference Proceedings, 39.1–39.26.
3. Peter Auer, Nicolo Cesa-Bianchi, and Paul Fischer. 2002. Finite-time analysis of the multiarmed bandit problem. Machine Learning 47 (2002), 235–256.
4. Peter Auer, Yifang Chen, Pratik Gajane, Chung-Wei Lee, Haipeng Luo, Ronald Ortner, and Chen-Yu Wei. 2019. Achieving optimal dynamic regret for non-stationary bandits without prior information. In Conference on Learning Theory. PMLR, 159–163.
5. Mark Burnette. 2020. Payment Card Industry Data Security Standard. https://www.lbmc.com/blog/pci-compliance-fees-fines-penalties/ Accessed on May 23, 2023.