1. Response Prediction for Low-Regret Agents
2. Peter Auer , Nicolo Cesa-Bianchi , and Paul Fischer . 2002. Finite-time analysis of the multiarmed bandit problem. Machine learning 47, 2 ( 2002 ), 235–256. Peter Auer, Nicolo Cesa-Bianchi, and Paul Fischer. 2002. Finite-time analysis of the multiarmed bandit problem. Machine learning 47, 2 (2002), 235–256.
3. Adaptive and Self-Confident On-Line Learning Algorithms
4. James P Bailey Sai Ganesh Nagarajan and Georgios Piliouras. 2021. Stochastic Multiplicative Weights Updates in Zero-Sum Games. arXiv preprint arXiv:2110.02134(2021). James P Bailey Sai Ganesh Nagarajan and Georgios Piliouras. 2021. Stochastic Multiplicative Weights Updates in Zero-Sum Games. arXiv preprint arXiv:2110.02134(2021).