Funder
National Science Foundation