1. Taming the monster: a fast and simple algorithm for contextual bandits;Agarwal,2014
2. Does observation influence learning?;Armantier;Games Econ. Behav.,2004
3. Notes on Expectations Equilibria in Bayesian Settings;Arrow,1973
4. Using confidence bounds for exploitation-exploration trade-offs;Auer;J. Mach. Learn. Res.,2002
5. Influence of social reinforcement and the behavior of models in shaping children’s judgment;Bandura;J. Abnormal Social Psychol.,1963