Author:
Guéneron Josselin,Bonnet Grégory
Publisher
Springer Nature Switzerland
Reference17 articles.
1. Agrawal, R.: Sample mean based index policies by $$\cal{O} (\log n)$$ regret for the multi-armed bandit problem. Adv. Appl. Probab. 27(4), 1054–1078 (1995)
2. Audibert, J., Munos, R., Szepesvári, C.: Exploration-exploitation tradeoff using variance estimates in multi-armed bandits. Theor. Comput. Sci. 410(19), 1876–1902 (2009)
3. Auer, P., Cesa-Bianchi, N., Fischer, P.: Finite-time analysis of the multiarmed bandit problem. Mach. Learn. 47(2), 235–256 (2002)
4. Benoit, J.P., Krishna, V.: Finitely repeated games. In: Foundations in Microeconomic Theory, pp. 195–212 (1984)
5. Blankenburg, B., Dash, R.K., Ramchurn, S.D., Klusch, M., Jennings, N.R.: Trusted kernel-based coalition formation. In: 4th AAMAS, pp. 989–996 (2005)