Authors:
Tamatsukuri Akihiro, Takahashi Tatsuji
Subject
Applied Mathematics, General Biochemistry, Genetics and Molecular Biology, Modelling and Simulation, General Medicine, Statistics and Probability
References
24 articles.
1. Analysis of Thompson sampling for the multi-armed bandit problem; Agrawal; In: Proceedings of the 25th Annual Conference on Learning Theory, 39, 2012
2. Finite-time analysis of the multiarmed bandit problem; Auer; Mach. Learn., 2002
3. Satisficing: a ‘pretty good’ heuristic; Bendor; The B.E. J. Theor. Econ., 2009
4. Regret analysis of stochastic and nonstochastic multi-armed bandit problems; Bubeck; Found. Trends Mach. Learn., 2012
5. Bounded rationality, abstraction, and hierarchical decision-making: an information-theoretic optimality principle; Genewein; Front. Robot. AI, 2015
Cited by
7 articles.