A Simple Distribution-Free Approach to the Max k-Armed Bandit Problem-Reference-Cited by-同舟云学术

A Simple Distribution-Free Approach to the Max k-Armed Bandit Problem

Published:2006 Issue: Volume: Page:560-574
ISSN:0302-9743
Container-title:Principles and Practice of Constraint Programming - CP 2006
language:
Short-container-title:

Author:

Streeter Matthew J.,Smith Stephen F.

Publisher

Springer Berlin Heidelberg

Link

http://link.springer.com/content/pdf/10.1007/11889205_40.pdf

Reference13 articles.

1. Auer, P., Cesa-Bianchi, N., Fischer, P.: Finite-time analysis of the multiarmed bandit problem. Machine Learning 47, 235–256 (2002a)

2. Auer, P., Cesa-Bianchi, N., Freund, Y., Schapire, R.E.: The nonstochastic multiarmed bandit problem. SIAM Journal on Computing 32(1), 48–77 (2002b)

3. Berry, D.A., Fristedt, B.: Bandit Problems: Sequential Allocation of Experiments. Chapman and Hall, London (1986)

4. Cicirello, V.A., Smith, S.F.: Heuristic selection for stochastic search optimization: Modeling solution quality by extreme value theory. In: Proceedings of the 10th International Conference on Principles and Practice of Constraint Programming, pp. 197–211 (2004)

5. Cicirello, V.A., Smith, S.F.: The max k-armed bandit: A new model of exploration applied to search heuristic selection. In: Proceedings of the Twentieth National Conference on Artificial Intelligence, pp. 1355–1361 (2005)

Cited by 11 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Extreme Bandits Using Robust Statistics;IEEE Transactions on Information Theory;2023-03

2. Max K-Armed Bandit: On the ExtremeHunter Algorithm and Beyond;Machine Learning and Knowledge Discovery in Databases;2017

3. Extreme Reactive Portfolio (XRP): Tuning an Algorithm Population for Global Optimization;Lecture Notes in Computer Science;2016

4. Online Black-Box Algorithm Portfolios for Continuous Optimization;Parallel Problem Solving from Nature – PPSN XIII;2014

5. BoostingTree: parallel selection of weak learners in boosting, with application to ranking;Machine Learning;2013-06-07