Pure Exploration in Multi-armed Bandits Problems-Reference-Cited by-同舟云学术

Pure Exploration in Multi-armed Bandits Problems

Published:2009 Issue: Volume: Page:23-37
ISSN:0302-9743
Container-title:Lecture Notes in Computer Science
language:
Short-container-title:

Author:

Bubeck Sébastien,Munos Rémi,Stoltz Gilles

Publisher

Springer Berlin Heidelberg

Link

http://link.springer.com/content/pdf/10.1007/978-3-642-04414-4_7

Reference15 articles.

1. Auer, P., Cesa-Bianchi, N., Fischer, P.: Finite-time analysis of the multiarmed bandit problem. Machine Learning Journal 47, 235–256 (2002)

2. Auer, P., Cesa-Bianchi, N., Freund, Y., Schapire, R.: The non-stochastic multi-armed bandit problem. SIAM Journal on Computing 32(1), 48–77 (2002)

3. Bubeck, S., Munos, R., Stoltz, G.: Pure exploration for multi-armed bandit problems. Technical report, HAL report hal-00257454 (2009), http://hal.archives-ouvertes.fr/hal-00257454/en

4. Bubeck, S., Munos, R., Stoltz, G., Szepesvari, C.: Online optimization in $\mathcal{X}$ –armed bandits. In: Advances in Neural Information Processing Systems, vol. 21 (2009)

5. Coquelin, P.-A., Munos, R.: Bandit algorithms for tree search. In: Proceedings of the 23rd Conference on Uncertainty in Artificial Intelligence (2007)

Cited by 168 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Online budget-limited pricing incentives for remote mobile sensing;Peer-to-Peer Networking and Applications;2024-07-05

2. Finding the optimal exploration-exploitation trade-off online through Bayesian risk estimation and minimization;Artificial Intelligence;2024-05

3. Gaussian process classification bandits;Pattern Recognition;2024-05

4. IEEE Transactions on Neural Networks and Learning Systems Special Issue on Causal Discovery and Causality-Inspired Machine Learning;IEEE Transactions on Neural Networks and Learning Systems;2024-04

5. Maximal Objectives in the Multiarmed Bandit with Applications;Management Science;2024-03-15