Simple Bayesian Algorithms for Best-Arm Identification-Reference-Cited by-同舟云学术

Simple Bayesian Algorithms for Best-Arm Identification

Published:2020-11 Issue:6 Volume:68 Page:1625-1647
ISSN:0030-364X
Container-title:Operations Research
language:en
Short-container-title:Operations Research

Author:

Russo Daniel¹^ORCID

Affiliation:

1. Columbia University, New York, New York 10027

Abstract

This paper considers the optimal adaptive allocation of measurement effort for identifying the best among a finite set of options or designs. An experimenter sequentially chooses designs to measure and observes noisy signals of their quality with the goal of confidently identifying the best design after a small number of measurements. Just as the multiarmed bandit problem crystallizes the tradeoff between exploration and exploitation, this “pure exploration” variant crystallizes the challenge of rapidly gathering information before committing to a final decision. The paper proposes several simple Bayesian algorithms for allocating measurement effort and, by characterizing fundamental asymptotic limits on the performance of any algorithm, formalizes a sense in which these seemingly naive algorithms are the best possible.

Publisher

Institute for Operations Research and the Management Sciences (INFORMS)

Subject

Management Science and Operations Research,Computer Science Applications

Reference57 articles.

1. Agrawal S, Goyal N (2012) Analysis of Thompson sampling for the multi-armed bandit problem. Mannor S, Srebro N, Williamson RC, eds. Proc. 21st Annual Conf. Learning Theory, Proceedings of Machine Learning Research, vol. 23 (PMLR), 39.1–39.26.

2. The Sequential Design of Experiments for Infinitely Many States of Nature

3. Audibert JY, Bubeck S, Munos R (2010) Best arm identification in multi-armed bandits. Kalai AT, Mohri M, eds. COLT 23rd Conf. Learning Theory (Omnipress, Madison, WI), 41–53.

4. The consistency of posterior distributions in nonparametric problems

5. A Single-Sample Multiple Decision Procedure for Ranking Means of Normal Populations with known Variances

Cited by 47 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Feature Misspecification in Sequential Learning Problems;Management Science;2024-08-29

2. Simulation Budget Allocation for Improving Scheduling and Routing of Automated Guided Vehicles in Warehouse Management;Journal of the Operations Research Society of China;2024-07-29

3. Finding the optimal exploration-exploitation trade-off online through Bayesian risk estimation and minimization;Artificial Intelligence;2024-05

4. A Contextual Ranking and Selection Method for Personalized Medicine;Manufacturing & Service Operations Management;2024-01

5. Sourcing Under Correlated Uncertainty: An integrated estimation and learning approach;SSRN Electronic Journal;2024