Multinomial Thompson sampling for rating scales and prior considerations for calibrating uncertainty-Reference-Cited by-同舟云学术

Multinomial Thompson sampling for rating scales and prior considerations for calibrating uncertainty

Published:2023-12-06 Issue: Volume: Page:
ISSN:1618-2510
Container-title:Statistical Methods & Applications
language:en
Short-container-title:Stat Methods Appl

Author:

Deliu Nina^ORCID

Abstract

AbstractBandit algorithms such as Thompson sampling (TS) have been put forth for decades as useful tools for conducting adaptively-randomised experiments. By skewing the allocation toward superior arms, they can substantially improve particular outcomes of interest for both participants and investigators. For example, they may use participants’ ratings for continuously optimising their experience with a program. However, most of the bandit and TS variants are based on either binary or continuous outcome models, leading to suboptimal performances in rating scale data. Guided by behavioural experiments we conducted online, we address this problem by introducing Multinomial-TS for rating scales. After assessing its improved empirical performance in unique optimal arm scenarios, we explore potential considerations (including prior’s role) for calibrating uncertainty and balancing arm allocation in scenarios with no unique optimal arms.

Funder

Università degli Studi di Roma La Sapienza

Publisher

Springer Science and Business Media LLC

Subject

Statistics, Probability and Uncertainty,Statistics and Probability

Link

https://link.springer.com/content/pdf/10.1007/s10260-023-00732-y.pdf

Reference52 articles.

1. Agrawal S, Goyal N (2017) Near-optimal regret bounds for Thompson sampling. J ACM (JACM) 64(5):30:1-30:24. https://doi.org/10.1145/3088510

2. Springer Series in Supply Chain Management;S Agrawal,2022

3. Wiley series in probability and statistics;A Agresti,2019

4. Akobeng AK (2005) Understanding randomised controlled trials. Arch Dis Child 90(8):840–844. https://doi.org/10.1136/adc.2004.058222

5. Altman DG, Royston P (2006) The cost of dichotomising continuous variables. BMJ 332(7549):1080.1. https://doi.org/10.1136/bmj.332.7549.1080