Author:
Liu Wanchun,Leong Alex S.,Quevedo Daniel E.
Funder
Australian Research Council
Reference39 articles.
1. Special issue on learning and control;IEEE Transactions on Automatic Control,2023
2. Near-optimal regret bounds for Thompson sampling;Agrawal;Journal of the ACM,2017
3. Finite-time analysis of the multiarmed bandit problem;Auer;Machine Learning,2002
4. The nonstochastic multiarmed bandit problem;Auer;SIAM Journal on Computing,2002
5. Thompson sampling for stochastic control: The continuous parameter case;Banjević;IEEE Transactions on Automatic Control,2019