1. Improved algorithms for linear stochastic bandits;Abbasi-Yadkori,2011
2. Survey on applications of multi-armed and contextual bandits;Bouneffouf,2020
3. Reinforcement learning with algorithms from probabilistic structure estimation;Epperlein,2021
4. Learning rates for Q-learning;Even-Dar;Journal of Machine Learning Research,2003
5. Reinforcement learning based resource allocation in business process management;Huang;Data & Knowledge Engineering,2011