1. Adaptive Caching Networks with Optimality Guarantees
2. Reinforcement Learning Augmented Asymptotically Optimal Index Policies for Finite-Horizon Restless Bandits;Xiong;Proc of AAAI,2022
3. Restless-UCB, an Efficient and Low-complexity Algorithm for Online Restless Bandits;Wang;Proc of NeurIPS,2020
4. Thompson Sampling in Non-Episodic Restless Bandits;Jung;arXiv preprint arXiv:1910.05656,2019
5. Regret Bounds for Thompson Sampling in Episodic Restless Bandit Problems;Jung;Proc of NeurIPS,2019