Asymptotic Optimal Control of Markov-Modulated Restless Bandits

Published:2019-01-17 Issue:1 Volume:46 Page:44-46
ISSN:0163-5999
Container-title:ACM SIGMETRICS Performance Evaluation Review
language:en
Short-container-title:SIGMETRICS Perform. Eval. Rev.

Author:

Duran Santiago¹,Verloop Ina Maria²

Affiliation:

1. CNRS, LAAS & Univ. de Toulouse, Toulouse, France

2. CNRS, IRIT & Univ. de Toulouse, Toulouse, France

Abstract

This paper studies optimal control subject to changing conditions, an area that has recently received a lot of attention because it arises in numerous practical situations. Applications include cloud computing systems with fluctuating arrival rates and the time-varying capacity encountered in power-aware systems or wireless downlink channels. To study this, we focus on a restless bandit model, which has proved to be a powerful stochastic optimization framework for modeling the scheduling of activities. This paper is a first step toward the optimal control of restless bandits that are subject to changing conditions. We consider the restless bandit problem in an asymptotic regime, obtained by letting the population of bandits grow large and letting the environment change relatively fast. We present sufficient conditions for a policy to be asymptotically optimal and show that a set of priority policies satisfies them. Under an indexability assumption, an averaged version of Whittle's index policy is proved to belong to this set of asymptotically optimal policies. The performance of the averaged Whittle's index policy is numerically evaluated for a multi-class scheduling problem.

Funder

Agence Nationale de la Recherche

Publisher

Association for Computing Machinery (ACM)

Subject

Computer Networks and Communications,Hardware and Architecture,Software

Link

https://dl.acm.org/doi/pdf/10.1145/3292040.3219636
