Author:
Jonatha Anselmi, François Dufour, Tomás Prieto-Rumeau
Abstract
In this paper, we study the numerical approximation of the optimal long-run average cost of a continuous-time Markov decision process with Borel state and action spaces and with bounded transition and reward rates. Our approach uses a suitable discretization of the state and action spaces to approximate the original control model. The approximation error for the optimal average reward is then bounded by a linear combination of coefficients related to the discretization of the state and action spaces, namely, the Wasserstein distance between an underlying probability measure μ and a measure with finite support, and the Hausdorff distance between the original and the discretized action sets. When approximating μ with its empirical probability measure, we obtain convergence in probability at an exponential rate. An application to a queueing system is presented.
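The two discretization coefficients named in the abstract can be illustrated numerically. Below is a minimal Python sketch, not taken from the paper, that computes (i) the 1-Wasserstein distance between a proxy for an underlying measure μ and a finite-support empirical approximation, and (ii) the Hausdorff distance between an action set and a finite grid of actions. The choice of μ as a uniform distribution on [0, 1], the action interval, and the grid sizes are illustrative assumptions only.

```python
# Illustrative sketch of the two discretization coefficients from the abstract.
# Assumptions (not from the paper): mu ~ Uniform[0, 1], action set A = [0, 1].
import numpy as np
from scipy.stats import wasserstein_distance

rng = np.random.default_rng(0)

# --- Wasserstein coefficient: mu vs. a finite-support (empirical) measure ---
n_support = 1000
mu_proxy = rng.uniform(0.0, 1.0, size=50_000)      # dense sample standing in for mu
empirical = rng.uniform(0.0, 1.0, size=n_support)   # empirical measure with finite support
w1 = wasserstein_distance(mu_proxy, empirical)
print(f"W1(mu, empirical with {n_support} atoms) ~ {w1:.4f}")

# --- Hausdorff coefficient: action set A vs. its discretization ---
def hausdorff_1d(a_points: np.ndarray, b_points: np.ndarray) -> float:
    """Hausdorff distance between two finite 1-D point sets."""
    d_ab = np.abs(a_points[:, None] - b_points[None, :]).min(axis=1).max()
    d_ba = np.abs(b_points[:, None] - a_points[None, :]).min(axis=1).max()
    return max(d_ab, d_ba)

fine_actions = np.linspace(0.0, 1.0, 10_001)   # dense proxy for the action set A
grid_actions = np.linspace(0.0, 1.0, 11)       # discretized action set with 11 points
print(f"Hausdorff(A, A_grid) ~ {hausdorff_1d(fine_actions, grid_actions):.4f}")
```

In this sketch, refining the empirical measure (larger `n_support`) and the action grid (more points in `grid_actions`) drives both coefficients toward zero, which is the mechanism behind the approximation bound described in the abstract.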
Publisher
Cambridge University Press (CUP)
Subject
Statistics, Probability and Uncertainty; General Mathematics; Statistics and Probability
Cited by
1 article.