Average optimality for Markov decision processes in Borel spaces: a new condition and approach

Published: 2006-06 Issue: 2 Volume: 43 Pages: 318-334
ISSN: 0021-9002
Container-title: Journal of Applied Probability
Language: en
Short-container-title: J. Appl. Probab.

Authors:

Guo Xianping, Zhu Quanxin

Abstract

In this paper we study discrete-time Markov decision processes with Borel state and action spaces. The criterion is to minimize average expected costs, and the costs may have neither upper nor lower bounds. We first provide two average optimality inequalities of opposing directions and give conditions for the existence of solutions to them. Then, using the two inequalities, we ensure the existence of an average optimal (deterministic) stationary policy under additional continuity-compactness assumptions. Our conditions are slightly weaker than those in the previous literature. Also, some new sufficient conditions for the existence of an average optimal stationary policy are imposed on the primitive data of the model. Moreover, our approach is slightly different from the well-known ‘optimality inequality approach’ widely used in Markov decision processes. Finally, we illustrate our results in two examples.

Publisher

Cambridge University Press (CUP)

Subject

Statistics, Probability and Uncertainty; General Mathematics; Statistics and Probability
