The Expected Total Cost Criterion for Markov Decision Processes under Constraints-Reference-Cited by-同舟云学术

The Expected Total Cost Criterion for Markov Decision Processes under Constraints

Published:2013-09 Issue:3 Volume:45 Page:837-859
ISSN:0001-8678
Container-title:Advances in Applied Probability
language:en
Short-container-title:Advances in Applied Probability

Author:

Dufour François,Piunovskiy A. B.

Abstract

In this work, we study discrete-time Markov decision processes (MDPs) with constraints when all the objectives have the same form of expected total cost over the infinite time horizon. Our objective is to analyze this problem by using the linear programming approach. Under some technical hypotheses, it is shown that if there exists an optimal solution for the associated linear program then there exists a randomized stationary policy which is optimal for the MDP, and that the optimal value of the linear program coincides with the optimal value of the constrained control problem. A second important result states that the set of randomized stationary policies provides a sufficient set for solving this MDP. It is important to note that, in contrast with the classical results of the literature, we do not assume the MDP to be transient or absorbing. More importantly, we do not impose the cost functions to be nonnegative or to be bounded below. Several examples are presented to illustrate our results.

Publisher

Cambridge University Press (CUP)

Subject

Applied Mathematics,Statistics and Probability

Reference15 articles.

1. Optimal Control of Random Sequences in Problems with Constraints

2. Markov decision processes with a stopping time constraint

3. Further Topics on Discrete-Time Markov Control Processes

Cited by 7 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Maximizing the probability of visiting a set infinitely often for a Markov decision process with Borel state and action spaces;Journal of Applied Probability;2024-08-22

2. Extreme Occupation Measures in Markov Decision Processes with an Absorbing State;SIAM Journal on Control and Optimization;2024-01-12

3. A Convex Programming Approach for Discrete-Time Markov Decision Processes under the Expected Total Reward Criterion;SIAM Journal on Control and Optimization;2020-01

4. On Reducing a Constrained Gradual-Impulsive Control Problem for a Jump Markov Model to a Model with Gradual Control Only;SIAM Journal on Control and Optimization;2020-01

5. Constrained Markov Decision Processes with Expected Total Reward Criteria;SIAM Journal on Control and Optimization;2019-01