The optimality equation in average cost denumerable state semi-Markov decision problems, recurrency conditions and algorithms

Published: 1978-06 Issue: 02 Volume: 15 Page: 356-373
ISSN: 0021-9002
Container-title: Journal of Applied Probability
Language: en
Short-container-title: J. Appl. Probab.

Author:

Federgruen A., Tijms H. C.

Abstract

This paper is concerned with the optimality equation for the average costs in a denumerable state semi-Markov decision model. It will be shown that under each of a number of recurrency conditions on the transition probability matrices associated with the stationary policies, the optimality equation has a bounded solution. This solution indeed yields a stationary policy which is optimal for a strong version of the average cost optimality criterion. Besides the existence of a bounded solution to the optimality equation, we will show that both the value-iteration method and the policy-iteration method can be used to determine such a solution. For the latter method we will prove that the average costs and the relative cost functions of the policies generated converge to a solution of the optimality equation.
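
For orientation, the average cost optimality equation for a semi-Markov decision model is commonly written in the form below. The notation is an assumption taken from the standard literature, not from the paper itself: $I$ is the denumerable state space, $A(i)$ the set of actions available in state $i$, $c(i,a)$ the expected one-step cost, $\tau(i,a)$ the expected holding time, $p_{ij}(a)$ the transition probabilities, $g$ the minimal average cost per unit time, and $v$ a bounded relative cost function. The minimum is assumed to be attained.

```latex
% Standard form of the average cost optimality equation (assumed notation)
v(i) = \min_{a \in A(i)} \Bigl\{ c(i,a) - g\,\tau(i,a) + \sum_{j \in I} p_{ij}(a)\, v(j) \Bigr\},
\qquad i \in I .
```

In this form, a stationary policy that selects a minimizing action in every state is the natural candidate for the optimal policy the abstract refers to.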

Publisher

Cambridge University Press (CUP)

Subject

Statistics, Probability and Uncertainty; General Mathematics; Statistics and Probability

Cited by 9 articles.

1. On zero-sum two-person undiscounted semi-Markov games with a multichain structure;Advances in Applied Probability;2017-09

2. A Semi-Markov Inventory Control Model;Cybernetics and Systems Analysis;2016-09

3. State dependent pricing policies: Differentiating customers through valuations and waiting costs;Journal of Revenue and Pricing Management;2012-06-22

4. SEMI-MARKOV DECISION PROCESSES;Probability in the Engineering and Informational Sciences;2007-10

5. Applications of Borovkov's Renovation Theory to Non-Stationary Stochastic Recursive Sequences and Their Control;Advances in Applied Probability;1997-06