The optimality equation in average cost denumerable state semi-Markov decision problems, recurrency conditions and algorithms-Reference-Cited by-同舟云学术

The optimality equation in average cost denumerable state semi-Markov decision problems, recurrency conditions and algorithms

Published:1978-06 Issue:2 Volume:15 Page:356-373
ISSN:0021-9002
Container-title:Journal of Applied Probability
language:en
Short-container-title:Journal of Applied Probability

Author:

Federgruen A.,Tijms H. C.

Abstract

This paper is concerned with the optimality equation for the average costs in a denumerable state semi-Markov decision model. It will be shown that under each of a number of recurrency conditions on the transition probability matrices associated with the stationary policies, the optimality equation has a bounded solution. This solution indeed yields a stationary policy which is optimal for a strong version of the average cost optimality criterion. Besides the existence of a bounded solution to the optimality equation, we will show that both the value-iteration method and the policy-iteration method can be used to determine such a solution. For the latter method we will prove that the average costs and the relative cost functions of the policies generated converge to a solution of the optimality equation.

Publisher

Cambridge University Press (CUP)

Subject

Statistics, Probability and Uncertainty,General Mathematics,Statistics and Probability

Reference20 articles.

1. Federgruen A. , Schweitzer P. J. and Tijms H. C. (1977) Contraction mappings underlying undiscounted Markov decision problems. J. Math. Anal. Appl. To appear.

2. Iterative solution of the functional equations of undiscounted Markov renewal programming

3. A Solution to a Countable System of Equations Arising in Markovian Decision Processes

Cited by 51 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. On the reduction of total‐cost and average‐cost MDPs to discounted MDPs;Naval Research Logistics (NRL);2017-05-25

2. A Semi-Markov Inventory Control Model;Cybernetics and Systems Analysis;2016-09

3. State dependent pricing policies: Differentiating customers through valuations and waiting costs;Journal of Revenue and Pricing Management;2012-06-22

4. New Average Optimality Conditions for Semi-Markov Decision Processes in Borel Spaces;Journal of Optimization Theory and Applications;2012-01-26

5. Semi-Markov Control Models with Partially Known Holding Times Distribution: Discounted and Average Criteria;Acta Applicandae Mathematicae;2011-03-30