A general Markov decision method I: Model and techniques

Published:1977-06 Issue:2 Volume:9 Page:296-315
ISSN:0001-8678
Container-title:Advances in Applied Probability
language:en
Short-container-title:Advances in Applied Probability

Author:

De Leve G., Federgruen A., Tijms H. C.

Abstract

This paper provides a new approach for solving a wide class of Markov decision problems, including problems in which the state space is general and the system can be continuously controlled. The optimality criterion is the long-run average cost per unit time. We decompose the decision processes into a common underlying stochastic process and a sequence of interventions so that the decision processes can be embedded on a reduced set of states. Consequently, in the policy-iteration algorithm resulting from this approach, the number of equations to be solved in any iteration step can be substantially reduced. Further, by its flexibility, this algorithm allows us to exploit any structure of the particular problem to be solved.
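
For orientation, the following is a minimal sketch of textbook policy iteration under the long-run average cost criterion on a finite, unichain Markov decision problem. It is not the embedded method of the paper: it solves one equation per state in every iteration, which is the cost the abstract says the paper's approach reduces. The two-state maintenance example, the arrays `P` and `c`, and all function names are hypothetical and chosen only for illustration.

```python
def solve_linear(a, b):
    """Solve a x = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                f = m[r][col] / m[col][col]
                m[r] = [x - f * y for x, y in zip(m[r], m[col])]
    return [m[i][n] / m[i][i] for i in range(n)]


def evaluate(P, c, policy):
    """Solve g + h[s] = c[s][a] + sum_t P[a][s][t] * h[t] with h[0] = 0."""
    n = len(policy)
    a = []
    for s in range(n):
        row = [(1.0 if s == t else 0.0) - P[policy[s]][s][t] for t in range(n)]
        row[0] = 1.0  # h[0] is fixed at 0, so column 0 carries the unknown g
        a.append(row)
    x = solve_linear(a, [c[s][policy[s]] for s in range(n)])
    return x[0], [0.0] + x[1:]  # average cost g, relative values h


def policy_iteration(P, c, max_iter=100):
    """Alternate evaluation and improvement until no action changes."""
    n_actions, n = len(P), len(c)
    policy = [0] * n
    for _ in range(max_iter):
        g, h = evaluate(P, c, policy)
        changed = False
        for s in range(n):
            q = [c[s][a] + sum(P[a][s][t] * h[t] for t in range(n))
                 for a in range(n_actions)]
            best = min(range(n_actions), key=q.__getitem__)
            # Switch only on a strict improvement to avoid cycling on ties.
            if q[best] < q[policy[s]] - 1e-10:
                policy[s] = best
                changed = True
        if not changed:
            break
    return policy, g


# Hypothetical toy data: state 0 = good, state 1 = bad;
# action 0 = continue, action 1 = repair. P[a][s][t], c[s][a].
P = [
    [[0.9, 0.1], [0.0, 1.0]],  # continue: may deteriorate, stays bad
    [[1.0, 0.0], [1.0, 0.0]],  # repair: always returns to good
]
c = [[0.0, 2.0], [5.0, 2.0]]

policy, g = policy_iteration(P, c)
print(policy, g)
```

On this toy instance the sketch settles on continuing in the good state and repairing in the bad one, with an average cost of 2/11 per step. Each evaluation step solves as many equations as there are states, which is why shrinking the set of states on which the equations are posed matters for larger problems.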

Publisher

Cambridge University Press (CUP)

Subject

Applied Mathematics, Statistics and Probability

References: 16 articles.

1. Markov-Renewal Programming. I: Formulation, Finite Return Models

Cited by 16 articles.

1. Optimal maintenance policy for a multi-state deteriorating system with two types of failures under general repair;Computers & Industrial Engineering;2009-08

2. Wide Sense One-Dependent Processes with Embedded Harris Chains and their Applications in Inventory Management;SSRN Electronic Journal;2002

3. Discretizations for the average impulse control of piecewise deterministic processes;Journal of Applied Probability;1993-06

4. Discrete-Time Controlled Markov Processes with Average Cost Criterion: A Survey;SIAM Journal on Control and Optimization;1993-03