Optimal decision procedures for finite Markov chains. Part II: Communicating systems-Reference-Cited by-同舟云学术

Optimal decision procedures for finite Markov chains. Part II: Communicating systems

Published:1973-12 Issue:3 Volume:5 Page:521-540
ISSN:0001-8678
Container-title:Advances in Applied Probability
language:en
Short-container-title:Advances in Applied Probability

Author:

Bather John

Abstract

A Markov process in discrete time with a finite state space is controlled by choosing the transition probabilities from a given convex family of distributions depending on the present state. The immediate cost is prescribed for each choice and it is required to minimise the average expected cost over an infinite future. The paper considers a special case of this general problem and provides the foundation for a general solution. The main result is that an optimal policy exists if each state of the system can be reached with positive probability from any other state by choosing a suitable policy.

Publisher

Cambridge University Press (CUP)

Subject

Applied Mathematics,Statistics and Probability

Reference10 articles.

1. On Finding Optimal Policies in Discrete Dynamic Programming with No Discounting

2. Non-Discounted Denumerable Markovian Decision Models

3. Étude asymptotique des systèmes Markoviens à commande;Lanery;Revue d'Informatique et Recherche Operationnelle.,1967

4. On the Iterative Method of Dynamic Programming on a Finite Space Discrete Time Markov Process

Cited by 53 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. On structural properties of optimal average cost functions in Markov decision processes with Borel spaces and universally measurable policies;Journal of Mathematical Analysis and Applications;2022-05

2. Robbins–Monro Conditions for Persistent Exploration Learning Strategies;Modern Methods in Operator Theory and Harmonic Analysis;2019

3. Generic uniqueness of the bias vector of finite zero-sum stochastic games with perfect information;Journal of Mathematical Analysis and Applications;2018-01

4. Stationary Anonymous Sequential Games with Undiscounted Rewards;Journal of Optimization Theory and Applications;2014-09-09

5. A Zero-Sum Stochastic Game with Compact Action Sets and no Asymptotic Value;Dynamic Games and Applications;2013-01-24