Optimal decision procedures for finite Markov chains. Part III: General convex systems

Author:

Bather John

Abstract

This paper is concerned with the general problem of finding an optimal transition matrix for a finite Markov chain, where the probabilities for each transition must be chosen from a given convex family of distributions. The immediate cost is determined by this choice, but it is required to minimise the average expected cost in the long run. The problem is investigated by classifying the states according to the accessibility relations between them. If an optimal policy exists, it can be found by considering the convex subsystems associated with the states at different levels in the classification scheme.

Publisher

Cambridge University Press (CUP)

Subject

Applied Mathematics,Statistics and Probability

Cited by 23 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Communicating zero-sum product stochastic games;Journal of Mathematical Analysis and Applications;2019-09

2. Maximum-Stopping-Value Policies in Finite Markov Population Decision Chains;Mathematics of Operations Research;2014-08

3. A Zero-Sum Stochastic Game with Compact Action Sets and no Asymptotic Value;Dynamic Games and Applications;2013-01-24

4. Stabilizing Policy Improvement for Large-Scale Infinite-Horizon Dynamic Programming;SIAM Journal on Matrix Analysis and Applications;2009-01

5. On polynomial cases of the unichain classification problem for Markov Decision Processes;Operations Research Letters;2008-09

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3