On Linear Programming for Constrained and Unconstrained Average-Cost Markov Decision Processes with Countable Action Spaces and Strictly Unbounded Costs-Reference-Cited by-同舟云学术

On Linear Programming for Constrained and Unconstrained Average-Cost Markov Decision Processes with Countable Action Spaces and Strictly Unbounded Costs

Published:2021-12-15 Issue: Volume: Page:
ISSN:0364-765X
Container-title:Mathematics of Operations Research
language:en
Short-container-title:Mathematics of OR

Author:

Yu Huizhen¹^ORCID

Affiliation:

1. Department of Computing Science, University of Alberta, Edmonton, Alberta T6G 2E8, Canada

Abstract

We consider the linear programming approach for constrained and unconstrained Markov decision processes (MDPs) under the long-run average-cost criterion, where the class of MDPs in our study have Borel state spaces and discrete countable action spaces. Under a strict unboundedness condition on the one-stage costs and a recently introduced majorization condition on the state transition stochastic kernel, we study infinite-dimensional linear programs for the average-cost MDPs and prove the absence of a duality gap and other optimality results. Our results do not require a lower-semicontinuous MDP model. Thus, they can be applied to countable action space MDPs where the dynamics and one-stage costs are discontinuous in the state variable. Our proofs make use of the continuity property of Borel measurable functions asserted by Lusin’s theorem.

Publisher

Institute for Operations Research and the Management Sciences (INFORMS)

Subject

Management Science and Operations Research,Computer Science Applications,General Mathematics

Reference40 articles.

1. Non-Existence of Everywhere Proper Conditional Distributions

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Markov decision processes with burstiness constraints;European Journal of Operational Research;2024-02

2. Multi-stage uplift and exhumation processes in the eastern Pamir since Late Miocene: Constrained by fission tracks and (U-Th)/He thermochronology;ACTA PETROL SIN;2023

3. Thermo-Mechanical Buckling Analysis of Restrained Columns Under Longitudinal Steady-State Heat Conduction;Iranian Journal of Science and Technology, Transactions of Civil Engineering;2022-12-27