Nonstationary value iteration in controlled Markov chains with risk-sensitive average criterion-Reference-Cited by-同舟云学术

Nonstationary value iteration in controlled Markov chains with risk-sensitive average criterion

Published:2005-12 Issue:4 Volume:42 Page:905-918
ISSN:0021-9002
Container-title:Journal of Applied Probability
language:en
Short-container-title:Journal of Applied Probability

Author:

Cavazos-Cadena Rolando,Montes-De-Oca Raúl

Abstract

This work concerns Markov decision chains with finite state spaces and compact action sets. The performance index is the long-run risk-sensitive average cost criterion, and it is assumed that, under each stationary policy, the state space is a communicating class and that the cost function and the transition law depend continuously on the action. These latter data are not directly available to the decision-maker, but convergent approximations are known or are more easily computed. In this context, the nonstationary value iteration algorithm is used to approximate the solution of the optimality equation, and to obtain a nearly optimal stationary policy.

Publisher

Cambridge University Press (CUP)

Subject

Statistics, Probability and Uncertainty,General Mathematics,Statistics and Probability

Reference16 articles.

1. Adaptive Markov Control Processes

2. Necessary and sufficient conditions for a bounded solution to the optimality equation in average reward Markov decision chains

3. Non-negative Matrices and Markov Chains

4. Iterative solution of the functional equations of undiscounted Markov renewal programming

Cited by 5 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Markov decision processes with risk-sensitive criteria: an overview;Mathematical Methods of Operations Research;2024-04

2. On the global convergence of relative value iteration for infinite-horizon risk-sensitive control of diffusions;Systems & Control Letters;2023-01

3. Ergodic risk-sensitive control—A survey;Annual Reviews in Control;2023

4. Risk-sensitive average optimality in Markov decision processes;Kybernetika;2018-12-28

5. Risk-sensitive semi-Markov decision processes with general utilities and multiple criteria;Advances in Applied Probability;2018-09