The variance of discounted Markov decision processes-Reference-Cited by-同舟云学术

The variance of discounted Markov decision processes

Published:1982-12 Issue:4 Volume:19 Page:794-802
ISSN:0021-9002
Container-title:Journal of Applied Probability
language:en
Short-container-title:Journal of Applied Probability

Author:

Sobel Matthew J.

Abstract

Formulae are presented for the variance and higher moments of the present value of single-stage rewards in a finite Markov decision process. Similar formulae are exhibited for a semi-Markov decision process. There is a short discussion of the obstacles to using the variance formula in algorithms to maximize the mean minus a multiple of the standard deviation.

Publisher

Cambridge University Press (CUP)

Subject

Statistics, Probability and Uncertainty,General Mathematics,Statistics and Probability

Reference15 articles.

1. Contraction Mappings in the Theory Underlying Dynamic Programming

2. Temporal Resolution of Uncertainty and Dynamic Choice Theory

3. Markov Decision Processes with a New Optimality Criterion: Discrete Time

4. Markov Renewal Programs with Small Interest Rates

Cited by 113 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Reinforcement Learning in Latent Heterogeneous Environments;Journal of the American Statistical Association;2024-06-11

2. Robust Quadrupedal Locomotion via Risk-Averse Policy Learning;2024 IEEE International Conference on Robotics and Automation (ICRA);2024-05-13

3. Risk probability optimization of finite horizon piecewise deterministic Markov decision processes;Optimization;2024-03-03

4. Reinforcement Learning in Latent Heterogeneous Environments;SSRN Electronic Journal;2024

5. Distributional Probabilistic Model Checking;Lecture Notes in Computer Science;2024