Risk-Averse Markov Decision Processes Through a Distributional Lens-Reference-Cited by-同舟云学术

Risk-Averse Markov Decision Processes Through a Distributional Lens

Published:2024-07-17 Issue: Volume: Page:
ISSN:0364-765X
Container-title:Mathematics of Operations Research
language:en
Short-container-title:Mathematics of OR

Author:

Cheng Ziteng¹^ORCID,Jaimungal Sebastian¹^ORCID

Affiliation:

1. Department of Statistical Sciences, University of Toronto, Toronto, Ontario M5G 1Z5, Canada

Abstract

By adopting a distributional viewpoint on law-invariant convex risk measures, we construct dynamic risk measures (DRMs) at the distributional level. We then apply these DRMs to investigate Markov decision processes, incorporating latent costs, random actions, and weakly continuous transition kernels. Furthermore, the proposed DRMs allow risk aversion to change dynamically. Under mild assumptions, we derive a dynamic programming principle and show the existence of an optimal policy in both finite and infinite time horizons. Moreover, we provide a sufficient condition for the optimality of deterministic actions. For illustration, we conclude the paper with examples from optimal liquidation with limit order books and autonomous driving. Funding: This work was supported by Natural Sciences and Engineering Research Council of Canada [Grants RGPAS-2018-522715 and RGPIN-2018-05705].

Publisher

Institute for Operations Research and the Management Sciences (INFORMS)

Link

https://pubsonline.informs.org/doi/pdf/10.1287/moor.2023.0211

Reference55 articles.

1. Are law-invariant risk functions concave on distributions?

2. Spectral measures of risk: A coherent representation of subjective risk aversion

3. Coherent Measures of Risk

4. Minimizing spectral risk measures applied to Markov decision processes

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Uncertainty Propagation and Dynamic Robust Risk Measures;Mathematics of Operations Research;2024-08-09