Quantile Markov Decision Processes

Published: 2021-11-09
ISSN: 0030-364X
Journal: Operations Research
Language: en

Author:

Li Xiaocheng¹, Zhong Huaiyang¹, Brandeau Margaret L.¹

Affiliation:

1. Department of Management Science and Engineering, Stanford University, Stanford, California 94305

Abstract

Title: Sequential Decision Making Using Quantiles

The goal of a traditional Markov decision process (MDP) is to maximize the expected cumulative reward over a finite or infinite horizon. In many applications, however, a decision maker may instead want to optimize a specific quantile of the cumulative reward. For example, a physician may want to determine the optimal drug regimen for a risk-averse patient with the objective of maximizing the 0.10 quantile of the cumulative reward, that is, the cumulative improvement in health that the patient achieves with at least 90% probability. In “Quantile Markov Decision Processes,” X. Li, H. Zhong, and M. Brandeau provide analytic results to solve the quantile Markov decision process (QMDP) problem. They develop an efficient dynamic programming procedure that finds the optimal QMDP value function for all states and quantiles in one pass. The algorithm also extends to the MDP problem with a conditional value-at-risk objective.
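
To make the difference between the expectation and quantile objectives concrete, the sketch below compares two hypothetical treatments in a deliberately simplified setting: a single state, a fixed policy, and i.i.d. per-step rewards over three periods. The reward values, probabilities, and function names are illustrative assumptions, not taken from the paper, and the code is not the authors' dynamic programming procedure. It only evaluates the two objectives for a given policy.

```python
from fractions import Fraction


def cumulative_distribution(step_rewards, horizon):
    """Exact distribution of the sum of `horizon` i.i.d. per-step rewards.

    `step_rewards` maps reward value -> probability (hypothetical inputs).
    """
    dist = {0: Fraction(1)}
    for _ in range(horizon):
        nxt = {}
        for total, p in dist.items():
            for reward, q in step_rewards.items():
                nxt[total + reward] = nxt.get(total + reward, Fraction(0)) + p * q
        dist = nxt
    return dist


def expectation(dist):
    """Expected cumulative reward (the traditional MDP objective)."""
    return sum(x * p for x, p in dist.items())


def quantile(dist, tau):
    """Lower tau-quantile: the smallest x with P(X <= x) >= tau."""
    cum = Fraction(0)
    for x in sorted(dist):
        cum += dist[x]
        if cum >= tau:
            return x
    raise ValueError("probabilities do not sum to at least tau")


# Hypothetical per-step reward distributions for two treatments.
AGGRESSIVE = {10: Fraction(4, 5), -10: Fraction(1, 5)}
CONSERVATIVE = {5: Fraction(1, 2), 3: Fraction(1, 2)}
TAU = Fraction(1, 10)  # the 0.10 quantile from the abstract's example

for name, rewards in [("aggressive", AGGRESSIVE), ("conservative", CONSERVATIVE)]:
    d = cumulative_distribution(rewards, horizon=3)
    print(name, "expectation:", expectation(d), "0.10 quantile:", quantile(d, TAU))
```

Under these assumed numbers, the aggressive treatment has the higher expected cumulative reward (18 versus 12), while the conservative treatment has the higher 0.10 quantile (9 versus -10), so the two objectives rank the treatments differently.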

Publisher

Institute for Operations Research and the Management Sciences (INFORMS)

Subject

Management Science and Operations Research, Computer Science Applications

Cited by 1 article.

1. CVaR-based optimization of environmental flow via the Markov lift of a mixed moving average process. Optimization and Engineering, 2023-04-26.