An axiomatic approach to Markov decision processes-Reference-Cited by-同舟云学术

An axiomatic approach to Markov decision processes

Published:2022-12-02 Issue:1 Volume:97 Page:117-133
ISSN:1432-2994
Container-title:Mathematical Methods of Operations Research
language:en
Short-container-title:Math Meth Oper Res

Author:

Jonsson Adam^ORCID

Abstract

AbstractThis paper presents an axiomatic approach to finite Markov decision processes where the discount rate is zero. One of the principal difficulties in the no discounting case is that, even if attention is restricted to stationary policies, a strong overtaking optimal policy need not exists. We provide preference foundations for two criteria that do admit optimal policies: 0-discount optimality and average overtaking optimality. As a corollary of our results, we obtain conditions on a decision maker’s preferences which ensure that an optimal policy exists. These results have implications for disciplines where dynamic programming problems arise, including automatic control, dynamic games, and economic development.

Funder

Lulea University of Technology

Publisher

Springer Science and Business Media LLC

Subject

Management Science and Operations Research,General Mathematics,Software

Link

https://link.springer.com/content/pdf/10.1007/s00186-022-00806-9.pdf

Reference40 articles.

1. Arapostathis A, Borkar VS, Fernández-Gaucherand E, Ghosh MK, Marcus SI (1993) Discrete-time controlled Markov processes with average cost criterion: a survey. SIAM J Control Optim 31(2):282–344

2. Asheim G, Tungodden B (2004) Resolving distributional conflicts between generations. Econ Theor 24(1):221–230

3. Asheim GB, d’Aspremont C, Banerjee K (2010) Generalized time-invariant overtaking. J Math Econ 46(4):519–533

4. Banerjee K, Mitra T (2008) On the continuity of ethical social welfare orders on infinite utility streams. Soc Choice Welfare 30(1):1–12

5. Basu K, Mitra T (2003) Aggregating infinite utility streams with intergenerational equity: the impossibility of being Paretian. Econometrica 71(5):1557–1563