Continuous-Time Mean Field Markov Decision Models

Published:2024-06-22 Issue:1 Volume:90 Page:
ISSN:0095-4616
Container-title:Applied Mathematics & Optimization
language:en
Short-container-title:Appl Math Optim

Author:

Bäuerle Nicole, Höfer Sebastian

Abstract

We consider a finite number of N statistically equal agents, each moving on a finite set of states according to a continuous-time Markov Decision Process (MDP). Transition intensities of the agents and generated rewards depend not only on the state and action of the agent itself, but also on the states of the other agents as well as the chosen action. Interactions like this are typical for a wide range of models in e.g. biology, epidemics, finance, social science and queueing systems among others. The aim is to maximize the expected discounted reward of the system, i.e. the agents have to cooperate as a team. Computationally this is a difficult task when N is large. Thus, we consider the limit for $N\rightarrow \infty$. In contrast to other papers we treat this problem from an MDP perspective. This has the advantage that we need less regularity assumptions in order to construct asymptotically optimal strategies than using viscosity solutions of HJB equations. The convergence rate is $1/\sqrt{N}$. We show how to apply our results using two examples: a machine replacement problem and a problem from epidemics. We also show that optimal feedback policies from the limiting problem are not necessarily asymptotically optimal.
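The mean-field limit mentioned in the abstract can be illustrated with a small simulation. The sketch below is not the model or the method of the paper: it assumes a hypothetical, uncontrolled two-state epidemic (SIS) chain in which each susceptible agent becomes infected at rate `BETA * x` (with `x` the current infected fraction) and each infected agent recovers at rate `GAMMA`. The rate values and all function names are illustrative assumptions. It compares the simulated infected fraction of the N-agent system with the solution of the limiting ODE x' = BETA·x·(1 − x) − GAMMA·x, so that the deviation can be observed to shrink as N grows.

```python
import math
import random

# Hypothetical infection and recovery rates (assumptions, not from the paper).
BETA, GAMMA = 2.0, 1.0


def mean_field_fraction(x0, t):
    """Closed-form solution of the limiting ODE x' = BETA*x*(1-x) - GAMMA*x.

    The ODE is logistic with growth rate r = BETA - GAMMA and
    equilibrium k = r / BETA (requires BETA > GAMMA and x0 > 0).
    """
    r = BETA - GAMMA
    k = r / BETA
    return k / (1.0 + (k / x0 - 1.0) * math.exp(-r * t))


def simulate_fraction(n, x0, horizon, rng):
    """Gillespie simulation of the n-agent chain; returns the infected fraction at `horizon`."""
    infected = round(n * x0)
    t = 0.0
    while True:
        x = infected / n
        up = BETA * x * (n - infected)  # total rate of new infections
        down = GAMMA * infected         # total rate of recoveries
        total = up + down
        if total == 0.0:                # no infected agents left: absorbing state
            break
        t += rng.expovariate(total)     # waiting time until the next jump
        if t > horizon:
            break
        infected += 1 if rng.random() * total < up else -1
    return infected / n


def mean_abs_error(n, x0=0.2, horizon=5.0, runs=20, seed=0):
    """Average absolute gap between simulated fraction and mean-field limit."""
    rng = random.Random(seed)
    target = mean_field_fraction(x0, horizon)
    gaps = (abs(simulate_fraction(n, x0, horizon, rng) - target) for _ in range(runs))
    return sum(gaps) / runs


if __name__ == "__main__":
    for n in (50, 200, 800, 3200):
        err = mean_abs_error(n)
        # If the gap scales like 1/sqrt(N), err * sqrt(N) stays roughly constant.
        print(f"N={n:5d}  mean |error|={err:.4f}  error*sqrt(N)={err * math.sqrt(n):.3f}")
```

The rescaled column `error*sqrt(N)` is printed only as a rough check of the scaling for the state process in this toy chain. The convergence rate stated in the abstract concerns the paper's controlled problem, which this sketch does not implement.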

Funder

Karlsruher Institut für Technologie (KIT)

Publisher

Springer Science and Business Media LLC

Link

https://link.springer.com/content/pdf/10.1007/s00245-024-10154-1.pdf
