Abstract
AbstractWe consider discrete-time Markov decision processes in which the decision maker is interested in long but finite horizons. First we consider reachability objective: the decision maker’s goal is to reach a specific target state with the highest possible probability. A strategy is said to overtake another strategy, if it gives a strictly higher probability of reaching the target state on all sufficiently large but finite horizons. We prove that there exists a pure stationary strategy that is not overtaken by any pure strategy nor by any stationary strategy, under some condition on the transition structure and respectively under genericity. A strategy that is not overtaken by any other strategy, called an overtaking optimal strategy, does not always exist. We provide sufficient conditions for its existence. Next we consider safety objective: the decision maker’s goal is to avoid a specific state with the highest possible probability. We argue that the results proven for reachability objective extend to this model.
Funder
COST Action European Network for Game Theory
Israel Science Foundation
Publisher
Springer Science and Business Media LLC
Subject
Applied Mathematics,Management Science and Operations Research,Control and Optimization
Reference22 articles.
1. Baier, C., Katoen, J.P.: Principles of Model Checking. MIT Press, Cambridge (2008)
2. Chatterjee, K., Henzinger, T.A.: A survey of stochastic $$\omega $$-regular games. J. Comput. Syst. Sci. 78, 394–413 (2012)
3. Lecture Notes in Computer Science;V Bruyère,2017
4. Howard, R.A.: Dynamic Programming and Markov Processes. MIT Press, Cambridge (1960)
5. Blackwell, D.: Discrete dynamic programming. Ann. Math. Stat. 33, 719–726 (1962)
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献