Reduction Techniques for Model Checking and Learning in MDPs-Reference-Cited by-同舟云学术

Reduction Techniques for Model Checking and Learning in MDPs

Published:2017-08 Issue: Volume: Page:
ISSN:
Container-title:Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence
language:
Short-container-title:

Author:

Bharadwaj Suda¹,Le Roux Stephane²,Perez Guillermo²,Topcu Ufuk¹

Affiliation:

1. The University of Texas at Austin

2. Universite libre de Bruxelles

Abstract

Omega-regular objectives in Markov decision processes (MDPs) reduce to reachability: find a policy which maximizes the probability of reaching a target set of states. Given an MDP, an initial distribution, and a target set of states, such a policy can be computed by most probabilistic model checking tools. If the MDP is only partially specified, i.e., some prob- abilities are unknown, then model-learning techniques can be used to statistically approximate the probabilities and enable the computation of the de- sired policy. For fully specified MDPs, reducing the size of the MDP translates into faster model checking; for partially specified MDPs, into faster learning. We provide reduction techniques that al- low us to remove irrelevant transition probabilities: transition probabilities (known, or to be learned) that do not influence the maximal reachability probability. Among other applications, these reductions can be seen as a pre-processing of MDPs before model checking or as a way to reduce the number of experiments required to obtain a good approximation of an unknown MDP.

Publisher

International Joint Conferences on Artificial Intelligence Organization

Cited by 4 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Graph-Based Reductions for Parametric and Weighted MDPs;Automated Technology for Verification and Analysis;2023

2. Fuel in Markov Decision Processes (FiMDP): A Practical Approach to Consumption;Formal Methods;2021

3. Are Parametric Markov Chains Monotonic?;Automated Technology for Verification and Analysis;2019

4. The Complexity of Graph-Based Reductions for Reachability in Markov Decision Processes;Lecture Notes in Computer Science;2018