Authors: Leonore Winterer, Ralf Wimmer, Bernd Becker, Nils Jansen
Abstract
The synthesis problem for partially observable Markov decision processes (POMDPs) is to compute a policy that provably adheres to one or more specifications. Yet the general problem is undecidable, and policies in general require full (and thus potentially unbounded) traces of execution history. To provide good approximations of such policies, POMDP agents often employ randomization over action choices. We consider the problem of computing simpler policies for POMDPs, and provide several approaches to still ensure their expressiveness. Key aspects are (1) the combination of an arbitrary number of specifications the policies need to adhere to, (2) a restricted form of randomization, and (3) a lightweight preprocessing of the POMDP model to encode memory. We provide a novel encoding as a mixed-integer linear program (MILP) as a baseline to solve the underlying problems. Our experiments demonstrate that the policies we obtain are more robust, smaller, and easier to implement for an engineer than those obtained from state-of-the-art POMDP solvers.
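The paper's full encoding is not reproduced here, but the core MILP idea can be illustrated on a toy reachability POMDP. The following is a minimal sketch, assuming deterministic memoryless (observation-based) policies and a model in which the absorbing goal/sink states are reached almost surely under every policy (this keeps the relaxed Bellman inequalities exact). The toy model, the PuLP formulation, and all names are illustrative assumptions, not taken from the paper.

```python
# Minimal MILP sketch: maximize the probability of reaching "goal" under a
# deterministic observation-based policy. Hypothetical example, not the
# paper's encoding.
from pulp import LpProblem, LpVariable, LpMaximize, lpSum, LpBinary, value

# Toy POMDP: "init" and "mid" share one observation "o" (partial observability).
states = ["init", "mid", "goal", "sink"]
actions = ["a", "b"]
obs = {"init": "o", "mid": "o"}
T = {
    ("init", "a"): {"mid": 0.6, "goal": 0.4},
    ("init", "b"): {"sink": 1.0},
    ("mid", "a"): {"sink": 1.0},
    ("mid", "b"): {"goal": 1.0},
}
transient = ["init", "mid"]
observations = sorted({obs[s] for s in transient})

m = LpProblem("pomdp_det_policy", LpMaximize)

# Binary policy choice per observation; reachability probabilities per state.
sigma = {(o, a): LpVariable(f"sigma_{o}_{a}", cat=LpBinary)
         for o in observations for a in actions}
p = {s: LpVariable(f"p_{s}", lowBound=0, upBound=1) for s in transient}
p["goal"], p["sink"] = 1.0, 0.0  # absorbing states have fixed values

# Exactly one action per observation (deterministic policy).
for o in observations:
    m += lpSum(sigma[(o, a)] for a in actions) == 1

# Big-M Bellman inequalities: the bound is active only for the chosen
# action; M = 1 suffices because all probabilities live in [0, 1].
for s in transient:
    for a in actions:
        m += p[s] <= lpSum(q * p[t] for t, q in T[(s, a)].items()) \
                     + (1 - sigma[(obs[s], a)])

m += p["init"]  # objective: reachability probability from the initial state
m.solve()
print("policy:", {a: value(sigma[("o", a)]) for a in actions})
print("Pr(reach goal) =", value(p["init"]))
```

On this toy model the best deterministic choice (action "a") yields probability 0.4, whereas randomizing over the shared observation raises it to 5/12 ≈ 0.417, which illustrates why the restricted randomization mentioned in the abstract matters. Supporting randomized policies in the MILP would additionally require linearizing the products of policy and value variables, and memory can be added by composing the POMDP with a small memory automaton before encoding, in the spirit of the abstract's preprocessing step.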
Publisher: Springer Science and Business Media LLC