Partial-Observation Stochastic Games-Reference-Cited by-同舟云学术

Partial-Observation Stochastic Games

Published:2014-04 Issue:2 Volume:15 Page:1-44
ISSN:1529-3785
Container-title:ACM Transactions on Computational Logic
language:en
Short-container-title:ACM Trans. Comput. Logic

Author:

Chatterjee Krishnendu¹,Doyen Laurent²

Affiliation:

1. IST Austria

2. LSV, ENS Cachan & CNRS, France

Abstract

In two-player finite-state stochastic games of partial observation on graphs, in every state of the graph, the players simultaneously choose an action, and their joint actions determine a probability distribution over the successor states. The game is played for infinitely many rounds and thus the players construct an infinite path in the graph. We consider reachability objectives where the first player tries to ensure a target state to be visited almost-surely (i.e., with probability 1) or positively (i.e., with positive probability), no matter the strategy of the second player. We classify such games according to the information and to the power of randomization available to the players. On the basis of information, the game can be one-sided with either ( a ) player 1, or ( b ) player 2 having partial observation (and the other player has perfect observation), or two-sided with ( c ) both players having partial observation. On the basis of randomization, ( a ) the players may not be allowed to use randomization (pure strategies), or ( b ) they may choose a probability distribution over actions but the actual random choice is external and not visible to the player (actions invisible), or ( c ) they may use full randomization. Our main results for pure strategies are as follows: (1) For one-sided games with player 2 having perfect observation we show that (in contrast to full randomized strategies) belief-based (subset-construction based) strategies are not sufficient, and we present an exponential upper bound on memory both for almost-sure and positive winning strategies; we show that the problem of deciding the existence of almost-sure and positive winning strategies for player 1 is EXPTIME-complete and present symbolic algorithms that avoid the explicit exponential construction. (2) For one-sided games with player 1 having perfect observation we show that nonelementary memory is both necessary and sufficient for both almost-sure and positive winning strategies. (3) We show that for the general (two-sided) case finite-memory strategies are sufficient for both positive and almost-sure winning, and at least nonelementary memory is required. We establish the equivalence of the almost-sure winning problems for pure strategies and for randomized strategies with actions invisible. Our equivalence result exhibit serious flaws in previous results of the literature: we show a nonelementary memory lower bound for almost-sure winning whereas an exponential upper bound was previously claimed.

Funder

European Research Council

Microsoft Research

Austrian Science Fund

Publisher

Association for Computing Machinery (ACM)

Subject

Computational Mathematics,Logic,General Computer Science,Theoretical Computer Science

Link

https://dl.acm.org/doi/pdf/10.1145/2579821

Reference59 articles.

1. Alternating-time temporal logic

2. The Effect of Tossing Coins in Omega-Automata

Cited by 19 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. HSVI Can Solve Zero-Sum Partially Observable Stochastic Games;Dynamic Games and Applications;2023-09-02

2. Model Checking for Probabilistic Multiagent Systems;Journal of Computer Science and Technology;2023-09

3. An Overview of Opponent Modeling for Multi-agent Competition;Machine Learning for Cyber Security;2023

4. Analysis and applications of a bridge game;Journal of Ambient Intelligence and Humanized Computing;2021-11-01

5. Alternating Tree Automata with Qualitative Semantics;ACM Transactions on Computational Logic;2021-01-22