An Exact Double-Oracle Algorithm for Zero-Sum Extensive-Form Games with Imperfect Information-Reference-Cited by-同舟云学术

An Exact Double-Oracle Algorithm for Zero-Sum Extensive-Form Games with Imperfect Information

Published:2014-12-30 Issue: Volume:51 Page:829-866
ISSN:1076-9757
Container-title:Journal of Artificial Intelligence Research
language:
Short-container-title:jair

Author:

Bosansky B.,Kiekintveld C.,Lisy V.,Pechoucek M.

Abstract

Developing scalable solution algorithms is one of the central problems in computational game theory. We present an iterative algorithm for computing an exact Nash equilibrium for two-player zero-sum extensive-form games with imperfect information. Our approach combines two key elements: (1) the compact sequence-form representation of extensive-form games and (2) the algorithmic framework of double-oracle methods. The main idea of our algorithm is to restrict the game by allowing the players to play only selected sequences of available actions. After solving the restricted game, new sequences are added by finding best responses to the current solution using fast algorithms. We experimentally evaluate our algorithm on a set of games inspired by patrolling scenarios, board, and card games. The results show significant runtime improvements in games admitting an equilibrium with small support, and substantial improvement in memory use even on games with large support. The improvement in memory use is particularly important because it allows our algorithm to solve much larger game instances than existing linear programming methods. Our main contributions include (1) a generic sequence-form double-oracle algorithm for solving zero-sum extensive-form games; (2) fast methods for maintaining a valid restricted game model when adding new sequences; (3) a search algorithm and pruning methods for computing best-response sequences; (4) theoretical guarantees about the convergence of the algorithm to a Nash equilibrium; (5) experimental analysis of our algorithm on several games, including an approximate version of the algorithm.

Publisher

AI Access Foundation

Subject

Artificial Intelligence

Cited by 31 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Partially Observable Stochastic Games with Neural Perception Mechanisms;Lecture Notes in Computer Science;2024-09-11

2. Co-optimization of multiple virtual power plants considering electricity-heat-carbon trading: A Stackelberg game strategy;International Journal of Electrical Power & Energy Systems;2023-11

3. HSVI Can Solve Zero-Sum Partially Observable Stochastic Games;Dynamic Games and Applications;2023-09-02

4. RM-FSP: Regret minimization optimizes neural fictitious self-play;Neurocomputing;2023-09

5. Solving zero-sum one-sided partially observable stochastic games;Artificial Intelligence;2023-03