POMCP with Human Preferences in Settlers of Catan
-
Published:2018-09-25
Issue:1
Volume:14
Page:17-23
-
ISSN:2334-0924
-
Container-title:Proceedings of the AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment
-
language:
-
Short-container-title:AIIDE
Author:
Dobre Mihai,Lascarides Alex
Abstract
We present a suite of techniques for extending the Partially Observable Monte Carlo Planning algorithm to handle complex multi-agent games. We design the planning algorithm to exploit the inherent structure of the game. When game rules naturally cluster the actions into sets called types, these can be leveraged to extract characteristics and high-level strategies from a sparse corpus of human play. Another key insight is to account for action legality both when extracting policies from game play and when these are used to inform the forward sampling method. We evaluate our algorithm against other baselines and versus ablated versions of itself in the well-known board game Settlers of Catan.
Publisher
Association for the Advancement of Artificial Intelligence (AAAI)
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献