Expected Value of Communication for Planning in Ad Hoc Teamwork

Published: 2021-05-18 Issue: 13 Volume: 35 Page: 11290-11298
ISSN: 2374-3468
Container-title: Proceedings of the AAAI Conference on Artificial Intelligence
Short-container-title: AAAI

Author:

Macke William, Mirsky Reuth, Stone Peter

Abstract

A desirable goal for autonomous agents is to be able to coordinate on the fly with previously unknown teammates. This capability, known as “ad hoc teamwork”, has been receiving increasing attention in the research community. One of the central challenges in ad hoc teamwork is quickly recognizing the current plans of other agents and planning accordingly. In this paper, we focus on the scenario in which teammates can communicate with one another, but only at a cost. Thus, they must carefully balance plan recognition based on observations against plan recognition based on communication. This paper proposes a new metric, the Expected Divergence Point (EDP), for evaluating how similar two policies that a teammate may be following are. We then present a novel planning algorithm for ad hoc teamwork that determines which query to ask and plans accordingly. We demonstrate the effectiveness of this algorithm in a range of increasingly general problems involving communication in ad hoc teamwork.

Publisher

Association for the Advancement of Artificial Intelligence (AAAI)

Subject

General Medicine

Cited by 4 articles.

1. Exploring the Cost of Interruptions in Human-Robot Teaming;2023 IEEE-RAS 22nd International Conference on Humanoid Robots (Humanoids);2023-12-12

2. Knowledge-based Reasoning and Learning under Partial Observability in Ad Hoc Teamwork;Theory and Practice of Logic Programming;2023-06-26

3. Deep reinforcement learning for multi-agent interaction;AI Communications;2022-09-20

4. A Survey of Ad Hoc Teamwork Research;Multi-Agent Systems;2022