A Framework for Sequential Planning in Multi-Agent Settings-Reference-Cited by-同舟云学术

A Framework for Sequential Planning in Multi-Agent Settings

Published:2005-07-01 Issue: Volume:24 Page:49-79
ISSN:1076-9757
Container-title:Journal of Artificial Intelligence Research
language:
Short-container-title:jair

Author:

Gmytrasiewicz P. J.,Doshi P.

Abstract

This paper extends the framework of partially observable Markov decision processes (POMDPs) to multi-agent settings by incorporating the notion of agent models into the state space. Agents maintain beliefs over physical states of the environment and over models of other agents, and they use Bayesian updates to maintain their beliefs over time. The solutions map belief states to actions. Models of other agents may include their belief states and are related to agent types considered in games of incomplete information. We express the agents' autonomy by postulating that their models are not directly manipulable or observable by other agents. We show that important properties of POMDPs, such as convergence of value iteration, the rate of convergence, and piece-wise linearity and convexity of the value functions carry over to our framework. Our approach complements a more traditional approach to interactive settings which uses Nash equilibria as a solution paradigm. We seek to avoid some of the drawbacks of equilibria which may be non-unique and do not capture off-equilibrium behaviors. We do so at the cost of having to represent, process and continuously revise models of other agents. Since the agent's beliefs may be arbitrarily nested, the optimal solutions to decision making problems are only asymptotically computable. However, approximate belief updates and approximately optimal plans are computable. We illustrate our framework using a simple application domain, and we show examples of belief updates and value functions.

Publisher

AI Access Foundation

Subject

Artificial Intelligence

Cited by 148 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Hierarchical Framework for Optimizing Wildfire Surveillance and Suppression Using Human-Autonomous Teaming;Journal of Aerospace Information Systems;2024-07-09

2. Risk-Bounded Online Team Interventions via Theory of Mind;2024 IEEE International Conference on Robotics and Automation (ICRA);2024-05-13

3. “Guess what I'm doing”: Extending legibility to sequential decision tasks;Artificial Intelligence;2024-05

4. On the computational complexity of ethics: moral tractability for minds and machines;Artificial Intelligence Review;2024-03-31

5. Modeling and reinforcement learning in partially observable many-agent systems;Autonomous Agents and Multi-Agent Systems;2024-03-26