Solving Transition Independent Decentralized Markov Decision Processes

Published:2004-12-01 Volume:22 Page:423-455
ISSN:1076-9757
Container-title:Journal of Artificial Intelligence Research
Short-container-title:jair

Author:

Becker R., Zilberstein S., Lesser V., Goldman C. V.

Abstract

Formal treatment of collaborative multi-agent systems has been lagging behind the rapid progress in sequential decision making by individual agents. Recent work in the area of decentralized Markov Decision Processes (MDPs) has contributed to closing this gap, but the computational complexity of these models remains a serious obstacle. To overcome this complexity barrier, we identify a specific class of decentralized MDPs in which the agents' transitions are independent. The class consists of independent collaborating agents that are tied together through a structured global reward function that depends on all of their histories of states and actions. We present a novel algorithm for solving this class of problems and examine its properties, both as an optimal algorithm and as an anytime algorithm. To the best of our knowledge, this is the first algorithm to optimally solve a non-trivial subclass of decentralized MDPs. It lays the foundation for further work in this area on both exact and approximate algorithms.

Publisher

AI Access Foundation

Subject

Artificial Intelligence

Cited by 73 articles.

1. Single- and Multi-Agent Private Active Sensing: A Deep Neuroevolution Approach;2024 IEEE International Conference on Communications Workshops (ICC Workshops);2024-06-09

2. Game‐theoretic algorithm for interdependent infrastructure network restoration in a decentralized environment;Risk Analysis;2024-01-04

3. Reinforcement learning algorithms;Decision-Making Models;2024

4. A Dual-Agent Scheduler for Distributed Deep Learning Jobs on Public Cloud via Reinforcement Learning;Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining;2023-08-04

5. Formal Verification for Multi-Agent Path Execution Under Stochastic Environments;2023