SHP-VI Method of Solving DEC-POMDP Problem-Reference-Cited by-同舟云学术

SHP-VI Method of Solving DEC-POMDP Problem

Published:2014-05 Issue: Volume:926-930 Page:3245-3249
ISSN:1662-8985
Container-title:Advanced Materials Research
language:
Short-container-title:AMR

Author:

Wan Xiao Ping¹,Li Shu Yu¹

Affiliation:

1. Shaanxi Normal University

Abstract

DEC-POMDP(Distributed Partially Observable Markov Decision Process) model is a multi-agent model of collaborative decision-making is important, but due to an alarming number of DEC-POMDP problem state space and great strategy solution space, so DEC-POMDP solution of the problem becomes very difficult. The agent from the initial state to the target state during the interaction with the environment, the system's maximum benefit is often only with some small amount of a higher reward states. This article by searching from the initial belief state to the target state to get a shortest Hamiltonian path, according to the corresponding sequence of actions on the path forward search to get faith belief state space trajectory, and then along the trajectory reverse convictions value function iteration, thus forming the state with the largest gains beliefs trajectory corresponding optimal strategy. In this paper, shortest Hamiltonian path-based value iteration to search the optimal path of faith so as to solve the state Hamiltonian larger DEC-POMDP problem.

Publisher

Trans Tech Publications, Ltd.

Subject

General Engineering

Link

https://www.scientific.net/AMR.926-930.3245.pdf

Reference15 articles.

1. Littman M L, Dean T, Kaelbling L P, On the Complexity of Solving Markov Decision Problems. Proc. of the Eleventh International Conference on Uncertainty in Artificial Intelligence, UAI1995.

2. Wu Feng, Chen Xiaoping. Solving Large-Scale and Sparse-Reward DEC-POMDPs with Correlation-MDPs, Lecture Notes in Computer Science, (2008).

3. Smith T, Simmons R. Focused Real-Time Dynamic Programming for MDPs: Squeezing More Out of a Heuristic. Proc. of AAAI2006, 2006.

4. Fuzhong WANG. A decision support system for logistics distribution network planning based on multi-agent systems, Proceedings of the Ninth International Symposium on Distributed Computing and Applications to Business, Engineering and Science, 2010, 8-10.

5. Barils Eker, H. Levent Akln. Using Evolution Strategies To Solve DEC-POMDP Problems. Soft Computing, 2010, 14: 35-47.