Speeding Up the Convergence of Value Iteration in Partially Observable Markov Decision Processes

Published: 2001-02-01 Volume: 14 Pages: 29-51
ISSN: 1076-9757
Container-title: Journal of Artificial Intelligence Research
Short-container-title: jair

Author:

Zhang N. L., Zhang W.

Abstract

Partially observable Markov decision processes (POMDPs) have recently become popular among many AI researchers because they serve as a natural model for planning under uncertainty. Value iteration is a well-known algorithm for finding optimal policies for POMDPs. It typically takes a large number of iterations to converge. This paper proposes a method for accelerating the convergence of value iteration. The method has been evaluated on an array of benchmark problems and was found to be very effective: It enabled value iteration to converge after only a few iterations on all the test problems.

Publisher

AI Access Foundation

Subject

Artificial Intelligence

Cited by 59 articles.

1. Nitty-Gritty of Deep Reinforcement Learning for the Healthcare Sector;AI and IoT-Based Technologies for Precision Medicine;2023-10-18

2. An MDP-based Method for Dynamic Workforce Allocation in Bernoulli Serial Production Lines;2023 IEEE 19th International Conference on Automation Science and Engineering (CASE);2023-08-26

3. Optimizing Age of Information in Wireless Uplink Networks With Partial Observations;IEEE Transactions on Communications;2023-07

4. Solving zero-sum one-sided partially observable stochastic games;Artificial Intelligence;2023-03

5. Answerable and Unanswerable Questions in Decision and Risk Analysis;International Series in Operations Research & Management Science;2023