Anytime Point-Based Approximations for Large POMDPs-Reference-Cited by-同舟云学术

Anytime Point-Based Approximations for Large POMDPs

Published:2006-11-26 Issue: Volume:27 Page:335-380
ISSN:1076-9757
Container-title:Journal of Artificial Intelligence Research
language:
Short-container-title:jair

Author:

Pineau J.,Gordon G.,Thrun S.

Abstract

The Partially Observable Markov Decision Process has long been recognized as a rich framework for real-world planning and control problems, especially in robotics. However exact solutions in this framework are typically computationally intractable for all but the smallest problems. A well-known technique for speeding up POMDP solving involves performing value backups at specific belief points, rather than over the entire belief simplex. The efficiency of this approach, however, depends greatly on the selection of points. This paper presents a set of novel techniques for selecting informative belief points which work well in practice. The point selection procedure is combined with point-based value backups to form an effective anytime POMDP algorithm called Point-Based Value Iteration (PBVI). The first aim of this paper is to introduce this algorithm and present a theoretical analysis justifying the choice of belief selection technique. The second aim of this paper is to provide a thorough empirical comparison between PBVI and other state-of-the-art POMDP methods, in particular the Perseus algorithm, in an effort to highlight their similarities and differences. Evaluation is performed using both standard POMDP domains and realistic robotic tasks.

Publisher

AI Access Foundation

Subject

Artificial Intelligence

Cited by 157 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Synergetic-informed deep reinforcement learning for sustainable management of transportation networks with large action spaces;Automation in Construction;2024-04

2. Dynamic joint sensor selection and maintenance optimization in partially observable deteriorating systems;Computers & Industrial Engineering;2024-01

3. Reinforcement Learning for Partially Observable Models;Systems & Control: Foundations & Applications;2024

4. Topological belief space planning for active SLAM with pairwise Gaussian potentials and performance guarantees;The International Journal of Robotics Research;2023-12-20

5. Multi-Agent Cooperative Search in Multi-Object Uncertain Environment;2023 IEEE International Conference on Unmanned Systems (ICUS);2023-10-13