Authors:
Louis Dressel, Mykel Kochenderfer
Abstract
Partially observable Markov decision processes (POMDPs) offer a principled approach to control under uncertainty. However, POMDP solvers generally require rewards to depend only on the state and action. This limitation makes them unsuitable for information-gathering problems, where rewards are more naturally expressed as functions of belief. In this work, we consider target localization, an information-gathering task where an agent takes actions leading to informative observations and a concentrated belief over possible target locations. By leveraging recent theoretical and algorithmic advances, we investigate offline and online solvers that incorporate belief-dependent rewards. We extend SARSOP, a state-of-the-art offline solver, to handle belief-dependent rewards, exploring different reward strategies and showing how they can be compactly represented. We present an improved lower bound that greatly speeds convergence. POMDP-lite, an online solver, is also evaluated in the context of information-gathering tasks. These solvers are applied to control a hexcopter UAV searching for a radio frequency source, a challenging real-world problem.
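For illustration only: a belief-dependent reward commonly used in target localization is the negative entropy of the belief, which is highest when probability mass concentrates on a single candidate location. The sketch below shows this idea in Python; the function name, grid size, and belief values are illustrative assumptions, not the paper's implementation or its specific reward strategies.

```python
import numpy as np

def neg_entropy_reward(belief: np.ndarray) -> float:
    """Negative Shannon entropy -H(b) of a discrete belief.

    Highest (zero) when all mass sits on one cell, i.e. the target
    location is known; lowest for a uniform belief.
    """
    p = belief[belief > 0]              # drop zero entries to avoid log(0)
    return float(np.sum(p * np.log(p)))

# Hypothetical belief over a 10x10 grid of candidate target cells (flattened).
n = 100
uniform = np.full(n, 1.0 / n)           # nothing known yet
peaked = np.full(n, 0.001 / (n - 1))    # target almost localized
peaked[42] = 0.999

print(neg_entropy_reward(uniform))  # ~ -4.61 (= -log(100))
print(neg_entropy_reward(peaked))   # ~ -0.012, much closer to 0
```

Because such a reward is a function of the belief rather than of the state and action alone, it cannot be written as R(s, a), which is why standard solvers such as SARSOP must be extended to handle it, as the abstract describes.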
Publisher
Association for the Advancement of Artificial Intelligence (AAAI)
Cited by
7 articles.