On the Computational Complexity of Stochastic Controller Optimization in POMDPs-Reference-Cited by-同舟云学术

On the Computational Complexity of Stochastic Controller Optimization in POMDPs

Published:2012-11 Issue:4 Volume:4 Page:1-8
ISSN:1942-3454
Container-title:ACM Transactions on Computation Theory
language:en
Short-container-title:ACM Trans. Comput. Theory

Author:

Vlassis Nikos¹,Littman Michael L.²,Barber David³

Affiliation:

1. University of Luxembourg

2. Brown University

3. University College London

Abstract

We show that the problem of finding an optimal stochastic blind controller in a Markov decision process is an NP-hard problem. The corresponding decision problem is NP-hard in PSPACE and sqrt-sum -hard, hence placing it in NP would imply breakthroughs in long-standing open problems in computer science. Our result establishes that the more general problem of stochastic controller optimization in POMDPs is also NP-hard. Nonetheless, we outline a special case that is convex and admits efficient global solutions.

Publisher

Association for Computing Machinery (ACM)

Subject

Computational Theory and Mathematics,Theoretical Computer Science

Link

https://dl.acm.org/doi/pdf/10.1145/2382559.2382563

Reference29 articles.

1. Jointly Constrained Biconvex Programming

2. On the Complexity of Numerical Analysis

3. Boyd S. and Vandenberghe L. 2004. Convex Optimization. Cambridge University Press Cambridge UK. Boyd S. and Vandenberghe L. 2004. Convex Optimization . Cambridge University Press Cambridge UK.

Cited by 26 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Learning optimal admission control in partially observable queueing networks;Queueing Systems;2024-06-29

2. Strong Simple Policies for POMDPs;International Journal on Software Tools for Technology Transfer;2024-06

3. Multi-agent reinforcement learning based optimal energy sensing threshold control in distributed cognitive radio networks with directional antenna;ICT Express;2024-06

4. Algebraic optimization of sequential decision problems;Journal of Symbolic Computation;2024-03

5. Density estimation based soft actor-critic: deep reinforcement learning for static output feedback control with measurement noise;Advanced Robotics;2024-02-07