Steady-State Planning in Expected Reward Multichain MDPs-Reference-Cited by-同舟云学术

Steady-State Planning in Expected Reward Multichain MDPs

Published:2021-11-29 Issue: Volume:72 Page:1029-1082
ISSN:1076-9757
Container-title:Journal of Artificial Intelligence Research
language:
Short-container-title:jair

Author:

Atia George K.,Beckus Andre,Alkhouri Ismail,Velasquez Alvaro

Abstract

The planning domain has experienced increased interest in the formal synthesis of decision-making policies. This formal synthesis typically entails finding a policy which satisfies formal specifications in the form of some well-defined logic. While many such logics have been proposed with varying degrees of expressiveness and complexity in their capacity to capture desirable agent behavior, their value is limited when deriving decision-making policies which satisfy certain types of asymptotic behavior in general system models. In particular, we are interested in specifying constraints on the steady-state behavior of an agent, which captures the proportion of time an agent spends in each state as it interacts for an indefinite period of time with its environment. This is sometimes called the average or expected behavior of the agent and the associated planning problem is faced with significant challenges unless strong restrictions are imposed on the underlying model in terms of the connectivity of its graph structure. In this paper, we explore this steady-state planning problem that consists of deriving a decision-making policy for an agent such that constraints on its steady-state behavior are satisfied. A linear programming solution for the general case of multichain Markov Decision Processes (MDPs) is proposed and we prove that optimal solutions to the proposed programs yield stationary policies with rigorous guarantees of behavior.

Publisher

AI Access Foundation

Subject

Artificial Intelligence

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Controller synthesis for linear temporal logic and steady-state specifications;Autonomous Agents and Multi-Agent Systems;2024-05-03

2. Optimal Deterministic Controller Synthesis from Steady-State Distributions;Journal of Automated Reasoning;2023-01-12