Elaboration Tolerant Representation of Markov Decision Process via Decision-Theoretic Extension of Probabilistic Action Language +-Reference-Cited by-同舟云学术

Elaboration Tolerant Representation of Markov Decision Process via Decision-Theoretic Extension of Probabilistic Action Language +

Published:2020-12-23 Issue:3 Volume:21 Page:348-371
ISSN:1471-0684
Container-title:Theory and Practice of Logic Programming
language:en
Short-container-title:Theory and Practice of Logic Programming

Author:

WANG YI^ORCID,LEE JOOHYUNG^ORCID

Abstract

AbstractWe extend probabilistic action language

$p{\cal BC}$

+ with the notion of utility in decision theory. The semantics of the extended

$p{\cal BC}$

+ can be defined as a shorthand notation for a decision-theoretic extension of the probabilistic answer set programming language LPMLN. Alternatively, the semantics of

$p{\cal BC}$

+ can also be defined in terms of Markov decision process (MDP), which in turn allows for representing MDP in a succinct and elaboration tolerant way as well as leveraging an MDP solver to compute a

$p{\cal BC}$

+ action description. The idea led to the design of the system pbcplus2mdp, which can find an optimal policy of a

$p{\cal BC}$

+ action description using an MDP solver.

Publisher

Cambridge University Press (CUP)

Subject

Artificial Intelligence,Computational Theory and Mathematics,Hardware and Architecture,Theoretical Computer Science,Software

Reference36 articles.

1. Younes, H. L. and Littman, M. L. 2004. PPDDL1.0: An extension to PDDL for expressing planning domains with probabilistic effects. Techn. Rep. CMU-CS-04-162.

2. Answer set programming for non-stationary Markov decision processes

3. Wang, Y. 2020. ywang485/pbcplus2mdp: pbcplus2mdp v0.1.

4. Watkins, C. J. C. H. 1989. Learning from Delayed Rewards. Ph.D. thesis, King’s College, Cambridge, UK.