Affiliation:
1. Carnegie Mellon University, Pittsburgh, PA
2. Arizona State University, Tempe, AZ
Abstract
We introduce an algorithm to automatically learn probabilistic hierarchical task networks (pHTNs) that capture a user's preferences on plans by observing only the user's behavior. HTNs are a common choice of representation for a variety of purposes in planning, including work on learning in planning. Our contributions are twofold. First, in contrast with prior work, which employs HTNs to represent domain physics or search control knowledge, we use HTNs to model user preferences. Second, while most prior work on HTN learning requires additional information (e.g., annotated traces or tasks) to assist the learning process, our system only takes plan traces as input. Initially, we assume that users carry out preferred plans more frequently, and thus the observed distribution of plans is an accurate representation of user preference. We then generalize to the situation where feasibility constraints frequently prevent the execution of preferred plans. Taking the prevalent perspective of viewing HTNs as grammars over primitive actions, we adapt an expectation-maximization (EM) technique from the discipline of probabilistic grammar induction to acquire probabilistic context-free grammars (pCFGs) that capture the distribution on plans. To account for the difference between the distributions of possible and preferred plans, we subsequently modify this core EM technique by rescaling its input. We empirically demonstrate that the proposed approaches are able to learn HTNs representing user preferences better than the inside-outside algorithm. Furthermore, when feasibility constraints are obfuscated, the algorithm with rescaled input performs better than the algorithm with the original input.
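To make the "HTNs as grammars over primitive actions" view concrete, the sketch below scores a plan under a toy pCFG using the standard inside (CYK-style) recursion. It is an illustration only, not the paper's learning algorithm: the task names, action names, and rule probabilities are all hypothetical, and the grammar is assumed to be in Chomsky normal form.

```python
# Hypothetical toy pHTN written as a pCFG in Chomsky normal form.
# Non-primitive tasks are nonterminals; primitive actions are terminals.
# All names and probabilities here are illustrative assumptions.

# Binary rules: (task, left subtask, right subtask) -> probability
BINARY = {
    ("Travel", "BuyTrainTicket", "RideTrain"): 0.8,
    ("Travel", "BuyBusTicket", "RideBus"): 0.2,
}

# Lexical rules: (task, primitive action) -> probability
LEXICAL = {
    ("BuyTrainTicket", "buy_train_ticket"): 1.0,
    ("RideTrain", "ride_train"): 1.0,
    ("BuyBusTicket", "buy_bus_ticket"): 1.0,
    ("RideBus", "ride_bus"): 1.0,
}


def plan_probability(plan, start="Travel"):
    """Return the probability that `start` derives the action sequence `plan`."""
    n = len(plan)
    if n == 0:
        return 0.0

    # inside[(i, j, task)] = probability that `task` derives plan[i:j]
    inside = {}

    # Spans of length 1: match primitive actions against lexical rules.
    for i, action in enumerate(plan):
        for (task, primitive), p in LEXICAL.items():
            if primitive == action:
                key = (i, i + 1, task)
                inside[key] = inside.get(key, 0.0) + p

    # Longer spans: combine two adjacent sub-spans with a binary rule.
    for span in range(2, n + 1):
        for i in range(0, n - span + 1):
            j = i + span
            for k in range(i + 1, j):
                for (task, left, right), p in BINARY.items():
                    left_p = inside.get((i, k, left), 0.0)
                    right_p = inside.get((k, j, right), 0.0)
                    if left_p > 0.0 and right_p > 0.0:
                        key = (i, j, task)
                        inside[key] = inside.get(key, 0.0) + p * left_p * right_p

    return inside.get((0, n, start), 0.0)


if __name__ == "__main__":
    print(plan_probability(["buy_train_ticket", "ride_train"]))
    print(plan_probability(["buy_bus_ticket", "ride_bus"]))
    print(plan_probability(["buy_train_ticket", "ride_bus"]))
```

In this toy grammar the rule probabilities play the role of preferences: a user who prefers the train is modeled by putting more mass on the train decomposition of `Travel`. An EM-based learner would estimate such probabilities from observed plan traces rather than having them written by hand.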
Funder
Division of Information and Intelligent Systems
Office of Naval Research
Publisher
Association for Computing Machinery (ACM)
Subject
Artificial Intelligence, Theoretical Computer Science
Cited by
12 articles.
1. VAL: Interactive Task Learning with GPT Dialog Parsing;Proceedings of the CHI Conference on Human Factors in Computing Systems;2024-05-11
2. Aligning Human and Robot Representations;Proceedings of the 2024 ACM/IEEE International Conference on Human-Robot Interaction;2024-03-11
3. Adaptive and intelligent robot task planning for home service: A review;Engineering Applications of Artificial Intelligence;2023-01
4. Ontological query answering under many-valued group preferences in Datalog+/–;International Journal of Approximate Reasoning;2018-02
5. A Method for Unifying the Representations of Domain Knowledge and Planning Algorithm in Hierarchical Task Network;International Journal of Pattern Recognition and Artificial Intelligence;2017-05-09