Learning and planning with logical automata

Published:2021-08-13 Issue:7 Volume:45 Page:1013-1028
ISSN:0929-5593
Container-title:Autonomous Robots
language:en
Short-container-title:Auton Robot

Author:

Araki Brandon, Vodrahalli Kiran, Leech Thomas, Vasile Cristian-Ioan, Donahue Mark, Rus Daniela

Abstract

We introduce a method to learn policies from expert demonstrations that are interpretable and manipulable. We achieve interpretability by modeling the interactions between high-level actions as an automaton with connections to formal logic. We achieve manipulability by integrating this automaton into planning via Logical Value Iteration, so that changes to the automaton have predictable effects on the learned behavior. These qualities allow a human user to first understand what the model has learned, and then either correct the learned behavior or zero-shot generalize to new, similar tasks. Our inference method requires only low-level trajectories and a description of the environment in order to learn high-level rules. We achieve this by using a deep Bayesian nonparametric hierarchical model. We test our model on several domains of interest and also show results for a real-world implementation on a mobile robotic arm platform for lunchbox-packing and cabinet-opening tasks.

Funder

National Science Foundation

Office of Naval Research

U.S. Air Force

Publisher

Springer Science and Business Media LLC

Subject

Artificial Intelligence

Link

https://link.springer.com/content/pdf/10.1007/s10514-021-09993-6.pdf

References: 61 articles.

1. Abbeel, P., & Ng, A. Y. (2004). Apprenticeship learning via inverse reinforcement learning. In ICML’04 international conference on machine learning.

2. Andreas, J., Klein, D., & Levine, S. (2016). Modular multitask reinforcement learning with policy sketches. ArXiv e-prints, arXiv:1611.01796.

3. Angluin, D. (1987). Learning regular sets from queries and counterexamples. Information and Computation, 75(2), 87–106.

4. Araki, B., Vodrahalli, K., Leech, T., Vasile, C. I., Donahue, M., & Rus, D. (2019). Learning to plan with logical automata. Robotics: Science and Systems.

5. Araki, B., Vodrahalli, K., Leech, T., Vasile, C. I., Donahue, M., & Rus, D. (2020). Deep Bayesian nonparametric learning of rules and plans from demonstrations with a learned automaton prior. Proceedings of the AAAI Conference on Artificial Intelligence, 34, 10026–10034.

Cited by 1 article.

1. Exploiting Transformer in Sparse Reward Reinforcement Learning for Interpretable Temporal Logic Motion Planning. IEEE Robotics and Automation Letters, 2023-08.