Exploring Computational User Models for Agent Policy Summarization-Reference-Cited by-同舟云学术

Exploring Computational User Models for Agent Policy Summarization

Published:2019-08 Issue: Volume: Page:
ISSN:
Container-title:Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence
language:
Short-container-title:

Author:

Lage Isaac¹,Lifschitz Daphna²,Doshi-Velez Finale¹,Amir Ofra²

Affiliation:

1. Harvard University

2. Technion - Israel Institute of Technology

Abstract

AI agents support high stakes decision-making processes from driving cars to prescribing drugs, making it increasingly important for human users to understand their behavior. Policy summarization methods aim to convey strengths and weaknesses of such agents by demonstrating their behavior in a subset of informative states. Some policy summarization methods extract a summary that optimizes the ability to reconstruct the agent's policy under the assumption that users will deploy inverse reinforcement learning. In this paper, we explore the use of different models for extracting summaries. We introduce an imitation learning-based approach to policy summarization; we demonstrate through computational simulations that a mismatch between the model used to extract a summary and the model used to reconstruct the policy results in worse reconstruction quality; and we demonstrate through a human-subject study that people use different models to reconstruct policies in different contexts, and that matching the summary extraction model to these can improve performance. Together, our results suggest that it is important to carefully consider user models in policy summarization.

Publisher

International Joint Conferences on Artificial Intelligence Organization

Cited by 15 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Data-Driven Policy Learning Methods from Biological Behavior: A Systematic Review;Applied Sciences;2024-05-09

2. Explainable reinforcement learning (XRL): a systematic literature review and taxonomy;Machine Learning;2023-11-29

3. Natural Language Specification of Reinforcement Learning Policies Through Differentiable Decision Trees;IEEE Robotics and Automation Letters;2023-06

4. Explainable reinforcement learning for broad-XAI: a conceptual framework and survey;Neural Computing and Applications;2023-03-06

5. Contrastive Visual Explanations for Reinforcement Learning via Counterfactual Rewards;Communications in Computer and Information Science;2023