Towards reconciling usability and usefulness of policy explanations for sequential decision-making systems-Reference-Cited by-同舟云学术

Towards reconciling usability and usefulness of policy explanations for sequential decision-making systems

Published:2024-07-22 Issue: Volume:11 Page:
ISSN:2296-9144
Container-title:Frontiers in Robotics and AI
language:
Short-container-title:Front. Robot. AI

Author:

Tambwekar Pradyumna,Gombolay Matthew

Abstract

Safefy-critical domains often employ autonomous agents which follow a sequential decision-making setup, whereby the agent follows a policy to dictate the appropriate action at each step. AI-practitioners often employ reinforcement learning algorithms to allow an agent to find the best policy. However, sequential systems often lack clear and immediate signs of wrong actions, with consequences visible only in hindsight, making it difficult to humans to understand system failure. In reinforcement learning, this is referred to as the credit assignment problem. To effectively collaborate with an autonomous system, particularly in a safety-critical setting, explanations should enable a user to better understand the policy of the agent and predict system behavior so that users are cognizant of potential failures and these failures can be diagnosed and mitigated. However, humans are diverse and have innate biases or preferences which may enhance or impair the utility of a policy explanation of a sequential agent. Therefore, in this paper, we designed and conducted human-subjects experiment to identify the factors which influence the perceived usability with the objective usefulness of policy explanations for reinforcement learning agents in a sequential setting. Our study had two factors: the modality of policy explanation shown to the user (Tree, Text, Modified Text, and Programs) and the “first impression” of the agent, i.e., whether the user saw the agent succeed or fail in the introductory calibration video. Our findings characterize a preference-performance tradeoff wherein participants perceived language-based policy explanations to be significantly more useable; however, participants were better able to objectively predict the agent’s behavior when provided an explanation in the form of a decision tree. Our results demonstrate that user-specific factors, such as computer science experience (p < 0.05), and situational factors, such as watching agent crash (p < 0.05), can significantly impact the perception and usefulness of the explanation. This research provides key insights to alleviate prevalent issues regarding innapropriate compliance and reliance, which are exponentially more detrimental in safety-critical settings, providing a path forward for XAI developers for future work on policy-explanations.

Publisher

Frontiers Media SA

Reference115 articles.

1. Apprenticeship learning via inverse reinforcement learning;Abbeel,2004

2. Sanity checks for saliency maps;Adebayo;Adv. Neural Inf. Process. Syst.,2018

3. Summarizing agent strategies;Amir;Aut. Agents Multi-Agent Syst.,2019

4. Mental models of mere mortals with explanations of reinforcement learning;Anderson;ACM Trans. Interact. Intell. Syst.,2020

5. Explainable agents and robots: results from a systematic literature review;Anjomshoae,2019