Abstract
Deep Reinforcement Learning (RL) is a black-box method: the agent's behavior is hard to understand because its policy is encoded in a neural network (NN). To explain the behavior and decisions of the agent, various eXplainable RL (XRL) methods have been developed; for example, feature-importance methods analyze the contribution of the inputs of the model, while reward-decomposition methods explain the components at the output end of the RL model. In this study, we present a novel method that connects explanations from both the input and output ends of a black-box model, resulting in fine-grained explanations. Our method exposes the reward prioritization to the user, which in turn generates two different levels of explanation and allows the RL agent to be reconfigured when unwanted behaviors are observed. The method further summarizes the detailed explanations into a focus value that takes all reward components into account and quantifies the fulfillment of the desired properties. We evaluated our method by applying it to a remote electrical telecom-antenna-tilt use case and two OpenAI Gym environments: Lunar Lander and CartPole. The results demonstrated fine-grained explanations by detailing input features' contributions to particular reward components, and revealed biases among the reward components, which were then addressed by adjusting the reward weights.
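The abstract's core mechanism, reward decomposition with user-adjustable priorities, can be sketched as follows. This is a minimal illustrative example, not the authors' implementation: the component names, Q-values, and weight vectors are all hypothetical, and the "reconfiguration" is shown simply as re-weighting the decomposed Q-values before action selection.

```python
import numpy as np

# Hypothetical decomposed Q-values for one state: rows are reward
# components (e.g. coverage, quality, interference in an antenna-tilt
# setting), columns are actions. Values are illustrative only.
q_components = np.array([
    [0.8, 0.2],   # component 0
    [0.1, 0.9],   # component 1
    [0.5, 0.4],   # component 2
])

def select_action(q_components, weights):
    """Greedy action w.r.t. the weighted sum of decomposed Q-values.

    Exposing `weights` to the user is what makes the reward
    prioritization inspectable and adjustable.
    """
    q_total = weights @ q_components          # shape: (n_actions,)
    return int(np.argmax(q_total)), q_total

# Default prioritization: all components weighted equally.
action, q_total = select_action(q_components, np.array([1.0, 1.0, 1.0]))

# If a component is found to dominate (a bias), the user can re-weight
# it; here the new prioritization changes the selected action.
action_reweighted, _ = select_action(q_components, np.array([2.0, 0.1, 1.0]))
```

With equal weights the total Q-values are `[1.4, 1.5]`, so action 1 is chosen; after re-weighting in favor of component 0, the total becomes `[2.11, 0.89]` and action 0 is chosen instead, illustrating how adjusting reward weights can correct an observed bias.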
Funder
Knut and Alice Wallenberg Foundation
Subject
Fluid Flow and Transfer Processes, Computer Science Applications, Process Chemistry and Technology, General Engineering, Instrumentation, General Materials Science
Cited by
3 articles.