Explainable Reinforcement Learning: A Survey and Comparative Review-Reference-Cited by-同舟云学术

Explainable Reinforcement Learning: A Survey and Comparative Review

Published:2023-08-26 Issue: Volume: Page:
ISSN:0360-0300
Container-title:ACM Computing Surveys
language:en
Short-container-title:ACM Comput. Surv.

Author:

Milani Stephanie¹,Topin Nicholay²,Veloso Manuela³,Fang Fei¹

Affiliation:

1. Carnegie Mellon University, United States

2. Inpleo, Inc., United States

3. J. P. Morgan AI Research, United States

Abstract

Explainable reinforcement learning (XRL) is an emerging subfield of explainable machine learning that has attracted considerable attention in recent years. The goal of XRL is to elucidate the decision-making process of reinforcement learning (RL) agents in sequential decision-making settings. Equipped with this information, practitioners can better understand important questions about RL agents (especially those deployed in the real world), such as what the agents will do and why. Despite increased interest, there exists a gap in the literature for organizing the plethora of papers — especially in a way that centers the sequential decision-making nature of the problem. In this survey, we propose a novel taxonomy for organizing the XRL literature that prioritizes the RL setting. We propose three high-level categories: feature importance, learning process and Markov decision process, and policy-level. We overview techniques according to this taxonomy, highlighting challenges and opportunities for future work. We conclude by using these gaps to motivate and outline a roadmap for future work.

Publisher

Association for Computing Machinery (ACM)

Subject

General Computer Science,Theoretical Computer Science

Link

https://dl.acm.org/doi/pdf/10.1145/3616864

Reference170 articles.

1. Apprenticeship learning via inverse reinforcement learning

2. COGAM: Measuring and Moderating Cognitive Load in Machine Learning Model Explanations

3. David Abel. 2022. A theory of abstraction in reinforcement learning. arXiv preprint arXiv:2203.00397(2022). David Abel. 2022. A theory of abstraction in reinforcement learning. arXiv preprint arXiv:2203.00397(2022).

4. Test, Measurement, and Evaluation: Understanding and Use of the Concepts in Education;Adom Dickson;International Journal of Evaluation and Research in Education,2020

5. Reinforcement Learning based Recommender Systems: A Survey

Cited by 8 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. LearningTuple: A packet classification scheme with high classification and high update;Computer Networks;2024-12

2. Integrating machine learning and biosensors in microfluidic devices: A review;Biosensors and Bioelectronics;2024-11

3. Explainable Reinforcement Learning for Network Management via Surrogate Model;2024 IEEE 44th International Conference on Distributed Computing Systems Workshops (ICDCSW);2024-07-23

4. A Review on the Form and Complexity of Human–Robot Interaction in the Evolution of Autonomous Surgery;Advanced Intelligent Systems;2024-07-09

5. Why Reinforcement Learning?;Algorithms;2024-06-20