Explainable Reinforcement Learning: A Survey and Comparative Review

Author:

Milani Stephanie1,Topin Nicholay2,Veloso Manuela3,Fang Fei1

Affiliation:

1. Carnegie Mellon University, United States

2. Inpleo, Inc., United States

3. J. P. Morgan AI Research, United States

Abstract

Explainable reinforcement learning (XRL) is an emerging subfield of explainable machine learning that has attracted considerable attention in recent years. The goal of XRL is to elucidate the decision-making process of reinforcement learning (RL) agents in sequential decision-making settings. Equipped with this information, practitioners can better understand important questions about RL agents (especially those deployed in the real world), such as what the agents will do and why. Despite increased interest, there exists a gap in the literature for organizing the plethora of papers — especially in a way that centers the sequential decision-making nature of the problem. In this survey, we propose a novel taxonomy for organizing the XRL literature that prioritizes the RL setting. We propose three high-level categories: feature importance, learning process and Markov decision process, and policy-level. We overview techniques according to this taxonomy, highlighting challenges and opportunities for future work. We conclude by using these gaps to motivate and outline a roadmap for future work.

Publisher

Association for Computing Machinery (ACM)

Subject

General Computer Science,Theoretical Computer Science

Reference170 articles.

1. Apprenticeship learning via inverse reinforcement learning

2. COGAM: Measuring and Moderating Cognitive Load in Machine Learning Model Explanations

3. David Abel. 2022. A theory of abstraction in reinforcement learning. arXiv preprint arXiv:2203.00397(2022). David Abel. 2022. A theory of abstraction in reinforcement learning. arXiv preprint arXiv:2203.00397(2022).

4. Test, Measurement, and Evaluation: Understanding and Use of the Concepts in Education;Adom Dickson;International Journal of Evaluation and Research in Education,2020

5. Reinforcement Learning based Recommender Systems: A Survey

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3