Explainable reinforcement learning for broad-XAI: a conceptual framework and survey-Reference-Cited by-同舟云学术

Explainable reinforcement learning for broad-XAI: a conceptual framework and survey

Published:2023-03-06 Issue:23 Volume:35 Page:16893-16916
ISSN:0941-0643
Container-title:Neural Computing and Applications
language:en
Short-container-title:Neural Comput & Applic

Author:

Dazeley Richard^ORCID,Vamplew Peter,Cruz Francisco

Abstract

AbstractBroad-XAI moves away from interpreting individual decisions based on a single datum and aims to provide integrated explanations from multiple machine learning algorithms into a coherent explanation of an agent’s behaviour that is aligned to the communication needs of the explainee. Reinforcement Learning (RL) methods, we propose, provide a potential backbone for the cognitive model required for the development of Broad-XAI. RL represents a suite of approaches that have had increasing success in solving a range of sequential decision-making problems. However, these algorithms operate as black-box problem solvers, where they obfuscate their decision-making policy through a complex array of values and functions. EXplainable RL (XRL) aims to develop techniques to extract concepts from the agent’s: perception of the environment; intrinsic/extrinsic motivations/beliefs; Q-values, goals and objectives. This paper aims to introduce the Causal XRL Framework (CXF), that unifies the current XRL research and uses RL as a backbone to the development of Broad-XAI. CXF is designed to incorporate many standard RL extensions and integrated with external ontologies and communication facilities so that the agent can answer questions that explain outcomes its decisions. This paper aims to: establish XRL as a distinct branch of XAI; introduce a conceptual framework for XRL; review existing approaches explaining agent behaviour; and identify opportunities for future research. Finally, this paper discusses how additional information can be extracted and ultimately integrated into models of communication, facilitating the development of Broad-XAI.

Funder

Deakin University

Publisher

Springer Science and Business Media LLC

Subject

Artificial Intelligence,Software

Link

https://link.springer.com/content/pdf/10.1007/s00521-023-08423-1.pdf

Reference248 articles.

1. Silver D, Huang A, Maddison CJ, Guez A, Sifre L, Van Den Driessche G, Schrittwieser J, Antonoglou I, Panneershelvam V, Lanctot M et al (2016) Mastering the game of Go with deep neural networks and tree search. Nature 529(7587):484–489

2. Huval B, Wang T, Tandon S, Kiske J, Song W, Pazhayampallil J, Andriluka M, Rajpurkar P, Migimatsu T, Cheng-Yue R, et al (2015) “An empirical evaluation of deep learning on highway driving,” http://arxiv.org/abs/1504.01716

3. Mnih V, Kavukcuoglu K, Silver D, Rusu AA, Veness J, Bellemare MG, Graves A, Riedmiller M, Fidjeland AK, Ostrovski G et al (2015) Human-level control through deep reinforcement learning. Nature 518(7540):529–533

4. Knight W (2017) “Reinforcement learning: By experimenting, computers are figuring out how to do things that no programmer could teach them,”. accessed: 2019-10-06

5. Metz C (2017) “In two moves, AlphaGo and Lee Sedol redefined the future,”. accessed: 2019-10-06

Cited by 12 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. LIME-Mine: Explainable Machine Learning for User Behavior Analysis in IoT Applications;Electronics;2024-08-15

2. Rethinking high-resolution remote sensing image segmentation not limited to technology: a review of segmentation methods and outlook on technical interpretability;International Journal of Remote Sensing;2024-05-21

3. Redefining Counterfactual Explanations for Reinforcement Learning: Overview, Challenges and Opportunities;ACM Computing Surveys;2024-04-24

4. PCaLDI: Explainable Similarity and Distance Metrics Using Principal Component Analysis Loadings for Feature Importance;IEEE Access;2024

5. A Review of Explainable Recommender Systems Utilizing Knowledge Graphs and Reinforcement Learning;IEEE Access;2024