Explainable Deep Reinforcement Learning: State of the Art and Challenges

Published:2022-12-03 Issue:5 Volume:55 Page:1-39
ISSN:0360-0300
Container-title:ACM Computing Surveys
language:en
Short-container-title:ACM Comput. Surv.

Author:

Vouros George A.¹

Affiliation:

1. University of Piraeus, Piraeus, Greece

Abstract

Interpretability, explainability, and transparency are key issues to introducing artificial intelligence methods in many critical domains. This is important due to ethical concerns and trust issues strongly connected to reliability, robustness, auditability, and fairness, and has important consequences toward keeping the human in the loop in high levels of automation, especially in critical cases for decision making, where both (human and the machine) play important roles. Although the research community has given much attention to explainability of closed (or black) prediction boxes, there are tremendous needs for explainability of closed-box methods that support agents to act autonomously in the real world. Reinforcement learning methods, and especially their deep versions, are such closed-box methods. In this article, we aim to provide a review of state-of-the-art methods for explainable deep reinforcement learning methods, taking also into account the needs of human operators—that is, of those who make the actual and critical decisions in solving real-world problems. We provide a formal specification of the deep reinforcement learning explainability problems, and we identify the necessary components of a general explainable reinforcement learning framework. Based on these, we provide a comprehensive review of state-of-the-art methods, categorizing them into classes according to the paradigm they follow, the interpretable models they use, and the surface representation of explanations provided. The article concludes by identifying open questions and important challenges.

Funder

TAPAS

Towards an Automated and exPlainable Air traffic management (ATM) System

Publisher

Association for Computing Machinery (ACM)

Subject

General Computer Science, Theoretical Computer Science

Link

https://dl.acm.org/doi/pdf/10.1145/3527448

References: 70 articles.

1. E. Puiutta and E. M. S. P. Veith. 2020. Explainable Reinforcement Learning: A Survey. arXiv:2005.06247 (2020).

2. M. T. Ribeiro, S. Singh, and C. Guestrin. 2016. Why should I trust you?: Explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD). ACM, New York, NY, 1135–1144.

3. R. Iyer, Y. Li, H. Li, M. Lewis, R. Sundar, and K. Sycara. 2018. Transparency and explanation in deep reinforcement learning neural networks. In Proceedings of the 2018 AAAI/ACM Conference on AI, Ethics, and Society.

4. R. Pocius, L. Neal, and A. Fern. 2019. Strategic tasks for explainable reinforcement learning. Proceedings of the AAAI Conference on Artificial Intelligence 33, 1 (2019), AAAI-19, IAAI-19, EAAI-20.

5. W. Shi, S. Song, Z. Wang, and G. Huang. 2020. Self-supervised discovering of causal features: Towards interpretable reinforcement learning. arXiv:2003.07069v2 (2020).

Cited by 44 articles.

1. Resilience-based explainable reinforcement learning in chemical process safety; Computers & Chemical Engineering; 2024-12

2. Explainable artificial intelligence: A survey of needs, techniques, applications, and future direction; Neurocomputing; 2024-09

3. XRL-Bench: A Benchmark for Evaluating and Comparing Explainable Reinforcement Learning Techniques; Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining; 2024-08-24

4. Indoor Infrared Sensor Layout Optimization for Elderly Monitoring Based on Fused Genetic Gray Wolf Optimization (FGGWO) Algorithm; Sensors; 2024-08-21

5. Explainable, Deep Reinforcement Learning–Based Decision Making for Operations and Maintenance; Nuclear Technology; 2024-08-02