Hierarchical goals contextualize local reward decomposition explanations

Published: 2022-05-12
ISSN: 0941-0643
Container-title: Neural Computing and Applications
Language: en
Short-container-title: Neural Comput & Applic

Author:

Rietz Finn, Magg Sven, Heintz Fredrik, Stoyanov Todor, Wermter Stefan, Stork Johannes A.

Abstract

One-step reinforcement learning explanation methods account for individual actions but fail to consider the agent’s future behavior, which can make their interpretation ambiguous. We propose to address this limitation by providing hierarchical goals as context for one-step explanations. By considering the current hierarchical goal as context, one-step explanations can be interpreted with higher certainty, as the agent’s future behavior is more predictable. We combine reward decomposition with hierarchical reinforcement learning into a novel explainable reinforcement learning framework, which yields more interpretable, goal-contextualized one-step explanations. With a qualitative analysis of one-step reward decomposition explanations, we first show that their interpretability is indeed limited in scenarios with multiple, different optimal policies, a characteristic shared by other one-step explanation methods. Then, we show that our framework retains high interpretability in such cases, as the hierarchical goal can be considered as context for the explanation. To the best of our knowledge, our work is the first to investigate hierarchical goals not as an explanation directly but as additional context for one-step reinforcement learning explanations.
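As a rough illustration of the idea in the abstract, the sketch below shows a one-step explanation computed from decomposed Q-values and read in the context of a goal. It is a minimal toy under stated assumptions, not the authors' framework: the goal names, reward components, action names, Q-values, and both helper functions are hypothetical and chosen only for illustration. It assumes the total Q-value is the sum of per-component Q-values, and it explains a choice by the per-component difference between the chosen action and an alternative.

```python
# Hypothetical decomposed Q-values for a single state.
# q[goal][component] is a list of values indexed by action.
ACTIONS = ["left", "right"]
COMPONENTS = ["progress", "safety"]

q = {
    "reach_A": {"progress": [2.0, 0.5], "safety": [-0.5, -0.25]},
    "reach_B": {"progress": [0.5, 2.0], "safety": [-0.25, -0.5]},
}

def greedy_action(goal):
    """Return the index of the action with the highest summed Q-value under a goal."""
    totals = [sum(q[goal][c][a] for c in COMPONENTS) for a in range(len(ACTIONS))]
    return max(range(len(ACTIONS)), key=lambda a: totals[a])

def component_differences(goal, chosen, alternative):
    """Per-component Q-value advantage of the chosen action over an alternative."""
    return {c: q[goal][c][chosen] - q[goal][c][alternative] for c in COMPONENTS}

for goal in q:
    chosen = greedy_action(goal)
    alternative = 1 - chosen  # the only other action in this two-action toy
    diffs = component_differences(goal, chosen, alternative)
    print(f"goal={goal}: chose {ACTIONS[chosen]} over {ACTIONS[alternative]}: {diffs}")
```

In this toy, the same state yields opposite actions under the two goals while the per-component differences are identical, so the component values alone do not tell a reader where the agent is heading. Stating the active goal alongside them removes that ambiguity, which is the kind of context the abstract argues for.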

Funder

Bundesministerium für Wirtschaft und Energie

Knut och Alice Wallenbergs Stiftelse

Örebro University

Publisher

Springer Science and Business Media LLC

Subject

Artificial Intelligence, Software

Link

https://link.springer.com/content/pdf/10.1007/s00521-022-07280-8.pdf
