Abstract
Explainability has emerged as a critical AI research objective, but the breadth of proposed methods and application domains suggests that criteria for explanation vary greatly. In particular, what counts as a good explanation, and what kinds of explanation are computationally feasible, have become trickier in light of opaque “black box” systems such as deep neural networks. Explanation in such cases has drifted from what many philosophers stipulated as having to involve deductive and causal principles to mere “interpretation,” which approximates what happened in the target system to varying degrees. However, such post hoc constructed rationalizations are highly problematic for social robots that operate interactively in spaces shared with humans. For in such social contexts, explanations of behavior, and, in particular, justifications for violations of expected behavior, should make reference to socially accepted principles and norms.
In this article, we show how a social robot’s actions can face explanatory demands for how it came to act on its decision, what goals, tasks, or purposes its design had those actions pursue, and what norms or social constraints the system recognizes in the course of its action. As a result, we argue that explanations for social robots will need to be accurate representations of the system’s operation along causal, purposive, and justificatory lines. These explanations will need to generate appropriate references to principles and norms—explanations based on mere “interpretability” will ultimately fail to connect the robot’s behaviors to its appropriate determinants. We then lay out the foundations for a cognitive robotic architecture for HRI, together with particular component algorithms, for generating explanations and engaging in justificatory dialogues with human interactants. Such explanations track the robot’s actual decision-making and behavior, which themselves are determined by normative principles the robot can describe and use for justifications.
Funder
NASA Engineering and Safety Center
NSF
Publisher
Association for Computing Machinery (ACM)
Subject
Artificial Intelligence, Human-Computer Interaction
Cited by 7 articles.