Declarative Approaches to Counterfactual Explanations for Classification-Reference-Cited by-同舟云学术

Declarative Approaches to Counterfactual Explanations for Classification

Published:2021-12-27 Issue: Volume: Page:1-35
ISSN:1471-0684
Container-title:Theory and Practice of Logic Programming
language:en
Short-container-title:Theory and Practice of Logic Programming

Author:

BERTOSSI LEOPOLDO^ORCID

Abstract

Abstract We propose answer-set programs that specify and compute counterfactual interventions on entities that are input on a classification model. In relation to the outcome of the model, the resulting counterfactual entities serve as a basis for the definition and computation of causality-based explanation scores for the feature values in the entity under classification, namely responsibility scores. The approach and the programs can be applied with black-box models, and also with models that can be specified as logic programs, such as rule-based classifiers. The main focus of this study is on the specification and computation of best counterfactual entities, that is, those that lead to maximum responsibility scores. From them one can read off the explanations as maximum responsibility feature values in the original entity. We also extend the programs to bring into the picture semantic or domain knowledge. We show how the approach could be extended by means of probabilistic methods, and how the underlying probability distributions could be modified through the use of constraints. Several examples of programs written in the syntax of the DLV ASP-solver, and run with it, are shown.

Publisher

Cambridge University Press (CUP)

Subject

Artificial Intelligence,Computational Theory and Mathematics,Hardware and Architecture,Theoretical Computer Science,Software

Reference59 articles.

1. Molnar, C. 2020. Interpretable Machine Learning: A Guide for Making Black Box Models Explainable. https://christophm.github.io/interpretable-ml-book

2. Machine Learning

3. Ignatiev, A. , Narodytska, N. and Marques-Silva, J. 2019. Abduction-based explanations for machine learning models. In Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, AAAI 2019, AAAI Press, 1511–1519.

4. Arenas, M. , Pablo Barceló, P. , Bertossi, L. and Monet, M. 2012. The tractability of shap-scores over deterministic and decomposable boolean circuits. In Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, AAAI 2021. AAAI Press, 6670–6678.

5. Propositional semantics for disjunctive logic programs

Cited by 8 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Counterfactual Explanations and Algorithmic Recourses for Machine Learning: A Review;ACM Computing Surveys;2024-07-09

2. The Generalized Causal-Effect Score in Data Management (short paper);Proceedings of the Conference on Governance, Understanding and Integration of Data for Effective and Responsible AI;2024-06-09

3. Efficient Computation of Shap Explanation Scores for Neural Network Classifiers via Knowledge Compilation;Logics in Artificial Intelligence;2023

4. Attribution-Scores and Causal Counterfactuals as Explanations in Artificial Intelligence;Reasoning Web. Causality, Explanations and Declarative Knowledge;2023

5. Attribution-Scores in Data Management and Explainable Machine Learning;Advances in Databases and Information Systems;2023