Stable and actionable explanations of black-box models through factual and counterfactual rules-Reference-Cited by-同舟云学术

Stable and actionable explanations of black-box models through factual and counterfactual rules

Published:2022-11-14 Issue: Volume: Page:
ISSN:1384-5810
Container-title:Data Mining and Knowledge Discovery
language:en
Short-container-title:Data Min Knowl Disc

Author:

Guidotti Riccardo^ORCID,Monreale Anna,Ruggieri Salvatore,Naretto Francesca,Turini Franco,Pedreschi Dino,Giannotti Fosca

Abstract

AbstractRecent years have witnessed the rise of accurate but obscure classification models that hide the logic of their internal decision processes. Explaining the decision taken by a black-box classifier on a specific input instance is therefore of striking interest. We propose a local rule-based model-agnostic explanation method providing stable and actionable explanations. An explanation consists of a factual logic rule, stating the reasons for the black-box decision, and a set of actionable counterfactual logic rules, proactively suggesting the changes in the instance that lead to a different outcome. Explanations are computed from a decision tree that mimics the behavior of the black-box locally to the instance to explain. The decision tree is obtained through a bagging-like approach that favors stability and fidelity: first, an ensemble of decision trees is learned from neighborhoods of the instance under investigation; then, the ensemble is merged into a single decision tree. Neighbor instances are synthetically generated through a genetic algorithm whose fitness function is driven by the black-box behavior. Experiments show that the proposed method advances the state-of-the-art towards a comprehensive approach that successfully covers stability and actionability of factual and counterfactual explanations.

Funder

SoBigData++

HumanE AI Net

TAILOR

XAI

NoBIAS

SAI

Publisher

Springer Science and Business Media LLC

Subject

Computer Networks and Communications,Computer Science Applications,Information Systems

Link

https://link.springer.com/content/pdf/10.1007/s10618-022-00878-5.pdf

Reference82 articles.

1. Adadi A, Berrada M (2018) Peeking inside the black-box: a survey on explainable artificial intelligence (XAI). IEEE Access 6:52138–52160

2. Alvarez-Melis D, Jaakkola TS (2018) Towards robust interpretability with self-explaining neural networks. In: NeurIPS, pp 7786–7795

3. Angelino E, Larus-Stone N, Alabi D, Seltzer MI, Rudin C (2017) Learning certifiably optimal rule lists for categorical data. J Mach Learn Res 18:234:1-234:78

4. Assche AV, Blockeel H (2007) Seeing the forest through the trees: learning a comprehensible model from an ensemble. In: ECML. Lecture notes in computer science, vol 4701. Springer, pp 418–429

5. Bäck T, Fogel DB, Michalewicz Z (2000) Evolutionary computation 1: basic algorithms and operators, vol 1. CRC Press, Boca Raton

Cited by 10 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. On the failings of Shapley values for explainability;International Journal of Approximate Reasoning;2024-08

2. Explainable and interpretable machine learning and data mining;Data Mining and Knowledge Discovery;2024-07-30

3. GLOR-FLEX: Local to Global Rule-Based EXplanations for Federated Learning;2024 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE);2024-06-30

4. An Advanced Explainable Belief Rule-Based Framework to Predict the Energy Consumption of Buildings;Energies;2024-04-09

5. Rheological‐based digital approach for gel curve analysis of alcohol ethoxylates;Journal of Surfactants and Detergents;2024-04