Investigating the impact of calibration on the quality of explanations-Reference-Cited by-同舟云学术

Investigating the impact of calibration on the quality of explanations

Published:2023-03-13 Issue: Volume: Page:
ISSN:1012-2443
Container-title:Annals of Mathematics and Artificial Intelligence
language:en
Short-container-title:Ann Math Artif Intell

Author:

Löfström Helena^ORCID,Löfström Tuwe,Johansson Ulf,Sönströd Cecilia

Abstract

AbstractPredictive models used in Decision Support Systems (DSS) are often requested to explain the reasoning to users. Explanations of instances consist of two parts; the predicted label with an associated certainty and a set of weights, one per feature, describing how each feature contributes to the prediction for the particular instance. In techniques like Local Interpretable Model-agnostic Explanations (LIME), the probability estimate from the underlying model is used as a measurement of certainty; consequently, the feature weights represent how each feature contributes to the probability estimate. It is, however, well-known that probability estimates from classifiers are often poorly calibrated, i.e., the probability estimates do not correspond to the actual probabilities of being correct. With this in mind, explanations from techniques like LIME risk becoming misleading since the feature weights will only describe how each feature contributes to the possibly inaccurate probability estimate. This paper investigates the impact of calibrating predictive models before applying LIME. The study includes 25 benchmark data sets, using Random forest and Extreme Gradient Boosting (xGBoost) as learners and Venn-Abers and Platt scaling as calibration methods. Results from the study show that explanations of better calibrated models are themselves better calibrated, with ECE and log loss for the explanations after calibration becoming more conformed to the model ECE and log loss. The conclusion is that calibration makes the models and the explanations better by accurately representing reality.

Funder

Stiftelsen för Kunskaps- och Kompetensutveckling

University of Boras

Publisher

Springer Science and Business Media LLC

Subject

Applied Mathematics,Artificial Intelligence

Link

https://link.springer.com/content/pdf/10.1007/s10472-023-09837-2.pdf

Reference32 articles.

1. High-Level Expert Group on AI: Ethics Guidelines for Trustworthy AI. Report, European Commission, Brussels (2019)

2. Muhlbacher, T., Piringer, H., Gratzl, S., Sedlmair, M., Streit, M.: Opening the black box: Strategies for increased user involvement in existing algorithm implementations. IEEE Trans. Vis. Comput. Graph. 20(12), 1643–1652 (2014)

3. Freitas, A.A.: Comprehensible Classification Models—a position paper. SigKDD Explor. 15(1), 1–10 (2014)

4. Rudin, C.: Algorithms for interpretable machine learning. In: Proc. of the 20th ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining, pp 1519–1519 (2014)

5. Ribeiro, M.T., Singh, S., Guestrin, C.: “Why Should I Trust You?”: Explaining the predictions of any classifier. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. KDD ’16, pp 1135–1144. Association for Computing Machinery (2016). https://doi.org/10.1145/2939672.2939778

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Identifying Frailty in Older Adults Receiving Home Care Assessment Using Machine Learning: Longitudinal Observational Study on the Role of Classifier, Feature Selection, and Sample Size;JMIR AI;2024-01-31

2. Explainable AI Evaluation: A Top-Down Approach for Selecting Optimal Explanations for Black Box Models;Information;2023-12-20