Right for the Right Reasons: Training Differentiable Models by Constraining their Explanations-Reference-Cited by-同舟云学术

Right for the Right Reasons: Training Differentiable Models by Constraining their Explanations

Published:2017-08 Issue: Volume: Page:
ISSN:
Container-title:Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence
language:
Short-container-title:

Author:

Ross Andrew Slavin¹,Hughes Michael C.¹,Doshi-Velez Finale¹

Affiliation:

1. Harvard University

Abstract

Expressive classifiers such as neural networks are among the most accurate supervised learning methods in use today, but their opaque decision boundaries make them difficult to trust in critical applications. We propose a method to explain the predictions of any differentiable model via the gradient of the class label with respect to the input (which provides a normal to the decision boundary). Not only is this approach orders of magnitude faster at identifying input dimensions of high sensitivity than sample-based perturbation methods (e.g. LIME), but it also lends itself to efficiently discovering multiple qualitatively different decision boundaries as well as decision boundaries that are consistent with expert annotation. On multiple datasets, we show our approach generalizes much better when test conditions differ from those in training.

Publisher

International Joint Conferences on Artificial Intelligence Organization

Cited by 139 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Ex-Fuzzy: A library for symbolic explainable AI through fuzzy logic programming;Neurocomputing;2024-09

2. May I Ask a Follow-up Question? Understanding the Benefits of Conversations in Neural Network Explainability;International Journal of Human–Computer Interaction;2024-08-08

3. A comprehensive multi-task deep learning approach for predicting metabolic syndrome with genetic, nutritional, and clinical data;Scientific Reports;2024-08-01

4. Gradient-based explanation for non-linear non-parametric dimensionality reduction;Data Mining and Knowledge Discovery;2024-07-11

5. Learning the Irreversible Progression Trajectory of Alzheimer’s Disease;2024 IEEE International Symposium on Biomedical Imaging (ISBI);2024-05-27