Definitions, methods, and applications in interpretable machine learning-Reference-Cited by-同舟云学术

Definitions, methods, and applications in interpretable machine learning

Published:2019-10-16 Issue:44 Volume:116 Page:22071-22080
ISSN:0027-8424
Container-title:Proceedings of the National Academy of Sciences
language:en
Short-container-title:Proc Natl Acad Sci USA

Author:

Murdoch W. James,Singh Chandan,Kumbier Karl,Abbasi-Asl Reza,Yu Bin

Abstract

Machine-learning models have demonstrated great success in learning complex patterns that enable them to make predictions about unobserved data. In addition to using models for prediction, the ability to interpret what a model has learned is receiving an increasing amount of attention. However, this increased focus has led to considerable confusion about the notion of interpretability. In particular, it is unclear how the wide array of proposed interpretation methods are related and what common concepts can be used to evaluate them. We aim to address these concerns by defining interpretability in the context of machine learning and introducing the predictive, descriptive, relevant (PDR) framework for discussing interpretations. The PDR framework provides 3 overarching desiderata for evaluation: predictive accuracy, descriptive accuracy, and relevancy, with relevancy judged relative to a human audience. Moreover, to help manage the deluge of interpretation methods, we introduce a categorization of existing techniques into model-based and post hoc categories, with subgroups including sparsity, modularity, and simulatability. To demonstrate how practitioners can use the PDR framework to evaluate and understand interpretations, we provide numerous real-world examples. These examples highlight the often underappreciated role played by human audiences in discussions of interpretability. Finally, based on our framework, we discuss limitations of existing methods and directions for future work. We hope that this work will provide a common vocabulary that will make it easier for both practitioners and researchers to discuss and choose from the full range of interpretation methods.

Funder

National Science Foundation

Gouvernement du Canada | Natural Sciences and Engineering Research Council of Canada

DOD | United States Navy | Office of Naval Research

DOD | United States Army | RDECOM | Army Research Office

Publisher

Proceedings of the National Academy of Sciences

Subject

Multidisciplinary

Reference101 articles.

1. A survey on deep learning in medical image analysis

2. The emergence of machine learning techniques in criminology;Brennan;Criminol. Public Policy,2013

3. Deep learning for computational biology

4. A Shared Vision for Machine Learning in Neuroscience

5. B. Goodman , S. Flaxman , European Union regulations on algorithmic decision-making and a “right to explanation”. arXiv:1606.08813 (31 August 2016).

Cited by 903 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Calibrated explanations: With uncertainty information and counterfactuals;Expert Systems with Applications;2024-07

2. Interpretable classifier design by axiomatic fuzzy sets theory and derivative-free optimization;Expert Systems with Applications;2024-07

3. A simple approach for local and global variable importance in nonlinear regression models;Computational Statistics & Data Analysis;2024-06

4. Anomaly diagnosis of connected autonomous vehicles: A survey;Information Fusion;2024-05

5. Interpretable synthetic signals for explainable one-class time-series classification;Engineering Applications of Artificial Intelligence;2024-05