Chapter 15. Human-Centered Concept Explanations for Neural Networks-Reference-Cited by-同舟云学术

Chapter 15. Human-Centered Concept Explanations for Neural Networks

Published:2021-12-22 Issue: Volume: Page:
ISSN:0922-6389
Container-title:Frontiers in Artificial Intelligence and Applications
language:
Short-container-title:

Author:

Yeh Chih-Kuan¹,Kim Been²,Ravikumar Pradeep¹

Affiliation:

1. Carnegie Mellon University Machine Learning Department

2. Google Brain

Abstract

Understanding complex machine learning models such as deep neural networks with explanations is crucial in various applications. Many explanations stem from the model perspective, and may not necessarily effectively communicate why the model is making its predictions at the right level of abstraction. For example, providing importance weights to individual pixels in an image can only express which parts of that particular image is important to the model, but humans may prefer an explanation which explains the prediction by concept-based thinking. In this work, we review the emerging area of concept based explanations. We start by introducing concept explanations including the class of Concept Activation Vectors (CAV) which characterize concepts using vectors in appropriate spaces of neural activations, and discuss different properties of useful concepts, and approaches to measure the usefulness of concept vectors. We then discuss approaches to automatically extract concepts, and approaches to address some of their caveats. Finally, we discuss some case studies that showcase the utility of such concept-based explanations in synthetic settings and real world applications.

Publisher

IOS Press

Link

https://ebooks.iospress.nl/pdf/doi/10.3233/FAIA210362

Cited by 5 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Concept-Based Analysis of Neural Networks via Vision-Language Models;Lecture Notes in Computer Science;2024

2. Explaining Deep Neural Networks for Bearing Fault Detection with Vibration Concepts;2023 IEEE 21st International Conference on Industrial Informatics (INDIN);2023-07-18

3. Translating theory into practice: assessing the privacy implications of concept-based explanations for biomedical AI;Frontiers in Bioinformatics;2023-07-05

4. Leveraging explanations in interactive machine learning: An overview;Frontiers in Artificial Intelligence;2023-02-23

5. Feature-Guided Analysis of Neural Networks;Fundamental Approaches to Software Engineering;2023