SNIPPET: A Framework for Subjective Evaluation of Visual Explanations Applied to DeepFake Detection

Published:2024-06-13 Issue:8 Volume:20 Page:1-29
ISSN:1551-6857
Container-title:ACM Transactions on Multimedia Computing, Communications, and Applications
language:en
Short-container-title:ACM Trans. Multimedia Comput. Commun. Appl.

Author:

Yang Yuqing¹², Joukovsky Boris³², Oramas Mogrovejo José⁴⁵, Tuytelaars Tinne⁶, Deligiannis Nikos³²

Affiliation:

1. Department of Electronics and Informatics, VUB, Brussel, Belgium

2. Imec, Leuven, Belgium

3. Department of Electronics and Informatics, VUB, Brussel, Belgium

4. University of Antwerp, Antwerpen, Belgium

5. Imec-IDLab, Antwerpen, Belgium

6. KU Leuven, Leuven, Belgium

Abstract

Explainable Artificial Intelligence (XAI) attempts to help humans understand machine learning decisions better and has been identified as a critical component toward increasing the trustworthiness of complex black-box systems, such as deep neural networks. In this article, we propose a generic and comprehensive framework named SNIPPET and create a user interface for the subjective evaluation of visual explanations, focusing on finding human-friendly explanations. SNIPPET considers human-centered evaluation tasks and incorporates the collection of human annotations. These annotations can serve as valuable feedback to validate the qualitative results obtained from the subjective assessment tasks. Moreover, we consider different user background categories during the evaluation process to ensure diverse perspectives and comprehensive evaluation. We demonstrate SNIPPET on a DeepFake face dataset. Distinguishing real from fake faces is a non-trivial task, even for humans, that depends on rather subtle features, making it a challenging use case. Using SNIPPET, we evaluate four popular XAI methods that provide visual explanations: Gradient-weighted Class Activation Mapping, Layer-wise Relevance Propagation, attention rollout, and Transformer Attribution. Based on our experimental results, we observe preference variations among different user categories. We find that most people favor the explanations produced by attention rollout. Moreover, when it comes to XAI-assisted understanding, users with little or no relevant background knowledge often find visual explanations insufficient to help them understand. We open-source our framework for continued data collection and annotation at https://github.com/XAI-SubjEvaluation/SNIPPET.

Funder

FWO

Flemish Government

Onderzoeksprogramma Artificiele Intelligentie (AI) Vlaanderen

Trustworthy AI Methods

Publisher

Association for Computing Machinery (ACM)

Link

https://dl.acm.org/doi/pdf/10.1145/3665248

Reference: 66 articles.

1. Quantifying attention flow in transformers; Abnar Samira; arXiv preprint arXiv:2005.00928, 2020

2. On Pixel-Wise Explanations for Non-Linear Classifier Decisions by Layer-Wise Relevance Propagation

3. Artificial faces are harder to remember

4. Trustworthiness perception is disrupted in artificial faces

5. Exemplary natural images explain CNN activations better than state-of-the-art feature visualization; Borowski Judy; arXiv preprint arXiv:2010.12606, 2020