Trusting deep learning natural-language models via local and global explanations-Reference-Cited by-同舟云学术

Trusting deep learning natural-language models via local and global explanations

Published:2022-06-22 Issue:7 Volume:64 Page:1863-1907
ISSN:0219-1377
Container-title:Knowledge and Information Systems
language:en
Short-container-title:Knowl Inf Syst

Author:

Ventura Francesco^ORCID,Greco Salvatore^ORCID,Apiletti Daniele^ORCID,Cerquitelli Tania^ORCID

Abstract

AbstractDespite the high accuracy offered by state-of-the-art deep natural-language models (e.g., LSTM, BERT), their application in real-life settings is still widely limited, as they behave like a black-box to the end-user. Hence, explainability is rapidly becoming a fundamental requirement of future-generation data-driven systems based on deep-learning approaches. Several attempts to fulfill the existing gap between accuracy and interpretability have been made. However, robust and specialized eXplainable Artificial Intelligence solutions, tailored to deep natural-language models, are still missing. We propose a new framework, named T-EBAnO, which provides innovative prediction-local and class-based model-global explanation strategies tailored to deep learning natural-language models. Given a deep NLP model and the textual input data, T-EBAnO provides an objective, human-readable, domain-specific assessment of the reasons behind the automatic decision-making process. Specifically, the framework extracts sets of interpretable features mining the inner knowledge of the model. Then, it quantifies the influence of each feature during the prediction process by exploiting the normalized Perturbation Influence Relation index at the local level and the novel Global Absolute Influence and Global Relative Influence indexes at the global level. The effectiveness and the quality of the local and global explanations obtained with T-EBAnO are proved on an extensive set of experiments addressing different tasks, such as a sentiment-analysis task performed by a fine-tuned BERT model and a toxic-comment classification task performed by an LSTM model. The quality of the explanations proposed by T-EBAnO, and, specifically, the correlation between the influence index and human judgment, has been evaluated by humans in a survey with more than 4000 judgments. To prove the generality of T-EBAnO and its model/task-independent methodology, experiments with other models (ALBERT, ULMFit) on popular public datasets (Ag News and Cola) are also discussed in detail.

Funder

Politecnico di Torino

Publisher

Springer Science and Business Media LLC

Subject

Artificial Intelligence,Hardware and Architecture,Human-Computer Interaction,Information Systems,Software

Link

https://link.springer.com/content/pdf/10.1007/s10115-022-01690-9.pdf

Reference53 articles.

1. Adadi A, Berrada M (2018) Peeking inside the black-box: a survey on explainable artificial intelligence (xai). IEEE Access 6:52138–52160. https://doi.org/10.1109/ACCESS.2018.2870052

2. Alvarez-Melis D, Jaakkola TS (2017) A causal framework for explaining the predictions of black-box sequence-to-sequence models. arXiv preprint arXiv:1707.01943

3. Banzhaf J (1965) Weighted voting doesn’t work: a mathematical analysis. Rutgers Law Rev 19(2):317–343

4. Basiri ME, Nemati S, Abdar M, Cambria E, Acharya UR (2021) Abcdm: an attention-based bidirectional cnn-rnn deep model for sentiment analysis. Futur Gener Comput Syst 115:279–294. https://doi.org/10.1016/j.future.2020.08.005

5. Bolukbasi T, Chang KW, Zou J, Saligrama V, Kalai A (2016) Man is to computer programmer as woman is to homemaker? debiasing word embeddings

Cited by 6 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Boosting court judgment prediction and explanation using legal entities;Artificial Intelligence and Law;2024-03-18

2. Feature importance measure of a multilayer perceptron based on the presingle-connection layer;Knowledge and Information Systems;2023-09-04

3. Understanding stance classification of BERT models: an attention-based framework;Knowledge and Information Systems;2023-08-31

4. A multi-scenario approach to continuously learn and understand norm violations;Autonomous Agents and Multi-Agent Systems;2023-08-16

5. Explaining deep convolutional models by measuring the influence of interpretable features in image classification;Data Mining and Knowledge Discovery;2023-02-10