On the Explainability of Natural Language Processing Deep Models

Published: 2022-12-03 Issue: 5 Volume: 55 Page: 1-31
ISSN: 0360-0300
Container-title: ACM Computing Surveys
Language: en
Short-container-title: ACM Comput. Surv.

Author:

Zini Julia El¹, Awad Mariette¹

Affiliation:

1. Department of Electrical and Computer Engineering, American University of Beirut, Beirut, Lebanon

Abstract

Despite their success, deep networks are used as black-box models with outputs that are not easily explainable during the learning and the prediction phases. This lack of interpretability significantly limits the adoption of such models in domains where decisions are critical, such as the medical and legal fields. Recently, researchers have been interested in developing methods that help explain individual decisions and decipher the hidden representations of machine learning models in general and deep networks specifically. While there has been a recent explosion of work on Explainable Artificial Intelligence (ExAI) for deep models that operate on imagery and tabular data, textual datasets present new challenges to the ExAI community. Such challenges can be attributed to the lack of input structure in textual data, the use of word embeddings that add to the opacity of the models, and the difficulty of visualizing the inner workings of deep models when they are trained on textual data. Lately, methods have been developed to address these challenges and present satisfactory explanations of Natural Language Processing (NLP) models. However, such methods are yet to be studied in a comprehensive framework where common challenges are properly stated and rigorous evaluation practices and metrics are proposed. Motivated to democratize ExAI methods in the NLP field, we present in this work a survey that studies model-agnostic as well as model-specific explainability methods on NLP models. Such methods can either develop inherently interpretable NLP models or operate on pre-trained models in a post hoc manner. We make this distinction and further decompose the methods into three categories according to what they explain: (1) word embeddings (input level), (2) inner workings of NLP models (processing level), and (3) models’ decisions (output level). We also detail the different evaluation approaches for interpretability methods in the NLP field. Finally, we present a case study on the well-known task of neural machine translation in an appendix, and we propose promising future research directions for ExAI in the NLP field.

Publisher

Association for Computing Machinery (ACM)

Subject

General Computer Science, Theoretical Computer Science

Link

https://dl.acm.org/doi/pdf/10.1145/3529755

References: 129 articles.

1. Carl Allen and Timothy Hospedales. 2019. Analogies explained: Towards understanding word embeddings. In Proceedings of the International Conference on Machine Learning. 223–231.

2. A causal framework for explaining the predictions of black-box sequence-to-sequence models

3. Learning to Compose Neural Networks for Question Answering

4. Neural Module Networks

5. Leila Arras, José Arjona-Medina, Michael Widrich, Grégoire Montavon, Michael Gillhofer, Klaus-Robert Müller, Sepp Hochreiter, and Wojciech Samek. 2019. Explaining and interpreting LSTMs. In Explainable AI: Interpreting, Explaining and Visualizing Deep Learning. Springer, 211–238.

Cited by 32 articles.

1. From outputs to insights: a survey of rationalization approaches for explainable text classification;Frontiers in Artificial Intelligence;2024-07-23

2. From large language models to small logic programs: building global explanations from disagreeing local post-hoc explainers;Autonomous Agents and Multi-Agent Systems;2024-07-08

3. EXtrA-ShaRC: Explainable and Scrutable Reading Comprehension for Conversational Systems;Proceedings of the 32nd ACM Conference on User Modeling, Adaptation and Personalization;2024-06-22

4. Explainable Artificial Intelligence (XAI) 2.0: A manifesto of open challenges and interdisciplinary research directions;Information Fusion;2024-06

5. Identification of patients’ smoking status using an explainable AI approach: a Danish electronic health records case study;BMC Medical Research Methodology;2024-05-17