Do Neural Transformers Learn Human-Defined Concepts? An Extensive Study in Source Code Processing Domain-Reference-Cited by-同舟云学术

Do Neural Transformers Learn Human-Defined Concepts? An Extensive Study in Source Code Processing Domain

Published:2022-11-29 Issue:12 Volume:15 Page:449
ISSN:1999-4893
Container-title:Algorithms
language:en
Short-container-title:Algorithms

Author:

Ferretti Claudio^ORCID,Saletta Martina^ORCID

Abstract

State-of-the-art neural networks build an internal model of the training data, tailored to a given classification task. The study of such a model is of interest, and therefore, research on explainable artificial intelligence (XAI) aims at investigating if, in the internal states of a network, it is possible to identify rules that associate data to their corresponding classification. This work moves toward XAI research on neural networks trained in the classification of source code snippets, in the specific domain of cybersecurity. In this context, typically, textual instances have firstly to be encoded with non-invertible transformation into numerical vectors to feed the models, and this limits the applicability of known XAI methods based on the differentiation of neural signals with respect to real valued instances. In this work, we start from the known TCAV method, designed to study the human understandable concepts that emerge in the internal layers of a neural network, and we adapt it to transformers architectures trained in solving source code classification problems. We first determine domain-specific concepts (e.g., the presence of given patterns in the source code), and for each concept, we train support vector classifiers to separate points in the vector activation spaces that represent input instances with the concept from those without the concept. Then, we study if the presence (or the absence) of such concepts affects the decision process of the neural network. Finally, we discuss about how our approach contributes to general XAI goals and we suggest specific applications in the source code analysis field.

Publisher

MDPI AG

Subject

Computational Mathematics,Computational Theory and Mathematics,Numerical Analysis,Theoretical Computer Science

Link

https://www.mdpi.com/1999-4893/15/12/449/pdf

Reference31 articles.

1. Kanade, A., Maniatis, P., Balakrishnan, G., and Shi, K. (2020, January 13–18). Learning and evaluating contextual embedding of source code. Proceedings of the 37th International Conference on Machine Learning, ICML 2020, Virtual.

2. Kim, B., Wattenberg, M., Gilmer, J., Cai, C.J., Wexler, J., Viégas, F.B., and Sayres, R. (2018, January 10–15). Interpretability Beyond Feature Attribution: Quantitative Testing with Concept Activation Vectors (TCAV). Proceedings of the 35th International Conference on Machine Learning, ICML 2018, Stockholm, Sweden.

3. Saletta, M., and Ferretti, C. (2022, January 9–13). Towards the Evolutionary Assessment of Neural Transformers Trained on Source Code. Proceedings of the GECCO ’22: Genetic and Evolutionary Computation Conference, Companion Volume, Boston, MA, USA.

4. Gosain, A., and Sharma, G. (2015). Intelligent Computing and Applications, Springer.

5. A Survey of Machine Learning for Big Code and Naturalness;Allamanis;ACM Comput. Surv.,2018

Cited by 4 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A heuristic method for discovering multi-class classification rules from multi-source data in cloud–edge system;Journal of King Saud University - Computer and Information Sciences;2024-02

2. Potential of Explainable Artificial Intelligence in Advancing Renewable Energy: Challenges and Prospects;Energy & Fuels;2024-01-19

3. Evolutionary Approaches for Adversarial Attacks on Neural Source Code Classifiers;Algorithms;2023-10-12

4. Exploring Neural Dynamics in Source Code Processing Domain;Information;2023-04-21