Affiliation:
1. Department of Electronics, Chungwoon University, Incheon 22100, Republic of Korea
2. Department of AIOT, Sangmyung University, Seoul 03016, Republic of Korea
Abstract
In the past few years, deep convolutional neural networks (DCNNs) have surpassed human performance in tasks related to recognizing objects. However, DCNNs are also threatened by performance degradation due to adversarial examples. DCNNs are essentially black-boxed, and it is not known how the output is determined internally; consequently, it is not known how adversarial attacks cause performance degradation inside the DCNNs. To observe the internal neuronal activities of DCNN models for adversarial examples, we analyzed the population sparseness index (PSI) values at each layer of two representative DCNN models, namely AlexNet and VGG11. From the experimental results, we observed that the internal responses of the two DCNN models to adversarial examples exhibited distinct layer-wise PSI values, differing from the internal responses to benign examples. The main contribution of this study is the discovery of significant differences in the internal responses of two specific DCNN models to adversarial and benign examples by PSI. Furthermore, our research has the potential not only to contribute to the design of more robust DCNN models against adversarial examples but also to bridge the gap between the fields of artificial intelligence and neurophysiology of the brain.
Funder
National Research Foundation of Korea
Subject
Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science
Reference30 articles.
1. An Analysis Of Convolutional Neural Networks For Image Classification;Sharma;Procedia Comput. Sci.,2018
2. Development of convolutional neural network and its application in image classification: A survey;Wang;Opt. Eng.,2019
3. Gibb, R., and Kolb, B. (2018). The Neurobiology of Brain and Behavioral Development, Academic Press.
4. Goodfellow, I.J., Shlens, J., and Szegedy, C. (2015). Explaining and Harnessing Adversarial Examples. arXiv.
5. Philipp, G., and Carbonell, J.G. (2019). The Nonlinearity Coefficient—Predicting Generalization in Deep Neural Networks. arXiv.