On the Relationship between Generalization and Robustness to Adversarial Examples

Published:2021-05-07 Issue:5 Volume:13 Page:817
ISSN:2073-8994
Container-title:Symmetry
language:en
Short-container-title:Symmetry

Author:

Pedraza Anibal, Deniz Oscar, Bueno Gloria

Abstract

One of the most intriguing phenomena in deep learning is that of so-called adversarial examples. These samples are visually equivalent to normal inputs, with differences undetectable to humans, yet they cause networks to output wrong results. The phenomenon can be framed as a symmetry/asymmetry problem, whereby inputs to a neural network that have a similar/symmetric appearance to regular images produce an opposite/asymmetric output. Some researchers focus on developing methods for generating adversarial examples, while others propose defense methods. In parallel, there is growing interest in characterizing the phenomenon, which is also the focus of this paper. Using well-known datasets of common images, such as CIFAR-10 and STL-10, a neural network architecture is first trained in a normal regime, where training and validation performance both increase, reaching generalization. The same architectures and datasets are then trained in an overfitting regime, where there is a growing disparity between training and validation performance. The behaviour of these two regimes against adversarial examples is then compared. From the results, we observe greater robustness to adversarial examples in the overfitting regime. We explain this simultaneous loss of generalization and gain in robustness to adversarial examples as another manifestation of the well-known fitting-generalization trade-off.
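
The comparison described in the abstract (clean accuracy versus accuracy on adversarially perturbed inputs) can be illustrated with a minimal sketch. The abstract does not name the attack or the architecture, so everything here is an assumption for illustration only: a toy two-feature logistic classifier stands in for the neural network, the fast gradient sign method (FGSM) stands in for the attack, and all function names, weights, and data points are hypothetical.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def predict_proba(w, b, x):
    # Probability of class 1 under a linear (logistic) model.
    return sigmoid(sum(wi * xi for wi, xi in zip(w, x)) + b)

def sign(v):
    # Returns -1, 0, or 1.
    return (v > 0) - (v < 0)

def fgsm(w, b, x, y, eps):
    # FGSM step (assumed attack): x + eps * sign(dL/dx).
    # For the logistic loss, dL/dx = (p - y) * w.
    p = predict_proba(w, b, x)
    grad = [(p - y) * wi for wi in w]
    return [xi + eps * sign(g) for xi, g in zip(x, grad)]

def accuracy(w, b, xs, ys):
    correct = sum(int(predict_proba(w, b, x) >= 0.5) == y
                  for x, y in zip(xs, ys))
    return correct / len(xs)

def robust_accuracy(w, b, xs, ys, eps):
    # Accuracy measured on FGSM-perturbed copies of the inputs.
    adv = [fgsm(w, b, x, y, eps) for x, y in zip(xs, ys)]
    return accuracy(w, b, adv, ys)

# Hypothetical toy model and data (not from the paper).
w, b = [2.0, -1.0], 0.0
xs = [[0.3, 0.1], [-0.4, 0.2], [0.05, 0.0], [-0.6, -0.1]]
ys = [1, 0, 1, 0]

clean = accuracy(w, b, xs, ys)
robust = robust_accuracy(w, b, xs, ys, eps=0.1)
print(clean, robust)
```

In this toy setting the point closest to the decision boundary is misclassified after a small perturbation, so robust accuracy falls below clean accuracy. Repeating the same measurement for a model trained in each regime would give the kind of comparison the abstract describes.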

Funder

Ministerio de Economía y Competitividad

Publisher

MDPI AG

Subject

Physics and Astronomy (miscellaneous),General Mathematics,Chemistry (miscellaneous),Computer Science (miscellaneous)

Link

https://www.mdpi.com/2073-8994/13/5/817/pdf

References: 26 articles.

1. Intriguing properties of neural networks;Szegedy;arXiv,2013

2. Towards deep learning models resistant to adversarial attacks;Madry;arXiv,2017

3. Ensemble adversarial training: Attacks and defenses;Tramèr;arXiv,2017

Cited by 8 articles.

1. A Speech Adversarial Sample Detection Method Based on Manifold Learning;Mathematics;2024-04-19

2. NFT Image Plagiarism Check Using EfficientNet-Based Deep Neural Network with Triplet Semi-Hard Loss;Applied Sciences;2023-02-27

3. Security Versus Accuracy: Trade-Off Data Modeling to Safe Fault Classification Systems;IEEE Transactions on Neural Networks and Learning Systems;2023

4. Hyper-flexible Convolutional Neural Networks based on Generalized Lehmer and Power Means;Neural Networks;2022-11

5. Detecting chaos in adversarial examples;Chaos, Solitons & Fractals;2022-10