Minimally Distorted Adversarial Images with a Step-Adaptive Iterative Fast Gradient Sign Method-Reference-Cited by-同舟云学术

Minimally Distorted Adversarial Images with a Step-Adaptive Iterative Fast Gradient Sign Method

Published:2024-06-18 Issue:2 Volume:5 Page:922-937
ISSN:2673-2688
Container-title:AI
language:en
Short-container-title:AI

Author:

Ding Ning¹,Möller Knut¹^ORCID

Affiliation:

1. Institute of Technical Medicine, Furtwangen University, 78054 Villingen-Schwenningen, Germany

Abstract

The safety and robustness of convolutional neural networks (CNNs) have raised increasing concerns, especially in safety-critical areas, such as medical applications. Although CNNs are efficient in image classification, their predictions are often sensitive to minor, for human observers, invisible modifications of the image. Thus, a modified, corrupted image can be visually equal to the legitimate image for humans but fool the CNN and make a wrong prediction. Such modified images are called adversarial images throughout this paper. A popular method to generate adversarial images is backpropagating the loss gradient to modify the input image. Usually, only the direction of the gradient and a given step size were used to determine the perturbations (FGSM, fast gradient sign method), or the FGSM is applied multiple times to craft stronger perturbations that change the model classification (i-FGSM). On the contrary, if the step size is too large, the minimum perturbation of the image may be missed during the gradient search. To seek exact and minimal input images for a classification change, in this paper, we suggest starting the FGSM with a small step size and adapting the step size with iterations. A few decay algorithms were taken from the literature for comparison with a novel approach based on an index tracking the loss status. In total, three tracking functions were applied for comparison. The experiments show our loss adaptive decay algorithms could find adversaries with more than a 90% success rate while generating fewer perturbations to fool the CNNs.

Funder

German Federal Ministry of Research and Education

Publisher

MDPI AG

Link

https://www.mdpi.com/2673-2688/5/2/46/pdf

Reference32 articles.

1. Endonet: A deep architecture for recognition tasks on laparoscopic videos;Twinanda;IEEE Trans. Med. Imaging,2016

2. Adversarial examples: Attacks and defences on medical deep learning systems;Puttagunta;Multimed. Tools Appl.,2023

3. Carlini, N., Athalye, A., Papernot, N., Brendel, W., Rauber, J., Tsipras, D., Goodfellow, I., Madry, A., and Kurakin, A. (2019). On evaluating adversarial robustness. arXiv.

4. Adversarial examples: Opportunities and challenges;Zhang;IEEE Trans. Neural Netw. Learn. Syst.,2019

5. Balda, E.R., Behboodi, A., and Mathar, R. (2020). Adversarial examples in deep neural networks: An overview. Deep Learning: Algorithms and Applications, Springer.