Protecting the Neural Networks against FGSM Attack Using Machine Unlearning

Published: 2023-08-25

Author:

Jahanian Ali¹,Rastgarpour Maryam²,Khorasani Amir Hossein²

Affiliation:

1. Shahid Beheshti University

2. Islamic Azad University Saveh

Abstract

Machine learning is a powerful tool for building predictive models, but it is vulnerable to adversarial attacks. The Fast Gradient Sign Method (FGSM) is a common adversarial attack that adds small perturbations to input data in order to trick a model into misclassifying it. In response, researchers have developed methods for "unlearning" these attacks, which involve retraining a model on the original data without the added perturbations. Machine unlearning is a technique that tries to "forget" specific data points from the training dataset in order to improve the robustness of a machine learning model against adversarial attacks such as FGSM. In this paper, we focus on applying unlearning techniques to the LeNet neural network, a popular architecture for image classification. We evaluate the efficacy of unlearning FGSM attacks on the LeNet network and find that it can significantly improve the network's robustness against these types of attacks.
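
To make the FGSM perturbation concrete, the following is a minimal sketch and not the paper's implementation. It assumes a tiny logistic-regression classifier as a stand-in for LeNet, so that the input gradient can be written by hand without a deep learning library. FGSM then forms the adversarial input as x_adv = x + epsilon * sign(gradient of the loss with respect to x). The names `weights`, `bias`, `epsilon`, and all toy values are hypothetical and chosen only for illustration.

```python
import math


def sigmoid(z):
    """Logistic function mapping a real score to a probability."""
    return 1.0 / (1.0 + math.exp(-z))


def predict(weights, bias, x):
    """Probability of class 1 under a logistic-regression model (LeNet stand-in)."""
    score = sum(w * xi for w, xi in zip(weights, x)) + bias
    return sigmoid(score)


def loss_gradient_wrt_input(weights, bias, x, y):
    """Gradient of the binary cross-entropy loss with respect to the input x.

    For logistic regression this has the closed form (p - y) * w.
    """
    p = predict(weights, bias, x)
    return [(p - y) * w for w in weights]


def sign(value):
    """Return -1, 0, or 1 according to the sign of value."""
    return (value > 0) - (value < 0)


def fgsm(weights, bias, x, y, epsilon):
    """One-step FGSM: move every input feature by epsilon in the direction
    that increases the loss for the true label y."""
    grad = loss_gradient_wrt_input(weights, bias, x, y)
    return [xi + epsilon * sign(g) for xi, g in zip(x, grad)]


# Hypothetical toy model and input (assumed values, not taken from the paper).
weights = [2.0, -1.0]
bias = 0.0
x = [0.3, 0.2]   # clean input, true label 1
y = 1
epsilon = 0.25   # maximum per-feature perturbation

x_adv = fgsm(weights, bias, x, y, epsilon)

clean_prob = predict(weights, bias, x)
adv_prob = predict(weights, bias, x_adv)
print("clean P(class 1):      ", round(clean_prob, 3))
print("adversarial P(class 1):", round(adv_prob, 3))
```

In this toy setting the clean input is classified as class 1, while the perturbed input falls below the 0.5 decision threshold even though no feature has moved by more than `epsilon`. With a real network such as LeNet, the input gradient would come from automatic differentiation rather than from a closed form.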

Publisher

Research Square Platform LLC

References (25 articles)

1. Holland, B.J.: Handbook of Research on Technological Advances of Library and Information Science in Industry 5.0. Information Science Reference (2022)

2. Abiodun, O.I., Jantan, A., Omolara, A.E., Dada, K.V., Mohamed, N.A., Arshad, H.: State-of-the-art in artificial neural network applications: A survey. Elsevier Ltd. (2018). https://doi.org/10.1016/j.heliyon.2018.e00938

3. Keijsers, N.L.W.: Neural Networks. Elsevier Ltd., pp. 257–259 (2010). https://doi.org/10.1016/B978-0-12-374105-9.00493-7

4. Puri, M., Pathak, Y., Sutariya, V.K., Tipparaju, S., Moreno, W.: Artificial Neural Network for Drug Design, Delivery and Disposition. Elsevier Inc. (2016)

5. Lin, J., Dang, L., Rahouti, M., Xiong, K.: ML Attack Models: Adversarial Attacks and Data Poisoning Attacks. arXiv preprint arXiv:2112.02797v1 (2021)