Geometry-Aware Weight Perturbation for Adversarial Training

Published: 2024-09-04 Issue: 17 Volume: 13 Page: 3508
ISSN: 2079-9292
Container-title: Electronics
Language: en
Short-container-title: Electronics

Author:

Jiang Yixuan¹, Chiang Hsiao-Dong¹

Affiliation:

1. School of Electrical and Computer Engineering, Cornell University, Ithaca, NY 14850, USA

Abstract

Adversarial training is one of the most successful approaches to improving model robustness against maliciously crafted data. Instead of training on a clean dataset, the model is trained on adversarial data generated on the fly. Building on this, a group of geometry-aware methods has been proposed to further enhance model robustness by assigning higher weights during training to the data points that are closer to the decision boundary. Although these methods significantly improve robustness against the adversarial attack seen during training, the model becomes more vulnerable to unseen attacks, and the reason for this has remained unclear. In this paper, we investigate the cause and argue that such geometry-aware methods lead to a sharp minimum, which results in poor robustness generalization to unseen attacks. As a remedy, we impose the adversarial weight perturbation mechanism and further develop a novel weight perturbation strategy called Geometry-Aware Weight Perturbation (GAWP). Extensive results demonstrate that the proposed method alleviates the robustness generalization issue of geometry-aware methods while consistently improving model robustness compared to existing weight perturbation strategies.
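
The geometry-aware weighting idea mentioned in the abstract can be illustrated with a small sketch. This is not the paper's GAWP method and not the authors' code. It is a toy example under stated assumptions: a linear classifier, labels in {-1, +1}, a PGD attack under an L-infinity budget, and a tanh-shaped weight that decays with the number of attack steps a point survives. The function names, the parameters `eps`, `alpha`, `steps`, and `lam`, and the exact weight formula are all illustrative choices.

```python
import math

def margin(w, b, x, y):
    """Signed margin y * (w . x + b); a non-positive value means misclassified."""
    return y * (sum(wi * xi for wi, xi in zip(w, x)) + b)

def pgd_linf(w, b, x, y, eps=0.5, alpha=0.1, steps=10):
    """Toy PGD attack on a linear classifier under an L-infinity budget.

    Returns the adversarial point and kappa, the number of PGD iterations
    that began with the point still correctly classified. A small kappa
    serves as a proxy for being close to the decision boundary.
    """
    x_adv = list(x)
    kappa = 0
    for _ in range(steps):
        if margin(w, b, x_adv, y) > 0:
            kappa += 1
        for i, wi in enumerate(w):
            # For logistic loss on a linear model, sign(grad_x loss) = -y * sign(w).
            step = -y * alpha * ((wi > 0) - (wi < 0))
            # Project back into the eps-ball around the clean input.
            x_adv[i] = min(max(x_adv[i] + step, x[i] - eps), x[i] + eps)
    return x_adv, kappa

def geometry_weight(kappa, steps, lam=0.0):
    """Assumed weighting: near 1 for small kappa, near 0 for kappa == steps."""
    return (1.0 + math.tanh(lam + 5.0 * (1.0 - 2.0 * kappa / steps))) / 2.0

def weighted_adversarial_loss(w, b, data, eps=0.5, alpha=0.1, steps=10):
    """Logistic loss on adversarial points, averaged with normalized geometry weights."""
    losses, weights = [], []
    for x, y in data:
        x_adv, kappa = pgd_linf(w, b, x, y, eps, alpha, steps)
        losses.append(math.log1p(math.exp(-margin(w, b, x_adv, y))))
        weights.append(geometry_weight(kappa, steps))
    total = sum(weights)
    return sum(wt * ls for wt, ls in zip(weights, losses)) / total

# Hypothetical data: one point near the boundary x0 = 0, one far from it.
w, b = [1.0, 0.0], 0.0
near, far = ([0.15, 0.0], 1), ([2.0, 0.0], 1)
_, kappa_near = pgd_linf(w, b, *near)
_, kappa_far = pgd_linf(w, b, *far)
print(kappa_near, kappa_far)
print(geometry_weight(kappa_near, 10), geometry_weight(kappa_far, 10))
print(weighted_adversarial_loss(w, b, [near, far]))
```

In this sketch the near-boundary point dominates the weighted loss, which is the behavior the abstract associates with geometry-aware methods. The weight perturbation step that the paper proposes is not shown here.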

Publisher

MDPI AG

Link

https://www.mdpi.com/2079-9292/13/17/3508/pdf
