Adversarial robustness in deep neural networks based on variable attributes of the stochastic ensemble model
Published:2023-08-08
Volume:17
ISSN:1662-5218
Container-title:Frontiers in Neurorobotics
Short-container-title:Front. Neurorobot.
Author:
Qin Ruoxi,Wang Linyuan,Du Xuehui,Xie Pengfei,Chen Xingyuan,Yan Bin
Abstract
Deep neural networks (DNNs) have been shown to be critically vulnerable to adversarial samples. This has prompted the development of attack and defense strategies similar to those used in cyberspace security. Because each side's strategy depends on the other's mechanisms, the attack and defense algorithms evolve as closely coupled processes, with the defense being particularly passive. Inspired by the dynamic defense approach proposed in cyberspace to address the endless arms race, this article defines ensemble quantity, network structure, and smoothing parameters as variable ensemble attributes and proposes a stochastic ensemble strategy based on heterogeneous and redundant sub-models. The proposed method introduces diversity and randomness into deep neural networks to alter the otherwise fixed gradient correspondence between input and output. The unpredictability and diversity of the gradients make it more difficult for attackers to directly implement white-box attacks, helping to address the extreme transferability and vulnerability of ensemble models under white-box attacks. An experimental comparison of attack success rate (ASR) versus distortion curves across different attack scenarios on CIFAR10 preliminarily demonstrates the effectiveness of the proposed method: even the highest-capacity attacker cannot easily exceed the attack success rate achieved against the ensemble smoothed model, especially for untargeted attacks.
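The idea of varying ensemble attributes at inference time can be illustrated with a minimal sketch. It is not the authors' implementation: the sub-models are random linear maps standing in for trained networks, smoothing is assumed to mean averaging predictions over Gaussian-perturbed copies of the input, and every name and default value (`make_linear_model`, `stochastic_ensemble_predict`, `k_range`, `sigmas`, `n_noise`) is hypothetical. Each call resamples the ensemble size, the member sub-models, and the noise level, so the mapping from input to output, and hence its gradient, changes from query to query.

```python
import math
import random


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def make_linear_model(n_in, n_out, rng):
    """Hypothetical stand-in for a trained sub-model: a random linear map."""
    w = [[rng.gauss(0.0, 1.0) for _ in range(n_in)] for _ in range(n_out)]

    def model(x):
        return [sum(wi * xi for wi, xi in zip(row, x)) for row in w]

    return model


def stochastic_ensemble_predict(x, pool, rng, k_range=(2, 4),
                                sigmas=(0.05, 0.1, 0.25), n_noise=8):
    """Average class probabilities over a randomly drawn ensemble.

    All three attributes are resampled on every call (assumed scheme):
    ensemble quantity k, the member sub-models, and the noise level sigma.
    """
    k = rng.randint(k_range[0], min(k_range[1], len(pool)))  # ensemble quantity
    members = rng.sample(pool, k)                            # which sub-models
    sigma = rng.choice(sigmas)                               # smoothing parameter

    n_classes = len(members[0](x))
    avg = [0.0] * n_classes
    for model in members:
        for _ in range(n_noise):
            # Perturb the input with Gaussian noise before each evaluation.
            noisy = [xi + rng.gauss(0.0, sigma) for xi in x]
            probs = softmax(model(noisy))
            for c in range(n_classes):
                avg[c] += probs[c]

    total = k * n_noise
    return [p / total for p in avg], k, sigma


if __name__ == "__main__":
    rng = random.Random(0)
    pool = [make_linear_model(4, 3, rng) for _ in range(5)]
    x = [0.1, -0.2, 0.3, 0.4]
    for _ in range(3):
        probs, k, sigma = stochastic_ensemble_predict(x, pool, rng)
        print(k, sigma, [round(p, 3) for p in probs])
```

Repeated calls on the same input generally draw a different `k`, member set, and `sigma`, which is the unpredictability the abstract relies on to hinder white-box gradient estimation.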
Publisher
Frontiers Media SA
Subject
Artificial Intelligence,Biomedical Engineering