On the robustness of randomized classifiers to adversarial examples

Published: 2022-08-02 Issue: 9 Volume: 111 Pages: 3425-3457
ISSN: 0885-6125
Container-title: Machine Learning
Language: en
Short-container-title: Mach Learn

Author:

Pinot Rafael, Meunier Laurent, Yger Florian, Gouy-Pailler Cédric, Chevaleyre Yann, Atif Jamal

Abstract

This paper investigates the theory of robustness against adversarial attacks. We focus on randomized classifiers (i.e. classifiers that output random variables) and provide a thorough analysis of their behavior through the lens of statistical learning theory and information theory. To this aim, we introduce a new notion of robustness for randomized classifiers, enforcing local Lipschitzness using probability metrics. Equipped with this definition, we make two new contributions. The first one consists in devising a new upper bound on the adversarial generalization gap of randomized classifiers. More precisely, we devise bounds on the generalization gap and the adversarial gap (i.e. the gap between the risk and the worst-case risk under attack) of randomized classifiers. The second contribution presents a simple yet efficient noise injection method to design robust randomized classifiers. We show that our results are applicable to a wide range of machine learning models under mild hypotheses. We further corroborate our findings with experimental results using deep neural networks on standard image datasets, namely CIFAR-10 and CIFAR-100. On these tasks, we manage to design robust models that simultaneously achieve state-of-the-art accuracy (over 0.82 clean accuracy on CIFAR-10) and enjoy guaranteed robust accuracy bounds (0.45 against $\ell_{2}$ adversaries with magnitude 0.5 on CIFAR-10).
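As a rough illustration of the noise injection idea the abstract describes, the Python sketch below builds a toy randomized classifier by adding isotropic Gaussian noise to the input of a deterministic base classifier, then compares the output distributions at a clean input and at a perturbed one using the total variation distance. It is not the authors' implementation: the linear base classifier, the values of `w`, `b`, `x`, `delta`, and `sigma`, and every function name are hypothetical choices made for this example. The bound used at the end is the standard closed form for the total variation distance between two Gaussians with the same covariance, which also upper-bounds the distance between the classifier outputs, since post-processing cannot increase total variation.

```python
import math
import random


def gaussian_cdf(z):
    """Standard normal CDF, computed from the error function."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def base_classifier(x, w, b):
    """Hypothetical deterministic base classifier: a linear decision rule."""
    score = sum(wi * xi for wi, xi in zip(w, x)) + b
    return 1 if score >= 0 else 0


def randomized_predict(x, w, b, sigma, rng):
    """One draw of the randomized classifier: perturb the input, then classify."""
    noisy = [xi + rng.gauss(0.0, sigma) for xi in x]
    return base_classifier(noisy, w, b)


def estimate_class1_prob(x, w, b, sigma, n_samples, rng):
    """Monte Carlo estimate of P(output = 1) at input x."""
    hits = sum(randomized_predict(x, w, b, sigma, rng) for _ in range(n_samples))
    return hits / n_samples


def exact_class1_prob(x, w, b, sigma):
    """Closed form for the linear case: the noisy score is itself Gaussian."""
    score = sum(wi * xi for wi, xi in zip(w, x)) + b
    norm_w = math.sqrt(sum(wi * wi for wi in w))
    return gaussian_cdf(score / (sigma * norm_w))


def tv_gaussian_shift(delta_norm, sigma):
    """Total variation distance between N(x, sigma^2 I) and N(x + delta, sigma^2 I)."""
    return 2.0 * gaussian_cdf(delta_norm / (2.0 * sigma)) - 1.0


# Toy setup (all values are assumptions made for illustration only).
w, b = [1.0, -1.0], 0.0
x = [0.3, 0.1]
delta = [-0.3, 0.4]                      # l2 norm 0.5
x_adv = [xi + di for xi, di in zip(x, delta)]
sigma = 0.5
rng = random.Random(0)

p_clean = exact_class1_prob(x, w, b, sigma)
p_adv = exact_class1_prob(x_adv, w, b, sigma)
p_clean_mc = estimate_class1_prob(x, w, b, sigma, 20000, rng)

# For two-class outputs, total variation is the absolute gap in P(output = 1).
tv_outputs = abs(p_clean - p_adv)
delta_norm = math.sqrt(sum(di * di for di in delta))
tv_bound = tv_gaussian_shift(delta_norm, sigma)

print("P(class 1) clean, exact:", p_clean)
print("P(class 1) clean, Monte Carlo:", p_clean_mc)
print("TV between output distributions:", tv_outputs)
print("Gaussian shift bound:", tv_bound)
```

In this toy setting the output distribution can move by no more than the Gaussian shift bound, whatever direction the perturbation takes, which is the kind of distribution-level stability that a Lipschitz condition stated with a probability metric is meant to capture. Increasing `sigma` tightens the bound at the cost of noisier predictions.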

Funder

Ecocloud Research Center

EPFL Lausanne

Publisher

Springer Science and Business Media LLC

Subject

Artificial Intelligence,Software

Link

https://link.springer.com/content/pdf/10.1007/s10994-022-06216-6.pdf

References: 66 articles.

Cited by 2 articles.

1. Byzantine Machine Learning: A Primer;ACM Computing Surveys;2023-08-18

2. Implementing Responsible AI: Tensions and Trade-Offs Between Ethics Aspects;2023 International Joint Conference on Neural Networks (IJCNN);2023-06-18