A Survey of Adversarial Defenses and Robustness in NLP-Reference-Cited by-同舟云学术

A Survey of Adversarial Defenses and Robustness in NLP

Published:2023-07-17 Issue:14s Volume:55 Page:1-39
ISSN:0360-0300
Container-title:ACM Computing Surveys
language:en
Short-container-title:ACM Comput. Surv.

Author:

Goyal Shreya¹^ORCID,Doddapaneni Sumanth¹^ORCID,Khapra Mitesh M.¹^ORCID,Ravindran Balaraman¹^ORCID

Affiliation:

1. Robert Bosch Centre for Data Science and AI, Indian Institute of Technology Madras, India

Abstract

In the past few years, it has become increasingly evident that deep neural networks are not resilient enough to withstand adversarial perturbations in input data, leaving them vulnerable to attack. Various authors have proposed strong adversarial attacks for computer vision and Natural Language Processing (NLP) tasks. As a response, many defense mechanisms have also been proposed to prevent these networks from failing. The significance of defending neural networks against adversarial attacks lies in ensuring that the model’s predictions remain unchanged even if the input data is perturbed. Several methods for adversarial defense in NLP have been proposed, catering to different NLP tasks such as text classification, named entity recognition, and natural language inference. Some of these methods not only defend neural networks against adversarial attacks but also act as a regularization mechanism during training, saving the model from overfitting. This survey aims to review the various methods proposed for adversarial defenses in NLP over the past few years by introducing a novel taxonomy. The survey also highlights the fragility of advanced deep neural networks in NLP and the challenges involved in defending them.

Publisher

Association for Computing Machinery (ACM)

Subject

General Computer Science,Theoretical Computer Science

Link

https://dl.acm.org/doi/pdf/10.1145/3593042

Reference154 articles.

1. Threat of Adversarial Attacks on Deep Learning in Computer Vision: A Survey

2. Generating Natural Language Adversarial Examples

3. Anish Athalye, Nicholas Carlini, and David Wagner. 2018. Obfuscated gradients give a false sense of security: Circumventing defenses to adversarial examples. In Proceedings of the International Conference on Machine Learning. PMLR, 274–283.

4. Imperceptible adversarial attacks on tabular data;Ballet Vincent;arXiv preprint arXiv:1911.03274,2019

5. Defending pre-trained language models from adversarial word substitutions without performance sacrifice;Bao Rongzhou;arXiv preprint arXiv:2105.14553,2021

Cited by 30 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. TextJuggler: Fooling text classification tasks by generating high-quality adversarial examples;Knowledge-Based Systems;2024-09

2. Robustness of models addressing Information Disorder: A comprehensive review and benchmarking study;Neurocomputing;2024-09

3. A verified training support vector machine in bearing fault diagnosis;Measurement Science and Technology;2024-08-20

4. From text to multimodal: a survey of adversarial example generation in question answering systems;Knowledge and Information Systems;2024-08-09

5. Roadmap of Adversarial Machine Learning in Internet of Things-Enabled Security Systems;Sensors;2024-08-09