Is BERT Really Robust? A Strong Baseline for Natural Language Attack on Text Classification and Entailment-Reference-Cited by-同舟云学术

Is BERT Really Robust? A Strong Baseline for Natural Language Attack on Text Classification and Entailment

Published:2020-04-03 Issue:05 Volume:34 Page:8018-8025
ISSN:2374-3468
Container-title:Proceedings of the AAAI Conference on Artificial Intelligence
language:
Short-container-title:AAAI

Author:

Jin Di,Jin Zhijing,Zhou Joey Tianyi,Szolovits Peter

Abstract

Machine learning algorithms are often vulnerable to adversarial examples that have imperceptible alterations from the original counterparts but can fool the state-of-the-art models. It is helpful to evaluate or even improve the robustness of these models by exposing the maliciously crafted adversarial examples. In this paper, we present TextFooler, a simple but strong baseline to generate adversarial text. By applying it to two fundamental natural language tasks, text classification and textual entailment, we successfully attacked three target models, including the powerful pre-trained BERT, and the widely used convolutional and recurrent neural networks. We demonstrate three advantages of this framework: (1) effective—it outperforms previous attacks by success rate and perturbation rate, (2) utility-preserving—it preserves semantic content, grammaticality, and correct types classified by humans, and (3) efficient—it generates adversarial text with computational complexity linear to the text length.1

Publisher

Association for the Advancement of Artificial Intelligence (AAAI)

Subject

General Medicine

Cited by 255 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. HyGloadAttack: Hard-label black-box textual adversarial attacks via hybrid optimization;Neural Networks;2024-10

2. TextJuggler: Fooling text classification tasks by generating high-quality adversarial examples;Knowledge-Based Systems;2024-09

3. Robustness of models addressing Information Disorder: A comprehensive review and benchmarking study;Neurocomputing;2024-09

4. A Semantic, Syntactic, and Context-Aware Natural Language Adversarial Example Generator;IEEE Transactions on Dependable and Secure Computing;2024-09

5. LLMEffiChecker : Understanding and Testing Efficiency Degradation of Large Language Models;ACM Transactions on Software Engineering and Methodology;2024-08-26