Enhancing Neural Text Detector Robustness with μAttacking and RR-Training-Reference-Cited by-同舟云学术

Enhancing Neural Text Detector Robustness with μAttacking and RR-Training

Published:2023-04-21 Issue:8 Volume:12 Page:1948
ISSN:2079-9292
Container-title:Electronics
language:en
Short-container-title:Electronics

Author:

Liang Gongbo¹^ORCID,Guerrero Jesus¹,Zheng Fengbo²^ORCID,Alsmadi Izzat¹^ORCID

Affiliation:

1. College of Arts and Sciences, Texas A&M University-San Antonio, San Antonio, TX 78224, USA

2. College of Computer and Information Engineering, Tianjin Normal University, Tianjin 300387, China

Abstract

With advanced neural network techniques, language models can generate content that looks genuinely created by humans. Such advanced progress benefits society in numerous ways. However, it may also bring us threats that we have not seen before. A neural text detector is a classification model that separates machine-generated text from human-written ones. Unfortunately, a pretrained neural text detector may be vulnerable to adversarial attack, aiming to fool the detector into making wrong classification decisions. Through this work, we propose μAttacking, a mutation-based general framework that can be used to evaluate the robustness of neural text detectors systematically. Our experiments demonstrate that μAttacking identifies the detector’s flaws effectively. Inspired by the insightful information revealed by μAttacking, we also propose an RR-training strategy, a straightforward but effective method to improve the robustness of neural text detectors through finetuning. Compared with the normal finetuning method, our experiments demonstrated that RR-training effectively increased the model robustness by up to 11.33% without increasing much effort when finetuning a neural text detector. We believe the μAttacking and RR-training are useful tools for developing and evaluating neural language models.

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Computer Networks and Communications,Hardware and Architecture,Signal Processing,Control and Systems Engineering

Link

https://www.mdpi.com/2079-9292/12/8/1948/pdf

Reference84 articles.

1. Imagenet classification with deep convolutional neural networks;Krizhevsky;Commun. ACM,2017

2. Zhang, Y., Liang, G., Salem, T., and Jacobs, N. (2019, January 9–12). Defense-pointnet: Protecting pointnet against adversarial attacks. Proceedings of the IEEE International Conference on Big Data, Los Angeles, CA, USA.

3. Xing, X., Liang, G., Blanton, H., Rafique, M.U., Wang, C., Lin, A.L., and Jacobs, N. (2020, January 23–28). Dynamic image for 3d mri image alzheimer’s disease classification. Proceedings of the European Conference on Computer Vision Workshops, Glasgow, UK. Part I.

4. A deep learning view of the census of galaxy clusters in illustristng;Su;Mon. Not. R. Astron. Soc.,2020

5. Ying, Q., Xing, X., Liu, L., Lin, A.L., Jacobs, N., and Liang, G. (2021, January 1–5). Multi-modal data analysis for alzheimer’s disease diagnosis: An ensemble model using imagery and genetic features. Proceedings of the 43rd Annual International Conference of the IEEE Engineering in Medicine & Biology Society, Mexico City, Mexico.

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Detecting the Use of ChatGPT in University Newspapers by Analyzing Stylistic Differences with Machine Learning;Information;2024-05-25

2. Benchmark assessment for the DeepSpeed acceleration library on image classification;Cluster Computing;2023-08-26