Data Augmentation Methods for Enhancing Robustness in Text Classification Tasks

Authors:

Tang Huidong, Kamei Sayaka, Morimoto Yasuhiko

Abstract

Text classification is widely studied in natural language processing (NLP). Deep learning models, including large pre-trained models such as BERT and DistilBERT, have achieved impressive results in text classification tasks, but their robustness against adversarial attacks remains a concern. To address this concern, we propose three data augmentation methods that improve the robustness of such pre-trained models. We evaluated the methods on four text classification datasets by fine-tuning DistilBERT on each augmented dataset and then exposing the resulting models to adversarial attacks to assess their robustness. Beyond enhancing robustness, the proposed methods also improve accuracy and F1-score on three of the datasets. In comparison experiments with two existing data augmentation methods, one of our proposed methods achieves a similar performance improvement, while all three yield superior robustness improvements.
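The abstract names neither the three proposed augmentation methods nor the attack suite, so the following is only a minimal sketch of the general workflow it describes: perturb training texts to build an augmented set, then fine-tune DistilBERT on it. The augmentation below uses generic token-level perturbations (random deletion and random swap, in the spirit of existing baselines such as EDA); the function name and parameters are illustrative assumptions, not the paper's method.

# Minimal sketch, assuming a generic token-level augmentation (random
# deletion + random swap). The paper's three methods are not detailed
# in the abstract; this only illustrates the augment-then-fine-tune
# workflow.
import random

def augment(text: str, p_delete: float = 0.1, n_swaps: int = 1) -> str:
    """Return a perturbed copy of `text` for data augmentation."""
    tokens = text.split()
    # Random deletion: drop each token with probability p_delete,
    # but never delete everything.
    kept = [t for t in tokens if random.random() > p_delete] or tokens[:1]
    # Random swap: exchange two token positions n_swaps times.
    for _ in range(n_swaps):
        if len(kept) > 1:
            i, j = random.sample(range(len(kept)), 2)
            kept[i], kept[j] = kept[j], kept[i]
    return " ".join(kept)

# Build the augmented training set: originals plus one perturbed copy each.
train = [("the movie was great", 1), ("terrible plot and acting", 0)]
augmented = train + [(augment(x), y) for (x, y) in train]

The augmented pairs would then be tokenized with the distilbert-base-uncased tokenizer and used to fine-tune a DistilBertForSequenceClassification model (e.g., via the Hugging Face transformers Trainer), after which robustness can be measured by attacking the fine-tuned model with an adversarial-attack toolkit.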

Funder

KAKENHI

Publisher

MDPI AG

Subject

Computational Mathematics, Computational Theory and Mathematics, Numerical Analysis, Theoretical Computer Science

References (28 articles)

1. Faris. An intelligent system for spam detection and identification of the most relevant features based on evolutionary random weight networks. Inf. Fusion, 2019.

2. Daouadi. Optimizing semantic deep forest for tweet topic classification. Inf. Syst., 2021.

3. Fan, F., Feng, Y., and Zhao, D. (2018, October 31–November 4). Multi-grained Attention Network for Aspect-Level Sentiment Classification. Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium.

4. Devlin, J., Chang, M.-W., Lee, K., and Toutanova, K. (2019, June 2–7). BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), Minneapolis, Minnesota.

5. Sanh, V., Debut, L., Chaumond, J., and Wolf, T. (2019). DistilBERT, a distilled version of BERT: Smaller, faster, cheaper and lighter. arXiv.

Cited by 4 articles.
