Data augmentation using virtual word insertion techniques in text classification tasks-Reference-Cited by-同舟云学术

Data augmentation using virtual word insertion techniques in text classification tasks

Published:2023-12-12 Issue: Volume: Page:
ISSN:0266-4720
Container-title:Expert Systems
language:en
Short-container-title:Expert Systems

Author:

Long Zhigao¹^ORCID,Li Hong¹,Shi Jiawen¹,Ma Xin¹

Affiliation:

1. School of Computer Science and Engineering Central South University Changsha China

Abstract

AbstractLabelling multiple training examples for text classification models is usually time‐consuming and complex. Data augmentation can be used to automatically expand the dataset by transforming the original data. However, it may cause semantic changes without modifying the labels, which reduces the effectiveness of the classifiers. In this paper, we propose a data‐augmentation method called the virtual word insertion technique, which generates new sentences by randomly inserting virtual words into existing sentences. Two methods are used to achieve virtual word embedding: unweighted average and weighted average. Furthermore, a new concept of weight is proposed: the class deviation factor, which demonstrates the correlation between words and classes. Based on this new concept, different weights are assigned to words of different classes. Experiments are conducted on five different classification tasks. Ablation experiments are also performed to explore the effects of random operation and number of augmented sentences for classification results. The results of these experiments show that our method improves the classification performance and outperforms two other contrasting data‐augmentation methods in automatically augmenting the dataset. Compared to raw datasets, the average accuracy improvement of our method is 3.5% for a small‐scale dataset and 1% for a large‐scale dataset.

Funder

National Natural Science Foundation of China

Publisher

Wiley

Subject

Artificial Intelligence,Computational Theory and Mathematics,Theoretical Computer Science,Control and Systems Engineering

Link

https://onlinelibrary.wiley.com/doi/pdf/10.1111/exsy.13519

Reference40 articles.

1. Alom M. Z. Taha T. M. Yakopcic C. Westberg S. Sidike P. Nasrin M. S. Van Esesn B. C. Awwal A. A. S. &Asari V. K.(2018).The history began from alexnet: A comprehensive survey on deep learning approaches.arXiv preprint arXiv:1803.01164.

2. Alzantot M. Sharma Y. Elgohary A. Ho B.‐J. Srivastava M. &Chang K.‐W.(2018 October‐November).Generating natural language adversarial examples[Conference presentation]. Proceedings of the 2018 conference on empirical methods in natural language processing Brussels Belgium. 2890–2896.https://aclanthology.org/D18-1316

3. Do Not Have Enough Data? Deep Learning to the Rescue!

4. A Survey on Data Augmentation for Text Classification

5. Belinkov Y. &Bisk Y.(2018).Synthetic and natural noise both break neural machine translation[Conference presentation]. International Conference on Learning Representations.