Affiliation:
1. School of Computer Engineering and Science, Shanghai University, China
2. School of Computer and Information Engineering, Shanghai Polytechnic University, China
Abstract
In the multi-label text classification task, a text usually corresponds to multiple label categories, and the labels have correlation and hierarchical structure. However, when the label hierarchy is unknown, the number of various labels is not balanced, which makes it difficult for the model to classify low-frequency labels. In addition, labels have semantic similarities that make it difficult for the model to distinguish between them. In this article, we propose a multi-label text classification model based on multi-level constraint augmentation and label association attention. Compared with traditional methods, our method has two contributions: (1) In order to alleviate the problem of unbalanced number of different label categories and ensure the rationality of sample generation, we propose a data augmentation method based on multi-level constraints. In the process of sample generation, this method uses historical generation information, sample original text information, and sample topic to constrain the generated text. (2) In order to make the model recognize the associated labels accurately, we propose an interaction mechanism based on label association attention and filter gate. This method combines text information and label weight information. At the same time, our classification model considers the important weights of text sentences and effectively utilizes the co-occurrence relationship between labels. Experimental results on three benchmark datasets show that our model outperforms state-of-the-art methods on all main evaluation metrics, especially on low-frequency label prediction with sparse samples.
Funder
National Social Science Fund of China
SSPU young talent
Publisher
Association for Computing Machinery (ACM)
Reference53 articles.
1. Joint event causality extraction using dual-channel enhanced neural network
2. DAFS: A domain aware few shot generative model for event detection;Machine Learning,2022
3. Z. Zhao, H. Yu, X. Luo, and G. Shengming. 2022. IA-ICGCN: Integrating prior knowledge via intra-event association and inter-event causality for Chinese causal event extraction. In Proceedings of the International Conference on Artificial Neural Networks. Springer, Cham, 519–531.
4. DBGARE: Across-Within Dual Bipartite Graph Attention for Enhancing Distantly Supervised Relation Extraction
5. Hierarchical Attention Networks for Document Classification
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献