Multi-Label Text Classification Model Based on Multi-Level Constraint Augmentation and Label Association Attention

Published: 2024-01-15 Issue: 1 Volume: 23 Page: 1-20
ISSN: 2375-4699
Container-title: ACM Transactions on Asian and Low-Resource Language Information Processing
Language: en
Short-container-title: ACM Trans. Asian Low-Resour. Lang. Inf. Process.

Author:

Wei Xiao¹, Huang Jianbao¹, Zhao Rui¹, Yu Hang¹, Xu Zheng²

Affiliation:

1. School of Computer Engineering and Science, Shanghai University, China

2. School of Computer and Information Engineering, Shanghai Polytechnic University, China

Abstract

In multi-label text classification, a text usually corresponds to multiple label categories, and the labels are correlated and hierarchically structured. However, when the label hierarchy is unknown, the label frequencies are unbalanced, which makes it difficult for the model to classify low-frequency labels. In addition, semantic similarities between labels make it difficult for the model to distinguish between them. In this article, we propose a multi-label text classification model based on multi-level constraint augmentation and label association attention. Our method makes two contributions over traditional methods: (1) To alleviate the imbalance among label categories and ensure that generated samples are reasonable, we propose a data augmentation method based on multi-level constraints. During sample generation, this method uses historical generation information, the original text of the sample, and the sample topic to constrain the generated text. (2) To help the model recognize associated labels accurately, we propose an interaction mechanism based on label association attention and a filter gate. This method combines text information and label weight information. At the same time, our classification model considers the importance weights of text sentences and effectively utilizes the co-occurrence relationships between labels. Experimental results on three benchmark datasets show that our model outperforms state-of-the-art methods on all main evaluation metrics, especially on low-frequency label prediction with sparse samples.
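
The abstract refers to unbalanced label frequencies and to co-occurrence relationships between labels without giving formulas. As a rough illustration only, the following sketch counts label frequencies and pairwise co-occurrences in a toy dataset and derives a conditional co-occurrence estimate. It is not the authors' model: the function names, the toy labels, and the choice of a simple count ratio for P(b | a) are all assumptions made for this example.

```python
from collections import Counter
from itertools import combinations


def label_statistics(label_sets):
    """Count per-label frequencies and pairwise label co-occurrences.

    label_sets: iterable of label lists, one list per text sample.
    Pairs are stored as alphabetically sorted tuples.
    """
    freq = Counter()
    cooc = Counter()
    for labels in label_sets:
        unique = sorted(set(labels))  # ignore duplicate labels in one sample
        freq.update(unique)
        cooc.update(combinations(unique, 2))
    return freq, cooc


def conditional_cooccurrence(freq, cooc, a, b):
    """Estimate P(b | a) as count(a and b) / count(a); 0.0 if a is unseen."""
    pair = tuple(sorted((a, b)))
    return cooc[pair] / freq[a] if freq[a] else 0.0


# Hypothetical toy data: "sports" is frequent, "finance" is rare.
samples = [
    ["sports", "health"],
    ["sports", "politics"],
    ["sports", "health"],
    ["finance"],
]
freq, cooc = label_statistics(samples)
```

Here `freq` exposes the imbalance (three samples carry "sports", one carries "finance"), and `conditional_cooccurrence(freq, cooc, "health", "sports")` shows how strongly one label implies another. A statistic of this kind is one common way to build a label association signal; how the paper actually encodes it is not stated in the abstract.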

Funder

National Social Science Fund of China

SSPU young talent

Publisher

Association for Computing Machinery (ACM)

Subject

General Computer Science

Link

https://dl.acm.org/doi/pdf/10.1145/3586008

Cited by 2 articles.

1. A social context-aware graph-based multimodal attentive learning framework for disaster content classification during emergencies;Expert Systems with Applications;2025-01

2. Improving Clothing Product Quality and Reducing Waste Based on Consumer Review Using RoBERTa and BERTopic Language Model;Big Data and Cognitive Computing;2023-10-25