TextGTL: Graph-based Transductive Learning for Semi-supervised Text Classification via Structure-Sensitive Interpolation

Published: 2021-08
Container-title: Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence

Author:

Li Chen¹, Peng Xutan², Peng Hao¹, Li Jianxin¹, Wang Lihong³

Affiliation:

1. Beihang University

2. The University of Sheffield

3. National Computer Network Emergency Response Technical Team/Coordination Center of China

Abstract

Compared with traditional sequential learning models, graph-based neural networks exhibit excellent properties when encoding text, such as the capacity to capture global and local information simultaneously. Especially in the semi-supervised scenario, propagating information along edges can effectively alleviate the sparsity of labeled data. In this paper, beyond the existing architecture of heterogeneous word-document graphs, for the first time, we investigate how to construct lightweight non-heterogeneous graphs based on different linguistic information to better serve free text representation learning. Then, a novel semi-supervised framework for text classification that refines graph topology under theoretical guidance and shares information across different text graphs, namely Text-oriented Graph-based Transductive Learning (TextGTL), is proposed. TextGTL also performs attribute space interpolation based on dense substructures in graphs to predict low-entropy labels with high-quality feature nodes for data augmentation. To verify the effectiveness of TextGTL, we conduct extensive experiments on various benchmark datasets, observing significant performance gains over conventional heterogeneous graphs. In addition, we design ablation studies to dive deep into the validity of the components in TextGTL.
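
Illustrative note (not part of the paper): the abstract's remark that propagating information along edges eases label sparsity can be sketched with classic label propagation on a toy document graph. This is a generic textbook technique, not the TextGTL method. The graph, the seed labels, and the function name propagate_labels are all hypothetical and chosen only for illustration.

```python
def propagate_labels(adjacency, seed_labels, num_classes, iterations=50):
    """Classic label propagation: repeatedly average neighbor label
    distributions while keeping the labeled (seed) nodes fixed."""
    n = len(adjacency)
    # Unlabeled nodes start from a uniform distribution over classes.
    scores = [[1.0 / num_classes] * num_classes for _ in range(n)]
    for node, label in seed_labels.items():
        scores[node] = [1.0 if c == label else 0.0 for c in range(num_classes)]

    for _ in range(iterations):
        updated = []
        for node in range(n):
            neighbors = adjacency[node]
            if node in seed_labels or not neighbors:
                # Seeds are clamped; isolated nodes keep their current scores.
                updated.append(scores[node])
                continue
            updated.append([
                sum(scores[m][c] for m in neighbors) / len(neighbors)
                for c in range(num_classes)
            ])
        scores = updated

    # Predict the highest-scoring class for every node.
    return [max(range(num_classes), key=lambda c: row[c]) for row in scores]


# Hypothetical toy graph: two triangles of documents (0-1-2 and 3-4-5)
# joined by a single edge 2-3. Only documents 0 and 5 carry labels.
adjacency = [[1, 2], [0, 2], [0, 1, 3], [2, 4, 5], [3, 5], [3, 4]]
seed_labels = {0: 0, 5: 1}
predicted = propagate_labels(adjacency, seed_labels, num_classes=2)
print(predicted)
```

With only two labeled documents, each unlabeled document ends up with the label of the seed in its own densely connected cluster, which is the intuition behind transductive learning on text graphs.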

Publisher

International Joint Conferences on Artificial Intelligence Organization

Cited by 13 articles.

1. GEML: a graph-enhanced pre-trained language model framework for text classification via mutual learning; Applied Intelligence; 2024-09-11

2. Graph neural networks for text classification: a survey; Artificial Intelligence Review; 2024-07-01

3. Machine learning based software effort estimation using development-centric features for crowdsourcing platform; Intelligent Data Analysis; 2024-04-01

4. Study on Combined-CNN Model for Classification of Terrorism Text; 2024 7th International Conference on Advanced Algorithms and Control Engineering (ICAACE); 2024-03-01

5. Mutual Learning for News Classification; Lecture Notes in Networks and Systems; 2024