HGAT: Heterogeneous Graph Attention Networks for Semi-supervised Short Text Classification

Published:2021-07-26 Issue:3 Volume:39 Page:1-29
ISSN:1046-8188
Container-title:ACM Transactions on Information Systems
language:en
Short-container-title:ACM Trans. Inf. Syst.

Author:

Yang Tianchi¹,Hu Linmei¹,Shi Chuan¹,Ji Houye¹,Li Xiaoli²,Nie Liqiang³

Affiliation:

1. Beijing University of Posts and Telecommunications, Beijing, China

2. Institute for Infocomm Research, Singapore

3. Shandong University, Shandong Province, China

Abstract

Short text classification has been widely explored in news tagging to provide more efficient search strategies and more effective search results for information retrieval. However, most existing studies concentrate on long text classification and deliver unsatisfactory performance on short texts because of the sparsity issue and the insufficiency of labeled data. In this article, we propose a novel heterogeneous graph neural network-based method for semi-supervised short text classification, which takes full advantage of limited labeled data and large unlabeled data through information propagation along the graph. Specifically, we first present a flexible heterogeneous information network (HIN) framework for modeling short texts, which can integrate any type of additional information while capturing their relations to address the semantic sparsity. Then, we propose Heterogeneous Graph Attention networks (HGAT) to embed the HIN for short text classification based on a dual-level attention mechanism, including node-level and type-level attentions. To efficiently classify newly arriving texts that do not previously exist in the HIN, we extend our model HGAT for inductive learning, avoiding re-training the model on the evolving HIN. Extensive experiments on single-/multi-label classification demonstrate that our proposed model HGAT significantly outperforms state-of-the-art methods across the benchmark datasets under both transductive and inductive learning.

Funder

National Natural Science Foundation of China

National Key Research and Development Program of China

Publisher

Association for Computing Machinery (ACM)

Subject

Computer Science Applications; General Business, Management and Accounting; Information Systems

Link

https://dl.acm.org/doi/pdf/10.1145/3450352

Reference 57 articles.

1. A Deep Learning Architecture for Psychometric Natural Language Processing

Cited by 99 articles.

1. Edge-enhanced minimum-margin graph attention network for short text classification;Expert Systems with Applications;2024-10

2. A Survey on Recommender Systems using Graph Neural Network;ACM Transactions on Information Systems;2024-09-06

3. Graph Attention Networks: A Comprehensive Review of Methods and Applications;Future Internet;2024-09-03

4. Node and Edge Joint Embedding for Heterogeneous Information Network;Big Data Mining and Analytics;2024-09

5. DRML-Ensemble: drug repurposing method based on feature construction of multi-layer ensemble;Journal of Molecular Modeling;2024-07-31