Author:
Gao Yue,Fu Xiangling,Liu Xien,Wu Ji
Abstract
AbstractGraph-based neural networks and unsupervised pre-trained models are both cutting-edge text representation methods, given their outstanding ability to capture global information and contextualized information, respectively. However, both representation methods meet obstacles to further performance improvements. On one hand, graph-based neural networks lack knowledge orientation to guide textual interpretation during global information interaction. On the other hand, unsupervised pre-trained models imply rich semantic and syntactic knowledge which lacks sufficient induction and expression. Therefore, how to effectively integrate graph-based global information and unsupervised contextualized semantic and syntactic information to achieve better text representation is an important issue pending for solution. In this paper, we propose a representation method that deeply integrates Unsupervised Semantics and Syntax into heterogeneous Graphs (USS-Graph) for inductive text classification. By constructing a heterogeneous graph whose edges and nodes are totally generated by knowledge from unsupervised pre-trained models, USS-Graph can harmonize the two perspectives of information under a bidirectionally weighted graph structure and thereby realizing the intra-fusion of graph-based global information and unsupervised contextualized semantic and syntactic information. Based on USS-Graph, we also propose a series of optimization measures to further improve the knowledge integration and representation performance. Extensive experiments conducted on benchmark datasets show that USS-Graph consistently achieves state-of-the-art performances on inductive text classification tasks. Additionally, extended experiments are conducted to deeply analyze the characteristics of USS-Graph and the effectiveness of our proposed optimization measures for further knowledge integration and information complementation.
Funder
Natural Science Foundation of Beijing Municipality
National Natural Science Foundation of China
BUPT Excellent Ph.D. Students Foundation
Publisher
Springer Science and Business Media LLC
Subject
Computational Mathematics,Engineering (miscellaneous),Information Systems,Artificial Intelligence
Reference52 articles.
1. Shaaban MA, Hassan YF, Guirguis SK (2021) Deep convolutional forest: a dynamic deep ensemble approach for spam detection in text. Complex Intel Syst 8:4897–4909
2. Wang AH (2010) Don’t follow me: Spam detection in twitter. In: 2010 International Conference on Security and Cryptography (SECRYPT), pp. 1–10
3. Rashid A, Farooq MS, Abid A, Umer T, Bashir, AK, Zikria YB (2021) Social media intention mining for sustainable information systems: categories, taxonomy, datasets and challenges. Complex & Intelligent Systems
4. Shekhar S, Garg H, Agrawal R, Shivani S, Sharma B (2021) Hatred and trolling detection transliteration framework using hierarchical lstm in code-mixed social media text. Complex & Intelligent Systems
5. Che Z, Kale D, Li W, Bahadori MT, Liu Y (2015) Deep Computational Phenotyping, pp. 507–516. Association for Computing Machinery, New York, NY, USA