Affiliation:
1. College of Computer Science and Engineering, Shandong University of Science and Technology, Qingdao 266590, China
2. College of Mathematics and Systems Science, Shandong University of Science and Technology, Qingdao 266590, China
3. College of Mechanical and Electronic Engineering, Shandong University of Science and Technology, Qingdao 266590, China
Abstract
With the development of Internet technology, government affairs can be handled online. More and more citizens are using online platforms to report to government departments, which is generating a lot of textual data. Among them, the basic but important problem is to automatically classify the different categories of messages, so that staff from different departments can process relevant information quickly. However, government messages have problems such as fast update rate, a large amount of information, long texts, and difficulty in capturing key points, which make supervised learning methods unsuitable for processing such texts. To address these problems, we propose a semisupervised text classification method based on a transformer-based pointer generator network named Ptr4BERT, which uses the pointer generator network with BERT(bidirectional encoder representation from transformers) embedding as a preprocessor for feature extraction. In this method, text classification can achieve very good results with a small set of labeled data, by extracting features exclusively from the message text. In order to verify the effect of our proposed model, we performed some experiments. Besides, we designed a crawler program and obtained two datasets from different websites, which are named HNMes and QDMes. Experimental results have shown that the proposed method outperforms the state-of-the-art methods significantly.
Funder
National College Students Innovation and Entrepreneurship Training Program
Reference30 articles.
1. Abstractive Review Summarization based on Improved Attention Mechanism with Pointer Generator Network Model
2. Sentic Computing for Aspect-Based Opinion Summarization Using Multi-Head Attention with Feature Pooled Pointer Generator Network;A. Kumar;Cognitive Computation,2021
3. Extracting relational facts based on hybrid Syntax-Guided transformer and pointer network
4. DeepDepict
5. Entity relations based pointer-generator network for abstractive text summarization;T. Huang
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献