Topic-BERT: Detecting harmful information from social media

Published: 2021-09-27 Issue: 3 Volume: 15 Pages: 333-342
ISSN: 1872-4981
Container-title: Intelligent Decision Technologies
Short-container-title: IDT

Author:

Gao Wang¹, Deng Hongtao¹, Zhu Xun¹, Fang Yuan²

Affiliation:

1. School of Artificial Intelligence, Jianghan University, Hubei, China

2. School of Computer Science and Technology, Wuhan University of Technology, Hubei, China

Abstract

Harmful information identification is a critical research topic in natural language processing. Existing approaches have focused on either rule-based methods or harmful text identification in normal documents. In this paper, we propose Topic-BERT, a BERT-based model for identifying harmful information on social media. First, Topic-BERT uses BERT to take additional information as input, which alleviates the sparseness of short texts, and uses the GPU-DMM topic model to capture the hidden topics of short texts for attention weight calculation. Second, the proposed model divides harmful short text identification into two stages, in which two similar sub-models identify labels of different granularity. Finally, we conduct extensive experiments on a real-world social media dataset to evaluate our model. The experimental results demonstrate that our model can significantly improve classification performance compared with baseline methods.
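
The two-stage idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the first stage assigns a coarse label and the second stage assigns a finer label, and it replaces the BERT-based sub-models with toy keyword rules. The function names, label names, and keywords are all hypothetical.

```python
from typing import Callable, Dict, Optional, Tuple

# A classifier maps a text to a label. In the paper these would be the
# BERT-based sub-models; here they are hypothetical stand-ins.
Classifier = Callable[[str], str]


def two_stage_classify(
    text: str,
    coarse: Classifier,
    fine_by_coarse: Dict[str, Classifier],
) -> Tuple[str, Optional[str]]:
    """Assign a coarse label first, then a fine label if a second-stage
    classifier is registered for that coarse label (an assumption of
    this sketch, not a detail confirmed by the abstract)."""
    coarse_label = coarse(text)
    fine = fine_by_coarse.get(coarse_label)
    fine_label = fine(text) if fine is not None else None
    return coarse_label, fine_label


# Toy keyword rules, purely illustrative.
def toy_coarse(text: str) -> str:
    keywords = ("cheap pills", "idiot")
    return "harmful" if any(k in text.lower() for k in keywords) else "normal"


def toy_fine(text: str) -> str:
    return "spam" if "pills" in text.lower() else "abuse"


if __name__ == "__main__":
    for sample in ("Buy cheap pills now", "Nice weather today"):
        print(sample, "->", two_stage_classify(sample, toy_coarse, {"harmful": toy_fine}))
```

Splitting the decision this way lets each sub-model specialize on one level of label granularity, which is the motivation the abstract gives for using two similar sub-models.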

Publisher

IOS Press

Subject

Artificial Intelligence, Computer Vision and Pattern Recognition, Human-Computer Interaction, Software

References: 32 articles.

1. Xu G, Qi C, Yu H, Xu S, Zhao C, Yuan J. Detecting sensitive information of unstructured text using convolutional neural network. In: International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery (CyberC); 2019. pp. 474–479.

2. Devlin J, Chang M, Lee K, Toutanova K. BERT: Pre-training of deep bidirectional transformers for language understanding. In: Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT); 2019. pp. 4171–4186.

3. Gao. Detecting disaster-related tweets via multimodal adversarial neural network. IEEE MultiMedia; 2020.

4. Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, et al. Attention is all you need. In: Advances in Neural Information Processing Systems (NIPS); 2017. pp. 5998–6008.

5. Karacapilidis. A novel framework for augmenting the quality of explanations in recommender systems. Intelligent Decision Technologies; 2017.

Cited by 2 articles.

1. TGNN: Topic-aware Graph Neural Network for Identifying Fake News from Social Media;2022 IEEE 2nd International Conference on Electronic Technology, Communication and Information (ICETCI);2022-05-27

2. Predict the popularity of social content during crisis based on CRFTM-BERT;International Conference on Electronic Information Engineering, Big Data, and Computer Technology (EIBDCT 2022);2022-05-06