Affiliation:
1. Information Center, Jiangsu Academy of Agricultural Sciences & Institute of Science and Technology Information, Jiangsu University, China
2. Information Center, Jiangsu Academy of Agricultural Sciences, China
Abstract
Short text classification is a research focus in natural language processing (NLP) and is widely used in news classification, sentiment analysis, mail filtering, and other fields. In recent years, deep learning techniques have been applied to text classification and have made notable progress. Unlike ordinary text, short text suffers from a limited vocabulary and feature sparsity, which place higher demands on semantic feature representation. To address this issue, this paper proposes a feature fusion framework based on Bidirectional Encoder Representations from Transformers (BERT). In this hybrid method, BERT is used to train the word vector representation, a convolutional neural network (CNN) captures static features, and, as a complement, a bidirectional gated recurrent unit (BiGRU) network captures contextual features. Furthermore, an attention mechanism is introduced to assign weights to salient words. The experimental results confirm that the proposed model significantly outperforms other state-of-the-art baseline methods.
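The fusion described in the abstract — BERT word vectors feeding a CNN branch for static local features and a BiGRU-with-attention branch for contextual features, concatenated before classification — can be sketched as below. This is a minimal illustration, not the authors' implementation: a trainable embedding layer stands in for the pretrained BERT encoder so the sketch stays self-contained, and all layer sizes and names (e.g. `BertCNNBiGRUAttention`, `n_filters`, `gru_hidden`) are assumptions.

```python
import torch
import torch.nn as nn

class BertCNNBiGRUAttention(nn.Module):
    """Sketch of the BERT + CNN + BiGRU + attention fusion from the abstract.

    A trainable nn.Embedding stands in for BERT here; in practice the
    (batch, seq_len, hidden) states of a pretrained BERT encoder would
    be fed to the two branches instead.
    """

    def __init__(self, vocab_size=1000, emb_dim=64, n_filters=32,
                 gru_hidden=32, n_classes=4):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb_dim)  # stand-in for BERT
        # CNN branch: static local (n-gram) features
        self.conv = nn.Conv1d(emb_dim, n_filters, kernel_size=3, padding=1)
        # BiGRU branch: contextual features
        self.bigru = nn.GRU(emb_dim, gru_hidden, batch_first=True,
                            bidirectional=True)
        # Additive attention scores over BiGRU states: weight salient words
        self.attn = nn.Linear(2 * gru_hidden, 1)
        # Classifier over the fused (concatenated) feature vector
        self.fc = nn.Linear(n_filters + 2 * gru_hidden, n_classes)

    def forward(self, token_ids):
        x = self.embed(token_ids)                     # (B, T, E)
        # CNN branch with max-over-time pooling
        c = torch.relu(self.conv(x.transpose(1, 2)))  # (B, F, T)
        c = c.max(dim=2).values                       # (B, F)
        # BiGRU branch pooled by attention weights
        h, _ = self.bigru(x)                          # (B, T, 2H)
        w = torch.softmax(self.attn(h), dim=1)        # (B, T, 1)
        g = (w * h).sum(dim=1)                        # (B, 2H)
        # Feature fusion: concatenate static and contextual features
        return self.fc(torch.cat([c, g], dim=1))      # (B, n_classes)
```

For a batch of 2 sequences of 12 token ids, `BertCNNBiGRUAttention()(torch.randint(0, 1000, (2, 12)))` yields logits of shape `(2, 4)`.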
Subject
Strategy and Management, Computer Science Applications, Human-Computer Interaction
References (37 articles)
1. Bijalwan, V. (2014). KNN based machine learning approach for text and document mining. International Journal of Database Theory and Application.
2. Chen, Z. (2019). Short text classification based on word2vec and improved TDFIDF merge weighting. Paper presented at the 2019 3rd International Conference on Electronic Information Technology and Computer Engineering (EITCE).
3. Cho, K., Van Merriënboer, B., Gulcehre, C., Bahdanau, D., Bougares, F., Schwenk, H., & Bengio, Y. (2014). Learning phrase representations using RNN encoder-decoder for statistical machine translation. arXiv preprint arXiv:1406.1078.
4. Devlin, J., Chang, M.-W., Lee, K., & Toutanova, K. (2018). Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805.
5. Fu, Y.-P. (2018). Sentence Classification Using Novel NIN. Journal of Computers.
Cited by 23 articles.