Research on Long Text Classification Model Based on Multi-Feature Weighted Fusion

Published:2022-06-28 Issue:13 Volume:12 Page:6556
ISSN:2076-3417
Container-title:Applied Sciences
language:en

Author:

Yue Xi,Zhou Tao,He Lei,Li Yuxia

Abstract

Long text classification has become challenging as the volume and complexity of text data across Internet domains grow and as extracting features from long texts becomes harder. This paper proposes a long text classification model based on multi-feature weighted fusion to address contextual semantic relations, long-distance global relations, and polysemous words in long text classification tasks. The BERT model produces feature representations containing the global semantic and contextual information of the text. Convolutional neural networks extract features at different levels, which are combined with attention mechanisms to obtain weighted local features. The global contextual features are then fused with the weighted local features, and the classification result is obtained through equal-length convolution and pooling. Experimental results show that, on the same data sets, the proposed model outperforms traditional deep learning classification models in accuracy, precision, recall, F1 score, and other metrics, indicating a clear advantage in long text classification.
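
The pipeline described in the abstract (attention-weighted local features, fusion with a global feature, then equal-length convolution and pooling) can be illustrated with a small pure-Python sketch. This is not the authors' implementation: the toy vectors stand in for BERT and CNN outputs, and the dot-product attention score, fusion by concatenation, the fixed smoothing kernel, and the pooling size and stride are all assumptions chosen for illustration.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_pool(local_feats, query):
    """Weight each local feature vector by its dot-product score against
    the query (assumed scoring function), then return the weighted sum."""
    scores = [sum(a * b for a, b in zip(f, query)) for f in local_feats]
    weights = softmax(scores)
    dim = len(query)
    pooled = [sum(w * f[i] for w, f in zip(weights, local_feats))
              for i in range(dim)]
    return pooled, weights


def equal_length_conv(seq, kernel):
    """1-D convolution with zero padding so that the output has the same
    length as the input. Assumes an odd-length kernel."""
    pad = len(kernel) // 2
    padded = [0.0] * pad + list(seq) + [0.0] * pad
    return [sum(k * padded[i + j] for j, k in enumerate(kernel))
            for i in range(len(seq))]


def max_pool(seq, size=3, stride=2):
    """Max pooling over sliding windows (size and stride are assumptions)."""
    return [max(seq[i:i + size])
            for i in range(0, len(seq) - size + 1, stride)]


# Hypothetical stand-in for a global contextual feature (e.g. a BERT output).
global_feat = [0.5, -0.2, 0.1, 0.8]
# Hypothetical stand-ins for local features from CNN branches.
local_feats = [
    [0.4, 0.0, 0.3, 0.9],
    [-0.6, 0.2, 0.1, -0.1],
    [0.1, 0.1, 0.0, 0.2],
]

weighted_local, weights = attention_pool(local_feats, global_feat)
fused = global_feat + weighted_local          # fusion by concatenation (assumed)
conv_out = equal_length_conv(fused, [0.25, 0.5, 0.25])
pooled = max_pool(conv_out)
```

In a real model the kernel weights and the attention scoring would be learned, and the pooled output would feed a classification layer. The sketch only shows how the shapes flow: the convolution preserves the length of the fused vector, and the pooling step shortens it.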

Funder

Science and Technology Department of Sichuan Province

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science

Link

https://www.mdpi.com/2076-3417/12/13/6556/pdf

Cited by 2 articles.

1. RQ-OSPTrans: A Semantic Classification Method Based on Transformer That Combines Overall Semantic Perception and “Repeated Questioning” Learning Mechanism;Applied Sciences;2024-05-17

2. Advances in Artificial Intelligence for Perception Augmentation and Reasoning;Applied Sciences;2023-03-27