Text Classification Research with Attention-based Recurrent Neural Networks

Author:

Du Changshun,Huang Lei

Abstract

Text classification is one of the principal tasks of machine learning. It aims to design proper algorithms to enable computers to extract features and classify texts automatically. In the past, this has been mainly based on the classification of keywords and neural network semantic synthesis classification. The former emphasizes the role of keywords, while the latter focuses on the combination of words between roles. The method proposed in this paper considers the advantages of both methods. It uses an attention mechanism to learn weighting for each word. Under the setting, key words will have a higher weight, and common words will have lower weight. Therefore, the representation of texts not only considers all words, but also pays more attention to key words. Then we feed the feature vector to a softmax classifier. At last, we conduct experiments on two news classification datasets published by NLPCC2014 and Reuters, respectively. The proposed model achieves F-values by 88.5% and 51.8% on the two datasets. The experimental results show that our method outperforms all the traditional baseline systems.

Publisher

Agora University of Oradea

Subject

Computational Theory and Mathematics,Computer Networks and Communications,Computer Science Applications

Cited by 64 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. K-Nearest neighbor smart contract classification with semantic feature enhancement;The Computer Journal;2024-07-27

2. Email spam detection by deep learning models using novel feature selection technique and BERT;Egyptian Informatics Journal;2024-06

3. A text classification network model combining machine learning and deep learning;International Journal of Sensor Networks;2024

4. A New Multi-class Classification Method Based on Machine Learning to Document Classification.;2023 16th International Conference on Developments in eSystems Engineering (DeSE);2023-12-18

5. A Comparative Analysis of Deep Learning Approaches in Bangla Document Categorization;2023 26th International Conference on Computer and Information Technology (ICCIT);2023-12-13

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3