Affiliation:
1. School of Computer and Information Engineering, JiangXi Normal University, Nanchang 330224, China
2. School of Digital Industry, JiangXi Normal University, Shangrao 334000, China
Abstract
Manually tuning the hyperparameters of a deep learning model is not only a time-consuming and labor-intensive process, but it can also easily lead to issues like overfitting or underfitting, hindering the model’s full convergence. To address this challenge, we present a BiLSTM-TCSA model (BiLSTM combine TextCNN and Self-Attention) for deep learning-based sentiment analysis of short texts, utilizing an improved particle swarm optimization (IPSO). This approach mimics the global random search behavior observed in bird foraging, allowing for adaptive optimization of model hyperparameters. In this methodology, an initial step involves employing a Generative Adversarial Network (GAN) mechanism to generate a substantial corpus of perturbed text, augmenting the model’s resilience to disturbances. Subsequently, global semantic insights are extracted through Bidirectional Long Short Term Memory networks (BiLSTM) processing. Leveraging Convolutional Neural Networks for Text (TextCNN) with diverse convolution kernel sizes enables the extraction of localized features, which are then concatenated to construct multi-scale feature vectors. Concluding the process, feature vector refinement and the classification task are accomplished through the integration of Self-Attention and Softmax layers. Empirical results underscore the effectiveness of the proposed approach in sentiment analysis tasks involving succinct texts containing limited information. Across four distinct datasets, our method attains impressive accuracy rates of 91.38%, 91.74%, 85.49%, and 94.59%, respectively. This performance constitutes a notable advancement when compared against conventional deep learning models and baseline approaches.
Funder
National Natural Science Foundation of China
Natural Science Foundation project of JiangXi province
Subject
Electrical and Electronic Engineering,Computer Networks and Communications,Hardware and Architecture,Signal Processing,Control and Systems Engineering
Reference42 articles.
1. Vries, A., Mamoulis, N., and Nes, N. (2002, January 3–6). Efficient k-NN search on vertically decomposed data. Proceedings of the 2002 ACM SIGMOD International Conference on Management of Data, Madison, WI, USA.
2. Text Classification Based on Naive Bayes Algorithm with Feature Selection;Chen;Int. J. Inf.,2012
3. Automatic text Classification based on KNN+ Hierarchical SVM;Wang;Comput. Appl. Softw.,2016
4. Deep learning for sentiment analysis;Rojas;Ling. Linguist. Compass,2016
5. Kim, Y. (2014, January 25–29). Convolutional Neural Networks for Sentence Classification. Proceedings of the Conference on Empirical Methods in Natural Language Processing, Doha, Qatar.
Cited by
4 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献