Word Mining Research Based on Intelligent Algorithms

Author:

Chu Ruilin

Abstract

Wordle is a popular puzzle that The New York Times currently provides every day, and it has a high popularity. Among them, the number of results reported every day, the characteristics of words and other data have attracted widespread attention. This paper first used the ARIMA model to predict the number of daily reported outcomes and found that it was only accurate for the linear part of the data. Then, this paper used the LSTM neural network model to predict, and found that the LSTM model can predict the nonlinear part of the data well, which just makes up for the deficiency of the ARIMA model, and the predicted results are basically consistent with the original data. The data range of March 1st is [17586.36, 44379.83]. Further, this paper adopted the LSTM neural network model based on genetic algorithm optimization, which can solve the over-fitting problem that may occur in the LSTM neural network due to too few data sets. Finally, the SVM multi-classification model are used. According to the quantified word feature labels, the difficulty of words is divided into three categories: hard, medium, and easy. Using existing data tests, it’s proved that the classification accuracy is very high.

Publisher

Darcy & Roy Press Co. Ltd.

Reference10 articles.

1. Liu Chengliang. Research on Air Quality Index Evolution Prediction Model Combining GCN and LSTM [D]. Nanjing: Nanjing University of Posts and Telecommunications. 2022.

2. Longfuhai. Study on the feature selection method based on the optimization of genetic algorithms [D]. Guiyang: Guizhou National University, 2022.

3. Okkalioglu Murat. Imbalance text classification with relative imbalance ratio [J]. Expert Systems with Applications,2023, Volume 217, Issue.

4. Luo Mao. Research on Support Vector Machine Optimization Algorithm Based on Improved Multiverse Algorithm [D]. Changchun: Jilin University, 2022.

5. Hans van Halteren. Improving Accuracy in Word Class Tagging through the Combination of Machine Learning Systems [J]. Computational Linguistics ,2001, 27 (2): 199–229.

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3