Analysis Of Wordle Games Based on Multiple Logistic Regression and Clustering

Author:

Liu Guangsheng,Cai Binjian,Liao Junlin

Abstract

The development of social media has led to an increasing number of people participating in the online guessing game Wordle, which has become a popular phenomenon on Twitter. Understanding the difficulty of the Wordle game is important for both players and game designers. This article aims to gain an in-depth understanding of the game by analyzing its gameplay reports and word difficulty. Firstly, this article predicts the distribution of Wordle report results. Using a multivariate logistic regression model, the probability distribution prediction is converted into a classification problem, with word attributes and the average number of guesses per player as independent variables for prediction. At the same time, this article introduces random noise into the model prediction to address low probability issues. The model is evaluated using MAE and RMSE, and sensitivity analysis is conducted using k-fold cross-validation. In order to further evaluate the difficulty of words in the game, this article uses the entropy weight-TOPSIS method combined with the Bi-Kmeans classification model to classify word difficulty. The model results show that word difficulty can be divided into five categories. Finally, this article evaluates the performance of the model using Davies-Bouldin Index and the Silhouette Coefficient, and the results show that the model has strong reliability and persuasiveness.

Publisher

Darcy & Roy Press Co. Ltd.

Reference14 articles.

1. Liu C L. Using wordle for learning to design and compare strategies[C]//2022 IEEE Conference on Games (CoG). IEEE, 2022: 465-472.

2. Pamungkas N A R. The Effects of Wordle Media on Students’ Vocabulary Mastery [J]. JETAL: Journal of English Teaching & Applied Linguistic, 2021, 2(2): 56-61.

3. Liu X, Teng SH, Long FAN et al. Research on early warning model of road icing based on multiple logistic regression[J]. Hunan Transportation Science and Technology,2022,48(04):101-107.

4. Ma, J., & Sun, Y. (2015). Multinomial logistic regression-based health risk assessment using biomonitoring data from the Canadian Health Measures Survey. Environmental Research, 138, 185-192.

5. Menard, S. (2002). Applied logistic regression analysis (Vol. 106). Sage.

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3