Abstract
With the wide spread of massive open online courses ( MOOC ), millions of people have enrolled in many courses, but the dropout rate of most courses is more than 90%. Accurately predicting the dropout rate of MOOC is of great significance to prevent learners’ dropout behavior and reduce the dropout rate of students. Using the PH278x curriculum data on the Harvard X platform in spring 2013, and based on the statistical analysis of the factors that may affect learners’ final completion of the curriculum from two aspects: learners’ own characteristics and learners’ learning behavior, we established the MOOC dropout rate prediction models based on logical regression, K nearest neighbor and random forest, respectively. Experiments with five evaluation metrics (accuracy, precision, recall, F1 and AUC) show that the prediction model based on random forest has the highest accuracy, precision, F1 and AUC, which are 91.726%, 93.0923%, 95.4145%, 0.925341, respectively, its performance is better than that of the prediction model based on logical regression and that of the model based on K-nearest neighbor, whose values of these metrics are 91.395%, 92.8674%, 95.2337%, 0.912316 and 91.726%, 93.0923%, 95.4145% and 0.925341, respectively. As for recall metrics, the value of random forest is higher than that of KNN, but slightly lower than that of logistic regression, which are 0.992476, 0.977239 and 0.978555, respectively. Then, we conclude that random forests perform best in predicting the dropout rate of MOOC learners. This study can help education staff to know the trend of learners’ dropout behavior in advance, so as to put some measures to reduce the dropout rate before it occurs, thus improving the completion rate of the curriculum.
Funder
Foundation of Shandong Jiaotong University
Horizontal Project
Key project of art science in Shandong Province in 2022
Subject
Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science
Reference30 articles.
1. Liu, R. (2017). A Study on the Prediction Model of Students’ MOOC Dropout. [Master’s Thesis, Central China Normal University].
2. Zheng, Y., and Yin, B. (2015, January 26–28). Big Data Analytics in MOOCs. Ubiquitous Computing and Communications; Dependable, Autonomic and Secure Computing. Pervasive Intelligence and Computing. Proceedings of the 2015 IEEE International Conference on Computer and Information Technology, Liverpool, UK.
3. Educational Data Mining: Dropout Prediction in XuetangX MOOCs;Xu;Neural Process. Lett.,2022
4. MOOCs dropout rate prediction method based on deep learning;Xia;Comput. Eng. Sci.,2019
5. Taylor, C. (2014). Stopout Prediction in Massive Open Online Courses. [Master’s Thesis, Massachusetts Institute of Technology].
Cited by
7 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献