Contextualizing the current state of research on the use of machine learning for student performance prediction: A systematic literature review

Author:

Alalawi Khalid12,Athauda Rukshan1,Chiong Raymond1ORCID

Affiliation:

1. School of Information and Physical Sciences The University of Newcastle Callaghan New South Wales Australia

2. College of Computer Science and Engineering Taibah University Medina Saudi Arabia

Abstract

AbstractToday, educational institutions produce large amounts of data with the deployment of learning management systems. These large datasets provide an untapped potential to support and enhance decision‐making and operations. In recent times, machine learning (ML) has been applied to develop models utilizing this “big” data to assist in decision‐making. This study presents a systematic literature review into the application of ML to predict student performance. A total of 162 research articles from January 2010 to October 2022 were critically reviewed and analyzed by applying Kitchenham's systematic literature review approach. Our analysis categorized the literature predicting students' academic performance into two categories: (i) predicting student performance in assessments, courses or programs, and identifying students at‐risk of failing their course/program (129 studies); and (ii) predicting student dropout or retention in a course or program (33 studies). Classification is the most commonly used approach for predicting student performance (138 studies), followed by regression (25 studies) and clustering (9 studies). Supervised learning methods are used more often than semi‐supervised learning. Five most popular ML algorithms include the Decision Tree, Random Forest, Naïve Bayes, Artificial Neural Network, and Support Vector Machine. Historical records of students' grades and class performance, academic related data from learning management systems, and students' demographics are the most common features used for predicting students' performance. The most common methods used for feature selection are Information Gain‐based selection algorithms, Correlation‐based feature selection, and Gain Ratio. The general platforms/tools/libraries used in the studies include WEKA, Python, R, Rapid Miner, and MATLAB. We also investigated possible actions considered in the literature to help at‐risk students. We only found very few studies that deployed remedial actions and evaluated their impact on students' performance. In conclusion, ML has shown great potential in the prediction of student performance, but also has many areas of further research.

Publisher

Wiley

Subject

General Engineering,General Computer Science

Cited by 3 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3