Author:
Verma Swati, ,Yadav Rakesh Kumar,Kholiya Kuldeep
Abstract
In the current age, students' academic performance deterioration is a very crucial problem in engineering education. Prediction of low-performing students at an early stage is important so that their faculties and administration could provide timely support. The present study attempts to perform this prediction task at the entry-time with the help of four single supervised educational data mining algorithms, namely: Decision tree, Naïve Bayes, k-Nearest Neighbor, and Support Vector Machine along with an ensemble method called “Random Forest”. These classifiers have been applied to a students‟ dataset of an Indian Engineering College, having four categories of parameters viz., student‟s background, academic, social, and psychological parameters. Different libraries of Python programming language such as Pandas, Seaborn, Scikit-learn, and Scipy were used for analysis, visualization, classification, and statistics computation, respectively. The present study shows that among all of the five algorithms, Naïve Bayes gives the highest accuracy with 89%, and finally to improve the results, a model is proposed in which three Naïve Bayes classifiers were integrated with the help of 'Bagging'. The achieved accuracy with the proposed model was 91%, with the highest recall and highest precision for identifying low performers.
Subject
Computer Science Applications,Education
Reference30 articles.
1. [1] A. Buldu, and K. Üçgün, "Data mining application on students' data," Procedia Social and Behavioral Sciences, vol. 2, pp. 5251-5259, 2010.
2. [2] B. K. Bhardwaj and S. Pal, "Data mining: A prediction for performance improvement using classification," International Journal of Computer Science and Information Security, vol. 9, no. 4, pp. 136-140, 2011.
3. [3] D. Kabakchieva, "Predicting student performance by using data mining methods for classification," Cybernetics and Information Technologies, vol. 13, no. 1, pp. 61-72, 2013.
4. [4] A. K. Pal and S. Pal, "Analysis and mining of educational data for predicting the performance of students," International Journal of Electronics Communication and Computer Engineering, vol. 4, no. 5, pp. 1560-1565, 2013.
5. [5] S. Huang and N. Fang, "Predicting student academic performance in an engineering dynamics course: A comparison of four types of predictive mathematical models," Computers & Education, vol. 61, pp. 133-145, 2013.
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Modeling Sentiment Analysis for Educational Texts by Combining BERT and FastText;2024 6th International Conference on Computer Science and Technologies in Education (CSTE);2024-04-19
2. Methodological Implementation for Predicting Student Performance Using Data Mining Classifiers and Machine Learning;2023 International Conference on Computing, Communication, and Intelligent Systems (ICCCIS);2023-11-03
3. Estimating Answer Strategies using Online Handwritten Data: A Study using Geometry Problems;The 15th International Conference on Education Technology and Computers;2023-09-26