Author:
Selim Kamal Samy, Rezk Sahar Saeed
Abstract
Dropout from compulsory schooling is a serious problem that affects not only education systems but also the developmental progress of a country as a whole. Identifying the risk of dropping out, and characterizing its main determinants, could help decision-makers design policies to eradicate this persistent problem and reduce its social and economic costs over time. Based on a substantially imbalanced Egyptian survey dataset, this paper aims to develop a logistic classifier capable of early prediction of students at risk of dropping out. Training a classifier on an imbalanced dataset usually weakens its performance, especially with respect to false negatives. For this reason, an extensive comparative analysis is conducted to investigate a variety of resampling techniques. More specifically, based on eight under-sampling techniques, four over-sampling techniques, and their mutually exclusive mixed pairs, forty-five resampling experiments are conducted on the dataset to build the best possible logistic classifier. The main contribution of this paper is an explicit predictive model for school dropout in Egypt that could be used to identify the vulnerable students who continuously feed this chronic problem. The key vulnerability factors identified by the suggested classifier are student chronic diseases, co-education, parents' illiteracy, educational performance, and teacher caring. These factors match those found in much of the research previously conducted in similar countries. Accordingly, educational authorities could confidently monitor these factors and tailor suitable actions for early intervention.
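The figure of forty-five experiments is consistent with one simple reading of the design, sketched below as an assumption rather than as the authors' stated procedure: one baseline run without resampling, each of the twelve techniques applied alone, and every pairing of one under-sampler with one over-sampler. The technique labels are hypothetical placeholders, since the abstract gives only the counts.

```python
from itertools import product

# Placeholder labels (hypothetical): the abstract gives counts, not technique names.
under_samplers = [f"under_{i}" for i in range(1, 9)]  # eight under-sampling techniques
over_samplers = [f"over_{j}" for j in range(1, 5)]    # four over-sampling techniques

# Assumed design: a no-resampling baseline, each technique on its own,
# and every mixed pair of one under-sampler with one over-sampler.
experiments = [("baseline",)]
experiments += [(u,) for u in under_samplers]
experiments += [(o,) for o in over_samplers]
experiments += list(product(under_samplers, over_samplers))

print(len(experiments))  # 1 + 8 + 4 + 8 * 4
```

Under this reading, each tuple would correspond to one resampled training set on which a logistic classifier is fitted and compared.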
Publisher
Springer Science and Business Media LLC
Subject
Library and Information Sciences, Education
Cited by
7 articles.