Abstract
Student dropout, defined as the abandonment of a high education program before obtaining the degree without reincorporation, is a problem that affects every higher education institution in the world. This study uses machine learning models over two Chilean universities to predict first-year engineering student dropout over enrolled students, and to analyze the variables that affect the probability of dropout. The results show that instead of combining the datasets into a single dataset, it is better to apply a model per university. Moreover, among the eight machine learning models tested over the datasets, gradient-boosting decision trees reports the best model. Further analyses of the interpretative models show that a higher score in almost any entrance university test decreases the probability of dropout, the most important variable being the mathematical test. One exception is the language test, where a higher score increases the probability of dropout.
Funder
Agencia Nacional de Investigación y Desarrollo
Subject
General Mathematics,Engineering (miscellaneous),Computer Science (miscellaneous)
Reference76 articles.
1. Draft Preliminary Report Concerning the Preparation of a Global Convention on the Recognition of Higher Education Qualificationshttps://unesdoc.unesco.org/ark:/48223/pf0000234743
2. 23 Remarkable Higher Education Statisticshttps://markinstyle.co.uk/higher-education-statistics/
3. A comparative analysis of machine learning techniques for student retention management
4. College Dropout Rateshttps://educationdata.org/college-dropout-rates/
5. UK Has ‘Lowest Drop-Out Rate in Europe’https://www.timeshighereducation.com/news/uk-has-lowest-drop-out-rate-in-europe/2012400.article
Cited by
19 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献