Abstract
The aim of this research is to determine the important variables that predict the PISA 2018 reading comprehension achievement score of countries with different achievement levels, using 34 independent variables obtained from the student questionnaire given to the students who participated in PISA in 2018. For this purpose, 79 countries that participated PISA were ranked according to their success percentages then, these countries were sorted into lower, middle and upper group countries. A sample of lower, middle and upper group countries was formed then, three countries were selected from each of the lower group, middle group and upper group countries and a sample of lower, middle and upper group countries was formed. Data mining analyzes were carried out on the samples obtained by using the Classification and Regression Tree and Random Forest methods. It has been observed that the number of important variables that predict reading comprehension success can be reduced from 34 to three to eight. Like this; Data mining classification prediction models, which can predict the success level of PISA, were obtained by using a small number of variables. It has been determined that the models obtained have an acceptable level of predictive performance in predicting success in three categories (low, medium-high). The most important predictor variables obtained from the models are information and communication technologies resources, perception of reading difficulty, professional status expected from the student, perception of difficulty in the PISA test, reading pleasure, weekly test language learning time, disciplinary climate, socio-economic status index.
Publisher
Yuzuncu Yil Universitesi Egitim Fakultesi Dergisi
Reference41 articles.
1. Abad, F. M., & Lopez, A. C. (2016). Data-mining techniques in detecting factors linked to academic achievement. An International Journal of Research, Policy and Practice, 28 (1), 39-55. https://doi.org /10.1080/09243453.2016.1235591
2. Abdous, M., He, W., & Yen, C. J. (2012). Using data mining for predicting relationships between online question theme and final grade. Journal of Educational Technology ve Society, 15(3), 77-88.
3. Akman, M. (2010). Veri madenciliğine genel bakış ve random forests yönteminin incelenmesi: Sağlık alanında bir uygulama [Yayınlanmamış yüksek lisans tezi]. Ankara Üniversitesi Sağlık Bilimleri Enstitüsü, Ankara.
4. Aksu, G., & Güzeller, C. O. (2016). Classification of PISA 2012 mathematical literacy scores using decision-tree method. Journal of Education and Science, 41(185), 101-122. https://doi.org 10.15390/EB.2016.4766
5. Altunkaynak, B. (2019). Veri madenciliği yöntemleri ve r uygulamaları. (2.Baskı). Seçkin yayıncılık.