Affiliation:
1. Universidade Federal de Pelotas, Pelotas, Brazil
2. State Health Department of Rio Grande do Sul, Porto Alegre, Brazil
Abstract
Brazil’s education system lags behind international standards, with two-fifths of students scoring below the minimum level of proficiency in mathematics, science, and reading. Thus, this study combined machine learning with traditional statistics to identify the most important predictors and to interpret their effects on proficiency in the PISA 2018 mathematics, science, and reading tests. Predictors encompassed a wide range of variables, sociodemographic characteristics, teaching and learning processes, and non-cognitive skills. The outcome of the present study was proficiency in mathematics, science, and reading. PISA proficiency levels were grouped into “low proficiency” and “proficient” categories, using a classification system commonly employed in PISA reports. Using random forest analysis, a machine learning method, I compared the importance of predictors for proficiency in mathematics, science, and reading. I then adjusted multilevel logistic regression analyses to investigate the relationship between the top predictors and the outcomes. Among the top predictors for the three outcomes identified, annual household income, parents’ highest occupational status, and early childhood education and care were positively associated with proficiency in mathematics, science, and reading, while grade repetition and additional instruction were negatively associated with these outcomes. These findings urge Brazilian policymakers and educators to prioritize initiatives that strengthen early childhood programs, minimize grade repetition, and promote effective learning strategies.