Abstract
In recent years, schools have shown interest in utilizing data mining to improve the quality of education. To enhance academic performance, accurately predicting how students will perform in their classes is crucial, which is essential for their progress in further education. Some students encounter challenges upon entering higher education, and predicting their performance early on is vital to keeping them on the right track. Our research aims to assess student performance using various classification strategies to identify the most accurate one. We utilize a Kaggle dataset for this study. Initially, we clean up the dataset by removing duplicate records and filling in any missing information. Subsequently, we apply six different classifiers, including Neural Networks and methods such as Random Forest and Support Vector Machine, utilizing the Weka tool. Additionally, we employ Principal Component Analysis (PCA) to extract optimized features that enhance model accuracy. We evaluate all models on Training and Testing splits, as well as the 10-K Fold options provided by the Weka tool. Finally, we calculate Training Accuracy, Testing Accuracy, Precision, Recall, and F1-Score for each model and compare their results. Notably, Neural Networks and Random Forest demonstrate superior results compared to other models.