Affiliation:
1. School of Science and Technology, International Hellenic University, 57001 Thessaloniki, Greece
Abstract
The field of sports analytics has grown rapidly, with a primary focus on performance forecasting, enhancing the understanding of player capabilities, and indirectly benefiting team strategies and player development. This work aims to forecast and comparatively evaluate players’ goal-scoring likelihood in four elite football leagues (Premier League, Bundesliga, La Liga, and Serie A) by mining advanced statistics from 2017 to 2023. Six types of machine learning (ML) models were developed and tested individually through experiments on the comprehensive datasets collected for these leagues. We also tested the upper 30th percentile of the best-performing players based on their performance in the last season, with varied features evaluated to enhance prediction accuracy in distinct scenarios. The results offer insights into the forecasting abilities of those leagues, identifying the best forecasting methodologies and the factors that most significantly contribute to the prediction of players’ goal-scoring. XGBoost consistently outperformed other models in most experiments, yielding the most accurate results and leading to a well-generalized model. Notably, when applied to Serie A, it achieved a mean absolute error (MAE) of 1.29. This study provides insights into ML-based performance prediction, advancing the field of player performance forecasting.
Reference26 articles.
1. Sports Analytics and the Big-Data Era;Morgulev;Int. J. Data Sci. Anal.,2018
2. Evaluating the Effectiveness of Machine Learning Models for Performance Forecasting in Basketball: A Comparative Study;Papageorgiou;Knowl. Inf. Syst.,2024
3. Application of Machine Learning Approaches in Intrusion Detection System: A Survey;Haq;Int. J. Adv. Res. Artif. Intell.,2015
4. Papageorgiou, G., Sarlis, V., and Tjortjis, C. (2024). An Innovative Method for Accurate NBA Player Performance Forecasting and Line-up Optimization in Daily Fantasy Sports. Int. J. Data Sci. Anal.
5. Pantzalis, V.C., and Tjortjis, C. (2020, January 15–17). Sports Analytics for Football League Table and Player Performance Prediction. Proceedings of the 2020 11th International Conference on Information, Intelligence, Systems and Applications, Piraeus, Greece.