Affiliation:
1. Enterprise SSD Division, Micron Technology, Bengaluru 560103, India
2. Department of Computer Science, University of York, Deramore Lane, York YO10 5GH, UK
3. Department of Engineering, Manchester Metropolitan University, Chester Street, Manchester M1 5GD, UK
Abstract
Football (also known as Soccer), boasts a staggering fan base of 3.5 billion individuals spread across 200 countries, making it the world’s most beloved sport. The widespread adoption of advanced technology in sports has become increasingly prominent, empowering players, coaches, and team management to enhance their performance and refine team strategies. Among these advancements, player substitution plays a crucial role in altering the dynamics of a match. However, due to the absence of proven methods or software capable of accurately predicting substitutions, these decisions are often based on instinct rather than concrete data. The purpose of this research is to explore the potential of employing machine learning algorithms to predict substitutions in Football, and how it could influence the outcome of a match. This study investigates the effect of timely and tactical substitutions in football matches and their influence on the match results. Machine learning techniques such as Logistic Regression (LR), Decision tree (DT), K-nearest Neighbor (KNN), Support Vector Machine (SVM), Multinomial Naïve Bayes (MNB), Random Forest (RF) classifiers were implemented and tested to develop models and to predict player substitutions. Relevant data was collected from the Kaggle dataset, which contains data of 51,738 substitutions from 9074 European league football matches in 5 leagues spanning 6 seasons. Machine learning models were trained and tested using an 80-20 data split and it was observed that RF model provided the best accuracy of over 70% and the best F1-score of 0.65 on the test set across all football leagues. SVM model achieved the best Precision of almost 0.8. However, the worst computation time of up to 2 min was consumed. LR showed some overfitting issues with 100% accuracy in the training set, but only 60% accuracy was obtained for the test set. To conclude, based on the time of substitution and match score-line, it was possible to predict the players who can be substituted, which can provide a match advantage. The achieved results provided an effective way to decide on player substitutions for both the team manager and coaches.
Subject
Artificial Intelligence,Computer Science Applications,Information Systems,Management Information Systems
Reference43 articles.
1. Ritzer, G. (2012). The Wiley-Blackwell Encyclopedia of Globalization, Weily.
2. The three and six-substitution rules in football: A preliminary comparative analysis in quantitative replacing, game statistics, win rate and winning probability;Ribeiro;Mot. Rev. Educ. Física,2020
3. Teoldo, I., Guilherme, J., and Garganta, J. (2021). Football Intelligence: Training and Tactics for Soccer Success, Routledge. [1st ed.].
4. Supervised Machine Learning Algorithms: Classification and Comparison;Osisanwo;Int. J. Comput. Trends Technol.,2017
5. Anderson, C., and Sally, D. (2014). The Numbers Game. Why Everything You Know about Football Is Wrong, Penguin Books.