Abstract
Given a set of sequences comprised of time-ordered events, sequential pattern mining is useful to identify frequent subsequences from different sequences or within the same sequence. However, in sport, these techniques cannot determine the importance of particular patterns of play to good or bad outcomes, which is often of greater interest to coaches and performance analysts. In this study, we apply a recently proposed supervised sequential pattern mining algorithm called safe pattern pruning (SPP) to 490 labelled event sequences representing passages of play from one rugby team’s matches in the 2018 Japan Top League season. We obtain patterns that are the most discriminative between scoring and non-scoring outcomes from both the team’s and opposition teams’ perspectives using SPP, and compare these with the most frequent patterns obtained with well-known unsupervised sequential pattern mining algorithms when applied to subsets of the original dataset, split on the label. From our obtained results, line breaks, successful line-outs, regained kicks in play, repeated phase-breakdown play, and failed exit plays by the opposition team were found to be the patterns that discriminated most between the team scoring and not scoring. Opposition team line breaks, errors made by the team, opposition team line-outs, and repeated phase-breakdown play by the opposition team were found to be the patterns that discriminated most between the opposition team scoring and not scoring. It was also found that, probably because of the supervised nature and pruning/safe-screening mechanisms of SPP, compared to the patterns obtained by the unsupervised methods, those obtained by SPP were more sophisticated in terms of containing a greater variety of events, and when interpreted, the SPP-obtained patterns would also be more useful for coaches and performance analysts.
Funder
core research for evolutional science and technology
Ministry of Education, Culture, Sports, Science and Technology
RIKEN
Publisher
Public Library of Science (PLoS)
Reference45 articles.
1. The use of performance indicators in performance analysis;MD Hughes;Journal of sports sciences,2002
2. Agrawal R, Srikant R. Mining sequential patterns. In: Proceedings of the eleventh international conference on data engineering. 1995 Mar 6 (pp. 3–14). IEEE.
3. A taxonomy of sequential pattern mining algorithms;NR Mabroukeh;ACM Computing Surveys (CSUR),2010
4. Wang K, Xu Y, Yu JX. Scalable sequential pattern mining for biological sequences Proceedings of the thirteenth ACM international conference on Information and knowledge management. 2004 Nov 13 (pp. 178–187).
5. Ho J, Lukov L, Chawla S. Sequential pattern mining with constraints on large protein databases. In Proceedings of the 12th international conference on management of data (COMAD). 2005 (pp. 89–100).
Cited by
9 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献