Affiliation:
1. Department of e-Business, School of Business Administration, Ajou University, Suwon, Korea
Abstract
In the past decade, many data mining researches have been conducted on the sports field. In particular, baseball has become an important subject of data mining due to the wide availability of massive data from games. Many researchers have conducted their studies to predict pitch types, i.e., fastball, cutter, sinker, slider, curveball, changeup, knuckleball, or part of them. In this research, we also develop a system that makes predictions related to pitches in baseball. The major difference between our research and the previous researches is that our system is to predict pitch types and pitch locations at the same time. Pitch location is the place where the pitched ball arrives among the imaginary grids drawn in front of the catcher. Another difference is the number of classes to predict. In the previous researches for predicting pitch types, the number of classes to predict was 2∼7. However, in our research, since we also predict pitch locations, the number of classes to predict is 34. We build our prediction system using ensemble model of deep neural networks. We describe in detail the process of building our prediction system while avoiding overfitting. In addition, the performances of our prediction system in various game situations, such as loss/draw/win, count and baserunners situation, are presented.
Reference8 articles.
1. Pitch Sequence Complexity and Long-Term Pitcher Performance;Bock,;Sports,2015
2. What the Heck is PITCHf/x;Fast,;Hardball Times Baseball Annual,2010
3. The Problem of Overfitting;Hawkins,;Journal of Chemical Information and Computer Science,2004
4. Lewis, M. , 2003, Moneyball: The Art ofWinning an Unfair Game, W. W. Norton & Company.
5. Linoff, G. and Berry, M. , 2011, Data Mining Techniques: For Marketing, Sales, and CRM, 3rd ed., Wiley Pub. Inc.,
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献