Affiliation:
1. North Carolina State University, Raleigh, NC, USA
2. Google, Mountain View, CA, USA
Abstract
For modern data analytics, practices from software development are increasingly necessary to manage data, but they must be incorporated alongside other statistical and scientific skills. Therefore, we ask: how does a community recontextualize software development through the unique pressures of their work? To answer this, we explore the analytic community around baseball, or sabermetrics. To discover software development's place in the search for robust statistical insight in sports, we interview 10 participants in the sabermetric community and survey over 120 more data analysts, both in baseball and not. We explore how their work lives at the intersection of science and entertainment, and as a consequence, baseball data serves as an accessible yet deep subject to practice analytic skills. Software development exists within an iterative research process that cycles between defining rigorous statistical methods and preserving the flexibility to chase interesting problems. In this question-driven process, members of the community inhabit several overlapping roles of intentional work, in which software development can become the priority to support research and statistical infrastructure, and we discuss the way that the community can foster the balance of these skills.
Funder
National Science Foundation
Publisher
Association for Computing Machinery (ACM)
Subject
Computer Networks and Communications,Human-Computer Interaction,Social Sciences (miscellaneous)
Reference65 articles.
1. Systems and software engineering - Vocabulary ISO/IEC/IEEE 24765: 2010;IEEE Standards Association et al.;Iso/Iec/Ieee,2010
2. Analyze this! 145 questions for data scientists in software engineering
3. Who’s Got the Data? Interdependencies in Science and Technology Collaborations
4. Pierre Bourque Richard E Fairley etal 2014. Guide to the software engineering body of knowledge (SWEBOK (R)): Version 3.0. IEEE Computer Society Press. Pierre Bourque Richard E Fairley et al. 2014. Guide to the software engineering body of knowledge (SWEBOK (R)): Version 3.0. IEEE Computer Society Press.
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Enabling Collaborative Data Science Development with the Ballet Framework;Proceedings of the ACM on Human-Computer Interaction;2021-10-13
2. Remote, but Connected;Proceedings of the ACM on Human-Computer Interaction;2021-04-13