Affiliation:
1. ETH Zurich
2. University of Washington
Abstract
In the domain of high-energy physics (HEP), query languages in general and SQL in particular have found limited acceptance. This is surprising since HEP data analysis matches the SQL model well: the data is fully structured and queried using mostly standard operators. To gain insights on why this is the case, we perform a comprehensive analysis of six diverse, general-purpose data processing platforms using an HEP benchmark. The result of the evaluation is an interesting and rather complex picture of existing solutions: Their query languages vary greatly in how natural and concise HEP query patterns can be expressed. Furthermore, most of them are also between one and two orders of magnitude slower than the domain-specific system used by particle physicists today. These observations suggest that, while database systems and their query languages are
in principle
viable tools for HEP, significant work remains to make them relevant to HEP researchers.
Publisher
Association for Computing Machinery (ACM)
Subject
General Earth and Planetary Sciences,Water Science and Technology,Geography, Planning and Development
Reference77 articles.
1. Actian Corporation . Columnar Database for Big Data | Vector Analytic Database . Retrieved Aug. 18, 2021 from https://www.actian.com/analytic-database/vector-analytic-database/. Actian Corporation. Columnar Database for Big Data | Vector Analytic Database. Retrieved Aug. 18, 2021 from https://www.actian.com/analytic-database/vector-analytic-database/.
2. AsterixDB
3. A visual query language for HEP analysis
4. Spark SQL
Cited by
4 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献