Authors:
Choi KyungEon, Onyisi Peter
Abstract
Recent developments in HEP software allow novel approaches to physics analysis workflows. The data delivery system ServiceX can be very effective when an analysis needs only a fraction of a large dataset stored at remote grid sites: it delivers user-selected columns, applies filtering, and runs at scale. We introduce the ServiceX data management package, ServiceX DataBinder, which makes it easy to manage ServiceX delivery requests and the delivered data using a single configuration file. We show various practical use cases within analysis pipelines, ranging from the delivery of a few columns for a machine learning study to data delivery for a full-scale physics analysis.