The integration of heterogeneous resources in the CMS Submission Infrastructure for the LHC Run 3 and beyond
-
Published:2024
Issue:
Volume:295
Page:04046
-
ISSN:2100-014X
-
Container-title:EPJ Web of Conferences
-
language:
-
Short-container-title:EPJ Web of Conf.
Author:
Pérez-Calero Yzquierdo Antonio,Mascheroni Marco,Kizinevic Edita,Khan Farrukh Aftab,Kim Hyunwoo,Acosta Flechas Maria,Tsipinakis Nikos,Haleem Saqib
Abstract
While the computing landscape supporting LHC experiments is currently dominated by x86 processors at WLCG sites, this configuration will evolve in the coming years. LHC collaborations will be increasingly employing HPC and Cloud facilities to process the vast amounts of data expected during the LHC Run 3 and the future HL-LHC phase. These facilities often feature diverse compute resources, including alternative CPU architectures like ARM and IBM Power, as well as a variety of GPU specifications. Using these heterogeneous resources efficiently is thus essential for the LHC collaborations reaching their future scientific goals. The Submission Infrastructure (SI) is a central element in CMS Computing, enabling resource acquisition and exploitation by CMS data processing, simulation and analysis tasks. The SI must therefore be adapted to ensure access and optimal utilization of this heterogeneous compute capacity. Some steps in this evolution have been already taken, as CMS is currently using opportunistically a small pool of GPU slots provided mainly at the CMS WLCG sites. Additionally, Power9 processors have been validated for CMS production at the Marconi-100 cluster at CINECA. This note will describe the updated capabilities of the SI to continue ensuring the efficient allocation and use of computing resources by CMS, despite their increasing diversity. The next steps towards a full integration and support of heterogeneous resources according to CMS needs will also be reported.
Reference15 articles.
1. The Top500 list, https://top500.org/, accessed September, 2023.
2. The Worldwide LHC Computing Grid http://wlcg.web.cern.ch, accessed September, 2023.
3. A Roadmap for HEP Software and Computing R&D for the 2020s