Authors:
Pérez-Calero Yzquierdo Antonio, Acosta Flechas Maria, Davila Foyo Diego, Haleem Saqib, Hurtado Anampa Kenyi, Ivanov Todor Trendafilov, Khan Farrukh Aftab, Kizinevič Edita, Larson Krista, Letts James, Mascheroni Marco, Mason David
Abstract
Distributed computing efforts of the CMS experiment at the LHC at CERN are now focused on the functionality required to meet the projected needs of the HL-LHC era. Cloud and HPC resources are expected to dominate over those provided by traditional Grid sites, while also being much more diverse and heterogeneous. Handling their special capabilities and limitations while maintaining global flexibility and efficiency, and operating at scales much larger than the current capacity, are the major challenges being addressed by the CMS Submission Infrastructure team. These proceedings discuss the risks to the stability and scalability of the CMS HTCondor infrastructure when extrapolated to such a scenario, which are thought to derive mostly from its growing complexity, with multiple Negotiators and schedulers flocking work to multiple federated pools. New mechanisms for enhanced customization and control over resource allocation and usage, mandatory in this future scenario, are also described.