Author:
Ebrahim Abdulla,Bocci Andrea,Elmedany Wael,Al-Ammal Hesham
Abstract
Particle track reconstruction for high energy physics experiments like CMS is computationally demanding but can benefit from GPU acceleration if properly tuned. This work develops an autotuning framework to automatically optimise the throughput of GPU-accelerated CUDA kernels in CMSSW. The proposed system navigates the complex parameter space by generating configurations, benchmarking performance, and leveraging multi-fidelity optimisation from simplified applications. The autotuned launch parameters improved CMSSW tracking throughput over the default settings by finding optimised, GPU-specific configurations. The successful application of autotuning to CMSSW demonstrates both performance portability across diverse accelerators and the potential of the methodology to optimise other HEP codebases.
Reference8 articles.
1. A Survey on Compiler Autotuning using Machine Learning
2. Kernel Tuner: A search-optimizing GPU code auto-tuner
3. Bringing heterogeneity to the CMS software framework
4. Ansel Jason, Kamil Shoaib, Veeramachaneni Kalyan, Ragan-Kelley Jonathan, Bosboom Jeffrey, O’Reilly Una-May, and Amarasinghe Saman. Proceedings of the 23rd International Conference on Parallel Architectures and Compilation (PACT). 303–316 (2014)