Affiliation:
1. AMD Research and Georgia Institute of Technology
2. AMD Research
3. AMD Research and University of California, San Diego
4. Georgia Institute of Technology
Abstract
In this paper, we address the problem of efficiently managing the relative power demands of a high-performance GPU and its memory subsystem. We develop a management approach that dynamically tunes the hardware operating configurations to maintain balance between the power dissipated in compute versus memory access across GPGPU application phases. Our goal is to reduce power with minimal performance degradation.
Accordingly, we construct predictors that assess the online sensitivity of applications to three hardware tunables---compute frequency, number of active compute units, and memory bandwidth. Using these sensitivity predictors, we propose a two-level coordinated power management scheme, Harmonia, which coordinates the hardware power states of the GPU and the memory system. Through hardware measurements on a commodity GPU, we evaluate Harmonia against a state-of-the-practice commodity GPU power management scheme, as well as an oracle scheme. Results show that Harmonia improves measured energy-delay squared (ED
2
) by up to 36% (12% on average) with negligible performance loss across representative GPGPU workloads, and on an average is within 3% of the oracle scheme.
Publisher
Association for Computing Machinery (ACM)
Cited by
4 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. PC-oriented Prediction-based Runtime Power Management for GPGPU using Knowledge Transfer;Proceedings of the 36th ACM Symposium on Parallelism in Algorithms and Architectures;2024-06-17
2. Unity ECC: Unified Memory Protection Against Bit and Chip Errors;Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis;2023-11-11
3. Aggressive SRAM Voltage Scaling and Error Mitigation for Approximate DNN Inference;Proceedings of the 2nd Workshop on Smart Wearable Systems and Applications;2023-10-02
4. Footprint-Aware Power Capping for Hybrid Memory Based Systems;Lecture Notes in Computer Science;2020