Abstract
AbstractPower consumption is likely to remain a significant concern for exascale performance in the foreseeable future. In addition, graphics processing units (GPUs) have become an accepted architectural feature for exascale computing due to their scalable performance and power efficiency. In a recent study, we found that we can achieve a reasonable amount of power and energy savings based on the selection of algorithms. In this research, we suggest that we can save more power and energy by varying the block size in the kernel configuration. We show that we may attain more savings by selecting the optimum block size while executing the workload. We investigated two kernels on NVIDIA Tesla K40 GPU, a Bitonic Mergesort and Vector Addition kernels, to study the effect of varying block sizes on GPU power and energy consumption. The study should offer insights for upcoming exascale systems in terms of power and energy efficiency.
Funder
Deanship of Scientific Research, King Saud University
Publisher
Springer Science and Business Media LLC
Subject
Hardware and Architecture,Information Systems,Theoretical Computer Science,Software
Reference56 articles.
1. (date accessed February 10, 2019) Parallel Bitonic Mergesort. URL http://www.tools-of-computing.com/tc/CS/Sorts/bitonic_sort.htm
2. (date accessed February 11, 2019) Bitonic Sort. URL http://www.iti.fh-flensburg.de/lang/algorithmen/sortieren/bitonic/oddn.htm
3. Abulnaja OA, Ikram MJ, Al-Hashimi MA, Saleh ME (2018) Analyzing power and energy efficiency of bitonic mergesort based on performance evaluation. IEEE Access 6:42757–42774
4. Al-Hashimi M, Saleh M, Abulnaja O, Aljabri N (2014) Evaluation of control loop statements power efficiency: An experimental study. In: 2014 9th International Conference on Informatics and Systems, IEEE, pp 45–48
5. Al-Hashimi MA, Abulnaja OA, Saleh ME, Ikram MJ (2017) Evaluating power and energy efficiency of bitonic mergesort on graphics processing unit. IEEE Access 5:16429–16440
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. BTO, Block and Thread Optimization of GPU Kernels on Geophysical Exploration;2024 32nd Euromicro International Conference on Parallel, Distributed and Network-Based Processing (PDP);2024-03-20
2. Optimizing Medical Image Analysis: Leveraging Efficient Hardware and AI Algorithms;2024 37th International Conference on VLSI Design and 2024 23rd International Conference on Embedded Systems (VLSID);2024-01-06
3. Energy-Efficient Parallel Computing: Challenges to Scaling;Information;2023-04-20