Affiliation:
1. University of Waterloo, Canada
2. Intel of Canada, Canada
Abstract
We present a predictable wavefront splitting (PWS) technique for graphics processing units (GPUs). PWS improves the performance of GPU applications by reducing the impact of branch divergence while ensuring that worst-case execution time (WCET) estimates can be computed. This makes PWS an appropriate technique to use in safety-critical applications, such as autonomous driving systems, avionics, and space, that require strict temporal guarantees. In developing PWS on an AMD-based GPU, we propose microarchitectural enhancements to the GPU, and a compiler pass that eliminates branch serializations to reduce the WCET of a wavefront. Our analysis of PWS exhibits a performance improvement of 11% over existing architectures with a lower WCET than prior works in wavefront splitting.
Publisher
Association for Computing Machinery (ACM)
Subject
Hardware and Architecture,Software
Reference23 articles.
1. Advanced Micro Devices. 2016. Graphics Core Next Architecture Reference Guide. (2016).
2. Advanced Micro Devices. 2019. Introducing RDNA Architecture. (2019).
3. GPU Scheduling on the NVIDIA TX2: Hidden Details Revealed
4. Computer and Redundancy Solution for the Full Self-Driving Computer