Affiliation:
1. The Ohio State University
2. Inria
3. Louisiana State University
Abstract
The roofline model is a popular approach for “bound and bottleneck” performance analysis. It focuses on the limits to the performance of processors because of limited bandwidth to off-chip memory. It models upper bounds on performance as a function of operational intensity, the ratio of computational operations per byte of data moved from/to memory. While operational intensity can be directly measured for a specific implementation of an algorithm on a particular target platform, it is of interest to obtain broader insights on bottlenecks, where various semantically equivalent implementations of an algorithm are considered, along with analysis for variations in architectural parameters. This is currently very cumbersome and requires performance modeling and analysis of many variants.
In this article, we address this problem by using the roofline model in conjunction with upper bounds on the operational intensity of computations as a function of cache capacity, derived from lower bounds on data movement. This enables bottleneck analysis that holds across all dependence-preserving semantically equivalent implementations of an algorithm. We demonstrate the utility of the approach in assessing fundamental limits to performance and energy efficiency for several benchmark algorithms across a design space of architectural variations.
Funder
U.S. Department of Energy
U.S. National Science Foundation
Publisher
Association for Computing Machinery (ACM)
Subject
Hardware and Architecture,Information Systems,Software
Reference36 articles.
1. A model for hierarchical memory
2. The input/output complexity of sorting and related problems
3. Minimizing Communication in Numerical Linear Algebra
4. Graph expansion and communication costs of fast matrix multiplication
5. Keren Bergman Shekhar Borkar Dan Campbell William Carlson William Dally Monty Denneau Paul Franzon William Harrod Kerry Hill Jon Hiller etal 2008. Exascale Computing Study: Technology Challenges in Achieving Exascale Systems. Tech. Rep. Defense Advanced Research Projects Agency Information Processing Techniques Office (DARPA IPTO). Keren Bergman Shekhar Borkar Dan Campbell William Carlson William Dally Monty Denneau Paul Franzon William Harrod Kerry Hill Jon Hiller et al. 2008. Exascale Computing Study: Technology Challenges in Achieving Exascale Systems. Tech. Rep. Defense Advanced Research Projects Agency Information Processing Techniques Office (DARPA IPTO).
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献