Affiliation:
1. Department of CSE, IIT Hyderabad, India and Sorbonne Université, CNRS, LIP6, Paris, France
Abstract
For Affine Control Programs or Static Control Programs (SCoP), symbolic counting of reuse distances could induce polynomials for each reuse pair. These polynomials along with cache capacity constraints lead to non-affine (semi-algebraic) sets; and counting these sets is considered to be a hard problem. The state-of-the-art methods use various exact enumeration techniques relying on existing cardinality algorithms that can efficiently count affine sets.
We propose
BullsEye
, a novel, scalable, accurate, and problem-size independent approximation framework. It is an analytical cache model for fully associative caches with LRU replacement policy focusing on sampling and linearization of non-affine stack distance polynomials. First, we propose a simple domain sampling method that can improve the scalability of exact enumeration. Second, we propose linearization techniques relying on
Handelman’s theorem
and
Bernstein’s representation
. To improve the scalability of the
Handelman’s theorem
linearization technique, we propose template (Interval or Octagon) sub-polyhedral approximations.
Our methods obtain significant compile-time improvements with high-accuracy when compared to
HayStack
on important polyhedral compilation kernels such as
nussinov
,
cholesky
, and
adi
from
PolyBench
, and
harris
,
gaussianblur
from
LLVM
-TestSuite. Overall, on
PolyBench
kernels, our methods show up to 3.31× (geomean) speedup with errors below ≈ 0.08% (geomean) for the octagon sub-polyhedral approximation.
Funder
DST, Govt of India, through the Data Science cluster of the ICPS program
NSM research grant
Publisher
Association for Computing Machinery (ACM)
Subject
Hardware and Architecture,Information Systems,Software
Reference72 articles.
1. V. Baldoni, N. Berline, J. A. De Loera, B. Dutra, M. Koppe, S. Moreinis, G. Pinto, M. Vergne, and J. Wu. 2013. A User’s Guide for LattE Integrale v1.7.2. Retrieved from http://www.math.ucdavis.edu/latte/.
2. Analytical modeling of cache behavior for affine programs
3. A Polynomial Time Algorithm for Counting Integral Points in Polyhedra When the Dimension is Fixed
4. Collected works, vol. 1;Bernstein S.;USSR Acad. Sci.,1952
5. Collected works, vol. 2;Bernstein S.;USSR Acad. Sci.,1954
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Falcon: A Scalable Analytical Cache Model;Proceedings of the ACM on Programming Languages;2024-06-20