Affiliation:
1. ALCHEMY Group, INRIA Saclay -- Ile-de-France and Paris-Sud University, Orsay, France
2. Dept. of Computer & Information Sciences, University of Delaware, Newark, DE, USA
Abstract
High-level loop optimizations are necessary to achieve good performance over a wide variety of processors. Their performance impact can be significant because they involve in-depth program transformations that aim to sustain a balanced workload over the computational, storage, and communication resources of the target architecture. Therefore, it is mandatory that the compiler accurately models the target architecture as well as the effects of complex code restructuring.
However, because optimizing compilers (1) use simplistic performance models that abstract away many of the complexities of modern architectures, (2) rely on inaccurate dependence analysis, and (3) lack frameworks to express complex interactions of transformation sequences, they typically uncover only a fraction of the peak performance available on many applications. We propose a complete iterative framework to address these issues. We rely on the polyhedral model to construct and traverse a large and expressive search space. This space encompasses only legal, distinct versions resulting from the restructuring of any static control loop nest. We first propose a feedback-driven iterative heuristic tailored to the search space properties of the polyhedral model. Though, it quickly converges to good solutions for small kernels, larger benchmarks containing higher dimensional spaces are more challenging and our heuristic misses opportunities for significant performance improvement. Thus, we introduce the use of a genetic algorithm with specialized operators that leverage the polyhedral representation of program dependences. We provide experimental evidence that the genetic algorithm effectively traverses huge optimization spaces, achieving good performance improvements on large loop nests.
Publisher
Association for Computing Machinery (ACM)
Subject
Computer Graphics and Computer-Aided Design,Software
Cited by
33 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Discovering Parallelisms in Python Programs;Proceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering;2023-11-30
2. VICO;Proceedings of the 36th ACM International Conference on Supercomputing;2022-06-28
3. Mapping parallelism in a functional IR through constraint satisfaction: a case study on convolution for mobile GPUs;Proceedings of the 31st ACM SIGPLAN International Conference on Compiler Construction;2022-03-18
4. On the Impact of Affine Loop Transformations in Qubit Allocation;ACM Transactions on Quantum Computing;2021-09-30
5. PolyGym: Polyhedral Optimizations as an Environment for Reinforcement Learning;2021 30th International Conference on Parallel Architectures and Compilation Techniques (PACT);2021-09