MiCOMP

Published:2017-09-30 Issue:3 Volume:14 Page:1-28
ISSN:1544-3566
Container-title:ACM Transactions on Architecture and Code Optimization
language:en
Short-container-title:ACM Trans. Archit. Code Optim.

Author:

Ashouri Amir H.¹, Bignoli Andrea², Palermo Gianluca², Silvano Cristina², Kulkarni Sameer³, Cavazos John³

Affiliation:

1. University of Toronto, ON, Canada

2. Politecnico di Milano, Italy

3. University of Delaware, USA

Abstract

Recent compilers offer a vast number of multilayered optimizations targeting different code segments of an application. Choosing among these optimizations can significantly impact the performance of the code being optimized. The selection of the right set of compiler optimizations for a particular code segment is a very hard problem, but finding the best ordering of these optimizations adds further complexity. Finding the best ordering represents a long standing problem in compilation research, named the phase-ordering problem. The traditional approach of constructing compiler heuristics to solve this problem simply cannot cope with the enormous complexity of choosing the right ordering of optimizations for every code segment in an application. This article proposes an automatic optimization framework we call MiCOMP, which Mitigates the Compiler Phase-ordering problem. We perform phase ordering of the optimizations in LLVM’s highest optimization level using optimization sub-sequences and machine learning. The idea is to cluster the optimization passes of LLVM’s O3 setting into different clusters to predict the speedup of a complete sequence of all the optimization clusters instead of having to deal with the ordering of more than 60 different individual optimizations. The predictive model uses (1) dynamic features, (2) an encoded version of the compiler sequence, and (3) an exploration heuristic to tackle the problem. Experimental results using the LLVM compiler framework and the Cbench suite show the effectiveness of the proposed clustering and encoding techniques to application-based reordering of passes, while using a number of predictive models. We perform statistical analysis on the results and compare against (1) random iterative compilation, (2) standard optimization levels, and (3) two recent prediction approaches. We show that MiCOMP’s iterative compilation using its sub-sequences can reach an average performance speedup of 1.31 (up to 1.51). 
Additionally, we demonstrate that MiCOMP’s prediction model outperforms the -O1, -O2, and -O3 optimization levels using just a few predictions and reduces the prediction error rate down to only 5%. Overall, it achieves 90% of the available speedup by exploring less than 0.001% of the optimization space.
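
As a rough illustration of why grouping passes into clusters shrinks the search space described in the abstract, the following sketch compares the number of orderings of individual passes with the number of sequences built from clusters. It is not the authors' implementation. The figure of 60 passes stands in for the abstract's "more than 60", and the cluster count, the maximum sequence length, and the choice to allow repeated clusters are hypothetical values chosen only for illustration.

```python
import math
from itertools import product

# Assumption: 60 stands in for the "more than 60" individual passes in the abstract.
NUM_PASSES = 60
# Hypothetical values, not taken from the paper.
NUM_CLUSTERS = 5
MAX_SEQUENCE_LENGTH = 4


def orderings_of_individual_passes(n):
    """Number of ways to order n distinct passes, each used exactly once."""
    return math.factorial(n)


def cluster_sequences(num_clusters, max_length):
    """Yield every sequence of cluster ids, repetition allowed, up to max_length."""
    ids = range(num_clusters)
    for length in range(1, max_length + 1):
        yield from product(ids, repeat=length)


pass_space = orderings_of_individual_passes(NUM_PASSES)
cluster_space = sum(1 for _ in cluster_sequences(NUM_CLUSTERS, MAX_SEQUENCE_LENGTH))

print(f"orderings of {NUM_PASSES} individual passes: {pass_space:.3e}")
print(f"sequences of up to {MAX_SEQUENCE_LENGTH} clusters: {cluster_space}")
```

Under these assumed values the cluster-level space is small enough to enumerate exhaustively, whereas the pass-level space is not, which is the motivation for predicting speedups over sequences of clusters.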

Funder

EU Commission H2020-FET-HPC program

Publisher

Association for Computing Machinery (ACM)

Subject

Hardware and Architecture, Information Systems, Software

Link

https://dl.acm.org/doi/pdf/10.1145/3124452

Reference: 55 articles.

1. Using Machine Learning to Focus Iterative Optimization

2. Finding effective compilation sequences

3. Amir Hossein Ashouri. 2012. Design space exploration methodology for compiler parameters in VLIW processors. Master’s thesis. Politecnico di Milano, Italy. http://hdl.handle.net/10589/72083.

4. Amir Hossein Ashouri, Andrea Bignoli, Gianluca Palermo, and Cristina Silvano. 2016. Predictive modeling methodology for compiler phase-ordering. In Proceedings of 7th Workshop on Parallel Programming and Run-Time Management Techniques for Many-core Architectures and 5th Workshop on Design Tools and Architectures for Multicore Embedded Computing Platforms. ACM. 10.1145/2872421.2872424

Cited by 59 articles.

1. A Two-Stage LLVM Option Sequence Optimization Method to Minimize Energy Consumption;Swarm and Evolutionary Computation;2024-07

2. Two-Level Software Obfuscation with Cooperative Co-Evolutionary Algorithms;2024 IEEE Congress on Evolutionary Computation (CEC);2024-06-30

3. Tile Size and Loop Order Selection using Machine Learning for Multi-/Many-Core Architectures;Proceedings of the 38th ACM International Conference on Supercomputing;2024-05-30

4. Compiler Autotuning through Multiple-phase Learning;ACM Transactions on Software Engineering and Methodology;2024-04-18

5. Exploring compiler optimization space for control flow obfuscation;Computers & Security;2024-04