Affiliation:
1. University of California Irvine
Abstract
The GCC is an auto-vectorization compiler across iterations of loops to parallelism data. Turning GCC compiler optimizations flags for auto-vectorization is a way to improve the performance ability, which is a popular approach to speed up program performance. However, there are many options in GCC compiler and selecting the best combination of these options to improve program performance through vectorization is non-trivial ( as the search space is very large ).In this work we focus on the selection of compiler transformations to auto-vectorize loops with conditional statements. The selection of compiler transformations is based on the correlation between program features, speed-up, and the analysis of the code generated and a small number of passes of iterative compilation. Our preliminary experimental results show that proposed technique attains performance improvements the best ~ 6x using loops in the TSVC benchmark suite on the state-of-the-art Intel Core i3 processor.
Publisher
Trans Tech Publications, Ltd.
Reference11 articles.
1. Information on http: /www. intel. com/content/www/us/en/architecture-and-technology/64-ia-32- architectures-optimization-manual. html.
2. Information on http: /gcc. gnu. org/onlinedocs/gcc-4. 7. 2/gcc. pdf.
3. Aydin Buluç and John R. Gilbert, SIAM Journal of Scientific Computing (SISC). 34(2012)170 - 191.
4. Saeed Maleki and Maria J. Garzarán, PACT '11 Proceedings of the 2011 International Conference on Parallel Architectures and Compilation Techniques. (2011)372-382.
5. Chiu, J. and Y. Chou, Microprocessors and Microsystems. 34(2010) 247–258.