Tiling arbitrarily nested loops by means of the transitive

Published:2016-12-01 Issue:4 Volume:26 Page:919-939
ISSN:2083-8492
Container-title:International Journal of Applied Mathematics and Computer Science
language:en

Author:

Bielecki Włodzimierz¹, Pałkowski Marek¹

Affiliation:

1. Faculty of Computer Science, West Pomeranian University of Technology, Żołnierska 49, 71-210 Szczecin, Poland

Abstract

A novel approach to generating tiled code for arbitrarily nested loops is presented. It combines the polyhedral and iteration space slicing frameworks. Instead of program transformations represented by a set of affine functions, one for each statement, it uses the transitive closure of a loop nest dependence graph to correct the original rectangular tiles so that all dependences of the original loop nest are preserved under the lexicographic order of the target tiles. Parallel tiled code can then be generated from valid serial tiled code by applying affine transformations or transitive closure to an inter-tile dependence graph, whose vertices are the target tiles and whose edges connect dependent target tiles. We demonstrate how a relation describing such a graph can be formed. The main merit of the approach, compared with well-known ones, is that it does not require full permutability of loops to generate either serial or parallel tiled code; this widens the class of loop nests that can be tiled.
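The idea of correcting rectangular tiles with a transitive closure can be illustrated on a toy example. The sketch below is not the authors' algorithm or code: it is a simplified enumeration-based reading of the abstract, and the loop nest (a single dependence with distance (1, -1)), the sizes `N` and `B`, and all function names are assumptions made for illustration. Each iteration is assigned to the lexicographically greatest original tile found among itself and its transitive predecessors, so no dependence can point from a later target tile to an earlier one.

```python
from itertools import product

N = 4  # hypothetical loop bounds: 0 <= i, j < N
B = 2  # hypothetical tile size in both dimensions

# Iterations in original (lexicographic) execution order.
ITERS = list(product(range(N), range(N)))

def tile_of(p):
    """Identifier of the original rectangular tile containing iteration p."""
    i, j = p
    return (i // B, j // B)

def successors(p):
    """Direct dependences of the assumed toy nest: (i, j) -> (i + 1, j - 1)."""
    i, j = p
    q = (i + 1, j - 1)
    return [q] if 0 <= q[0] < N and 0 <= q[1] < N else []

def transitive_predecessors():
    """For every iteration, the set of iterations it transitively depends on.

    Dependences always point forward in lexicographic order, so one pass
    over ITERS in that order is enough to propagate the closure.
    """
    pred = {p: set() for p in ITERS}
    for p in ITERS:
        for q in successors(p):
            pred[q] |= pred[p] | {p}
    return pred

def rectangular():
    """Uncorrected assignment: iteration -> original rectangular tile."""
    return {p: tile_of(p) for p in ITERS}

def corrected():
    """Corrected assignment: greatest tile among p and its predecessors."""
    pred = transitive_predecessors()
    return {p: max(tile_of(x) for x in pred[p] | {p}) for p in ITERS}

def is_valid(assign):
    """True if no dependence goes from a later tile to an earlier one."""
    return all(assign[p] <= assign[q] for p in ITERS for q in successors(p))

if __name__ == "__main__":
    print("rectangular tiles valid:", is_valid(rectangular()))
    print("corrected tiles valid:  ", is_valid(corrected()))
    target = corrected()
    moved = sorted(p for p in ITERS if target[p] != tile_of(p))
    print("iterations moved to another tile:", moved)
```

In this toy nest the loops are not fully permutable because of the negative distance component, so plain rectangular tiles executed in lexicographic order break a dependence, while the corrected tiles do not. Within a target tile, iterations would still run in their original order. A symbolic implementation would operate on parametric relations rather than enumerated points.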

Publisher

Walter de Gruyter GmbH

Subject

Applied Mathematics, Engineering (miscellaneous), Computer Science (miscellaneous)

Cited by 21 articles.

1. NPDP benchmark suite for the evaluation of the effectiveness of automatic optimizing compilers;Parallel Computing;2023-07

2. Optimal uniformization for non-uniform two-level loops using a hybrid method;The Journal of Supercomputing;2023-03-19

3. NPDP Benchmark Suite for Loop Tiling Effectiveness Evaluation;Parallel Processing and Applied Mathematics;2023

4. Automatic code optimization for computing the McCaskill partition functions;Annals of Computer Science and Information Systems;2022-09-26

5. TLP: Towards three‐level loop parallelisation;IET Computers & Digital Techniques;2022-08-09