On the Transformation Optimization for Stencil Computation-Reference-Cited by-同舟云学术

On the Transformation Optimization for Stencil Computation

Published:2021-12-23 Issue:1 Volume:11 Page:38
ISSN:2079-9292
Container-title:Electronics
language:en
Short-container-title:Electronics

Author:

Su Huayou,Zhang Kaifang,Mei Songzhu

Abstract

Stencil computation optimizations have been investigated quite a lot, and various approaches have been proposed. Loop transformation is a vital kind of optimization in modern production compilers and has proved successful employment within compilers. In this paper, we combine the two aspects to study the potential benefits some common transformation recipes may have for stencils. The recipes consist of loop unrolling, loop fusion, address precalculation, redundancy elimination, instruction reordering, load balance, and a forward and backward update algorithm named semi-stencil. Experimental evaluations of diverse stencil kernels, including 1D, 2D, and 3D computation patterns, on two typical ARM and Intel platforms, demonstrate the respective effects of the transformation recipes. An average speedup of 1.65× is obtained, and the best is 1.88× for the single transformation recipes we analyze. The compound recipes demonstrate a maximum speedup of 1.92×.

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Computer Networks and Communications,Hardware and Architecture,Signal Processing,Control and Systems Engineering

Link

https://www.mdpi.com/2079-9292/11/1/38/pdf

Reference32 articles.

1. Algorithm 942

2. The Titan Graphics Supercomputer architecture

3. Compiler transformations for high-performance computing

4. Loop Transformations for Restructuring Compilers: The Foundations;Banerjee,1993