Automatic data and computation decomposition on distributed memory parallel computers

Author:

Lee Peizong¹, Kedem Zvi Meir²

Affiliation:

1. Academia Sinica, Taipei, Taiwan, Republic of China

2. New York University, New York, NY

Abstract

To exploit parallelism on shared memory parallel computers (SMPCs), it is natural to focus on decomposing the computation (mainly by distributing the iterations of the nested Do-loops). In contrast, on distributed memory parallel computers (DMPCs), the decomposition of computation and the distribution of data must both be handled, in order to balance the computation load and to minimize the migration of data. We propose and validate experimentally a method for handling computations and data synergistically to minimize the overall execution time on DMPCs. The method is based on a number of novel techniques, also presented in this article. The core idea is to rank the "importance" of data arrays in a program and designate some of them as dominant. The intuition is that the dominant arrays are the ones whose migration would be the most expensive. The correspondence between iteration space mapping vectors and distributed dimensions of the dominant data array in each nested Do-loop allows us to design algorithms for determining data and computation decompositions at the same time. Given the data distribution, the computation decomposition for each nested Do-loop is determined by either the "owner computes" rule or the "owner stores" rule with respect to the dominant data array. If all temporal dependence relations across iteration partitions are regular, we use tiling to allow pipelining and the overlapping of computation and communication. However, in order to use tiling on DMPCs, we needed to extend the existing techniques for determining tiling vectors and tile sizes, as they were originally suited for SMPCs only. The overall method is illustrated on programs for the 2D heat equation, for Gaussian elimination with pivoting, and for the 2D fast Fourier transform on a linear processor array and on a 2D processor grid.
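As a rough illustration of the "owner computes" rule mentioned in the abstract, the sketch below assigns each iteration of a simple 1D stencil loop to the processor that owns the element being written, and records which elements must migrate from another processor. This is a simplified example and not the authors' algorithm: the block distribution, the 1D stencil `a[i] = (b[i-1] + b[i+1]) / 2`, the assumption that `a` and `b` share the same distribution, and the names `block_owner` and `owner_computes` are all assumptions made here for illustration.

```python
def block_owner(i, n, p):
    """Return the processor that owns element i when an n-element array
    is block-distributed over p processors (assumed distribution)."""
    block = -(-n // p)  # ceil(n / p) elements per processor
    return i // block


def owner_computes(n, p):
    """Apply the owner-computes rule to the loop
        for i in 1 .. n-2:  a[i] = (b[i-1] + b[i+1]) / 2
    where a and b are assumed to be block-distributed identically.

    Returns (iterations, remote_reads):
      iterations[q]   -- loop indices executed by processor q
      remote_reads[q] -- indices of b that q must receive from another processor
    """
    iterations = {q: [] for q in range(p)}
    remote_reads = {q: set() for q in range(p)}
    for i in range(1, n - 1):
        q = block_owner(i, n, p)  # the owner of a[i] executes iteration i
        iterations[q].append(i)
        for j in (i - 1, i + 1):  # elements of b read by iteration i
            if block_owner(j, n, p) != q:
                remote_reads[q].add(j)  # this element has to migrate to q
    return iterations, remote_reads


if __name__ == "__main__":
    its, reads = owner_computes(n=8, p=2)
    print(its)    # {0: [1, 2, 3], 1: [4, 5, 6]}
    print(reads)  # {0: {4}, 1: {3}}
```

With 8 elements on 2 processors, only the two elements adjacent to the block boundary are communicated. That is the kind of data migration the decomposition is meant to keep small.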

Publisher

Association for Computing Machinery (ACM)

Subject

Software

Cited by 17 articles.

1. Code generation for accurate array redistribution on automatic distributed-memory parallelization;International Journal of Networked and Distributed Computing;2014

2. Data Decomposition for Code Parallelization in Practice: What Do the Experts Need?;2013 IEEE 10th International Conference on High Performance Computing and Communications & 2013 IEEE International Conference on Embedded and Ubiquitous Computing;2013-11

3. Code Generation for Accurate Array Redistribution on Automatic Distributed-Memory Parallelization;2013 14th ACIS International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing;2013-07

4. An Automatic Computation and Data Decomposition Algorithm of Prioritized Dominant Array;2012 13th International Conference on Parallel and Distributed Computing, Applications and Technologies;2012-12

5. An Improvement to Affine Decomposition on Distributed Memory Architecture;2012 11th International Symposium on Distributed Computing and Applications to Business, Engineering & Science;2012-10
