Forma-Reference-Cited by-同舟云学术

Forma

Published:2007-11 Issue:1 Volume:30 Page:2
ISSN:0164-0925
Container-title:ACM Transactions on Programming Languages and Systems
language:en
Short-container-title:ACM Trans. Program. Lang. Syst.

Author:

Zhao Peng¹,Cui Shimin²,Gao Yaoqing²,Silvera Raúl²,Amaral José Nelson¹

Affiliation:

1. University of Alberta, Edmonton, AB, Canada

2. IBM Toronto Software Laboratory, ON, Canada

Abstract

This article presents Forma , a practical, safe, and automatic data reshaping framework that reorganizes arrays to improve data locality. Forma splits large aggregated data-types into smaller ones to improve data locality. Arrays of these large data types are then replaced by multiple arrays of the smaller types. These new arrays form natural data streams that have smaller memory footprints, better locality, and are more suitable for hardware stream prefetching. Forma consists of a field-sensitive alias analyzer, a data type checker, a portable structure reshaping planner, and an array reshaper. An extensive experimental study compares different data reshaping strategies in two dimensions: (1) how the data structure is split into smaller ones ( maximal partition × frequency-based partition × affinity-based partition ); and (2) how partitioned arrays are linked to preserve program semantics ( address arithmetic-based reshaping × pointer-based reshaping ). This study exposes important characteristics of array reshaping. First, a practical data reshaper needs not only an inter-procedural analysis but also a data-type checker to make sure that array reshaping is safe. Second, the performance improvement due to array reshaping can be dramatic: standard benchmarks can run up to 2.1 times faster after array reshaping. Array reshaping may also result in some performance degradation for certain benchmarks. An extensive micro-architecture-level performance study identifies the causes for this degradation. Third, the seemingly naive maximal partition achieves best or close-to-best performance in the benchmarks studied. This article presents an analysis that explains this surprising result. Finally, address-arithmetic-based reshaping always performs better than its pointer-based counterpart.

Publisher

Association for Computing Machinery (ACM)

Subject

Software

Link

https://dl.acm.org/doi/pdf/10.1145/1290520.1290522

Reference42 articles.

1. Automatic loop interchange

2. Data and computation transformations for multiprocessors

3. Evaluating the impact of memory system performance on software prefetching and locality optimizations

4. Analysis of pointers and structures

Cited by 20 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Region-Based Data Layout via Data Reuse Analysis;Proceedings of the 33rd ACM SIGPLAN International Conference on Compiler Construction;2024-02-17

2. Data layout optimization based on the spatio-temporal model of field access;2022 5th International Conference on Advanced Electronic Materials, Computers and Software Engineering (AEMCSE);2022-04

3. A unifying abstraction for data structure splicing;Proceedings of the International Symposium on Memory Systems;2019-09-30

4. LWPTool: A Lightweight Profiler to Guide Data Layout Optimization;IEEE Transactions on Parallel and Distributed Systems;2018-11-01

5. Modular design of a factor-graph-based inference engine on a System-On-Chip (SoC);Microprocessors and Microsystems;2018-07