Load-reuse analysis

Author:

Bodík Rastislav1,Gupta Rajiv1,Soffa Mary Lou1

Affiliation:

1. Dept. of Computer Science, University of Pittsburgh, Pittsburgh, PA

Abstract

Load-reuse analysis finds instructions that repeatedly access the same memory location. This location can be promoted to a register, eliminating redundant loads by reusing the results of prior memory accesses. This paper develops a load-reuse analysis and designs a method for evaluating its precision.In designing the analysis, we aspire for completeness ---the goal of exposing all reuse that can be harvested by a subsequent program transformation. For register promotion, a suitable transformation is partial redundancy elimination (PRE). To approach the ideal goal of PRE-completeness, the load-reuse analysis is phrased as a data-flow problem on a program representation that is path-sensitive , as it detects reuse even when it originates in a different instruction along each control flow path. Furthermore, the analysis is comprehensive , as it treats scalar, array and pointer-based loads uniformly.In evaluating the analysis, we compare it with an ideal analysis. By observing the run-time stream of memory references, we collect all PRE-exploitable reuse and treat it as the ideal analysis performance. To compare the (static) load-reuse analysis with the (dynamic) ideal reuse, we use an estimator algorithm that computes, given a data-flow solution and a program profile, the dynamic amount of reuse detected by the analysis. We developed a family of estimators that differ in how well they bound the profiling error inherent in the edge profile. By bounding the error, the estimators offer a precise and practical method for determining the run-time optimization benefit.Our experiments show that about 55% of loads executed in Spec95 exhibit reuse. Of those, our analysis exposes about 80%.

Publisher

Association for Computing Machinery (ACM)

Subject

Computer Graphics and Computer-Aided Design,Software

Reference34 articles.

1. Type feedback vs. concrete type inference

2. Edge profiling versus path profiling

3. Rastislav Bodik. Path-Sensitive Value-Flow Optimizations. PhD thesis University of Pittsburgh in preparation. Rastislav Bodik. Path-Sensitive Value-Flow Optimizations. PhD thesis University of Pittsburgh in preparation.

4. Path-sensitive value-flow analysis

Cited by 18 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. OmpMemOpt: Optimized Memory Movement for Heterogeneous Computing;Euro-Par 2020: Parallel Processing;2020

2. Studying Optimal Spilling in the Light of SSA;ACM Transactions on Architecture and Code Optimization;2015-01-09

3. Inter-iteration Scalar Replacement Using Array SSA Form;Lecture Notes in Computer Science;2014

4. Eliminating Redundant Computation and Exposing Parallelism through Data-Triggered Threads;IEEE Micro;2012-05

5. On Reducing Hidden Redundant Memory Accesses for DSP Applications;IEEE Transactions on Very Large Scale Integration (VLSI) Systems;2011-06

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3