Using cache line coloring to perform aggressive procedure inlining

Published:2000-03 Issue:1 Volume:28 Page:62-71
ISSN:0163-5964
Container-title:ACM SIGARCH Computer Architecture News
language:en
Short-container-title:SIGARCH Comput. Archit. News

Author:

Aydin Hakan¹, Kaeli David¹

Affiliation:

1. Department of Electrical and Computer Engineering, Northeastern University, Boston, MA

Abstract

Memory hierarchy performance has always been an important issue in computer architecture design. The likelihood of a bottleneck in the memory hierarchy is increasing, as improvements in microprocessor performance continue to outpace those made in the memory system. As a result, effective utilization of cache memories is essential in today's architectures. The nature of procedural software poses visibility problems when attempting to perform program optimization. One approach to increasing visibility in procedural design is to perform procedure inlining. The main downside of using inlining is that inlined procedures can place excess pressure on the instruction cache. To address this issue, we attempt to perform code reordering. By combining reordering with aggressive inlining, a larger executable image produced through inlining can be effectively remapped onto the cache address space, while not noticeably increasing the instruction cache miss rate. In this paper, we evaluate our ability to perform aggressive inlining by employing cache line coloring. We have implemented three variations of our coloring algorithm in the Alto toolset and compare them against Alto's aggressive basic block reordering algorithms. Alto allows us to generate optimized executables that can be run on hardware to generate results. We find that by using our algorithms, we can achieve up to a 21% reduction in execution runtime over the base Compaq optimizing compiler, and a 6.4% reduction when compared to Alto's interprocedural basic block reordering algorithm.
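The abstract does not spell out the coloring algorithm, so the following is only a minimal sketch of the general idea of remapping code onto the cache address space, not the authors' method. It assumes a tiny direct-mapped instruction cache; `LINE_SIZE`, `NUM_LINES`, `colors`, `place`, and the procedure names and call counts are all hypothetical and chosen for illustration. Each procedure is assigned the set of cache lines ("colors") it would occupy, and a greedy pass places procedures so that frequently interacting caller/callee pairs avoid sharing colors.

```python
LINE_SIZE = 32   # bytes per cache line (assumed value)
NUM_LINES = 8    # lines in a direct-mapped cache (assumed, tiny for illustration)


def colors(start, size):
    """Cache line indices (colors) touched by code occupying [start, start + size)."""
    first = start // LINE_SIZE
    last = (start + size - 1) // LINE_SIZE
    return {line % NUM_LINES for line in range(first, last + 1)}


def place(procs, edges):
    """Greedy layout sketch.

    procs: name -> size in bytes
    edges: (caller, callee) -> call count
    Returns name -> start address.
    """
    # Visit procedures on the heaviest call edges first.
    order = []
    for (a, b), _ in sorted(edges.items(), key=lambda kv: -kv[1]):
        for p in (a, b):
            if p not in order:
                order.append(p)
    order += [p for p in procs if p not in order]

    layout = {}
    next_free = 0
    for p in order:
        # Colors already used by placed call-graph neighbours of p.
        taken = set()
        for (a, b) in edges:
            other = b if a == p else a if b == p else None
            if other in layout:
                taken |= colors(layout[other], procs[other])
        # Try line-aligned offsets within one cache size; keep the least overlap.
        candidates = (next_free + k * LINE_SIZE for k in range(NUM_LINES))
        best = min(candidates, key=lambda addr: len(colors(addr, procs[p]) & taken))
        layout[p] = best
        # Advance past p, rounded up to a whole number of cache lines.
        next_free = best + -(-procs[p] // LINE_SIZE) * LINE_SIZE
    return layout


# Hypothetical procedures and call counts.
procs = {"main": 64, "hot": 96, "cold": 128, "helper": 64}
edges = {("main", "hot"): 1000, ("hot", "helper"): 900, ("main", "cold"): 1}
layout = place(procs, edges)
for name, addr in sorted(layout.items(), key=lambda kv: kv[1]):
    print(name, addr, sorted(colors(addr, procs[name])))
```

This sketch leaves padding wherever it skips an offset to avoid a conflict; a practical layout tool would presumably fill such gaps with rarely executed code instead of wasting the space.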

Publisher

Association for Computing Machinery (ACM)

Link

https://dl.acm.org/doi/pdf/10.1145/346023.346046

Cited by 4 articles.

1. Combining code reordering and cache configuration;ACM Transactions on Embedded Computing Systems;2012-12

2. Low-Energy Instruction Cache Optimization Techniques for Embedded Systems;Handbook of Energy-Aware and Green Computing, Volume 1;2012-01-24

3. Aggressive Function Inlining: Preventing Loop Blockings in the Instruction Cache;High Performance Embedded Architectures and Compilers;2008

4. Thread coloring;ACM SIGOPS Operating Systems Review;2005-04