Affiliation:
1. Uppsala University, Uppsala, Sweden
Abstract
Performance tools based on hardware counters can efficiently profile the cache behavior of an application and help software developers improve its cache utilization. Simulator-based tools can potentially provide more insights and flexibility and model many different cache configurations, but have the drawback of large run-time overhead.We present StatCache, a performance tool based on a statistical cache model. It has a small run-time overhead while providing much of the flexibility of simulator-based tools. A monitor process running in the background collects sparse memory access statistics about the analyzed application running natively on a host computer. Generic locality information is derived and presented in a code-centric and/or data-centric view.We evaluate the accuracy and performance of the tool using ten SPEC CPU2000 benchmarks. We also exemplify how the flexibility of the tool can be used to better understand the characteristics of cache-related performance problems.
Publisher
Association for Computing Machinery (ACM)
Subject
Computer Networks and Communications,Hardware and Architecture,Software
Cited by
30 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. GPU Scale-Model Simulation;2024 IEEE International Symposium on High-Performance Computer Architecture (HPCA);2024-03-02
2. DroidPerf: Profiling Memory Objects on Android Devices;Proceedings of the 29th Annual International Conference on Mobile Computing and Networking;2023-07-10
3. FLORIA: A Fast and Featherlight Approach for Predicting Cache Performance;Proceedings of the 37th International Conference on Supercomputing;2023-06-21
4. Precise event sampling‐based data locality tools for AMD multicore architectures;Concurrency and Computation: Practice and Experience;2023-04-03
5. DJXPerf: Identifying Memory Inefficiencies via Object-Centric Profiling for Java;Proceedings of the 21st ACM/IEEE International Symposium on Code Generation and Optimization;2023-02-17