Affiliation:
1. The University of Manchester, Manchester, UK
Abstract
Collecting hardware event counts is essential to understanding program execution behavior. Contemporary systems offer few Performance Monitoring Counters (PMCs), thus only a small fraction of hardware events can be monitored simultaneously. We present new techniques to acquire counts for all available hardware events with high accuracy by multiplexing PMCs across multiple executions of the same program, then carefully reconciling and merging the multiple profiles into a single, coherent profile. We present a new metric for assessing the similarity of statistical distributions of event counts and show that our execution profiling approach performs significantly better than Hardware Event Multiplexing.
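The idea of spreading events across several executions can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes a hypothetical fixed number of counters per run, uses illustrative event names, and merges the per-run profiles naively, without the reconciliation step the abstract describes.

```python
# Illustrative sketch only: all names (schedule_events, merge_profiles,
# the event list, num_counters) are hypothetical, not from the paper.

def schedule_events(events, num_counters):
    """Split the event list into groups of at most num_counters events.

    Each group is meant to be monitored during one separate execution
    of the same program.
    """
    if num_counters < 1:
        raise ValueError("num_counters must be at least 1")
    return [events[i:i + num_counters]
            for i in range(0, len(events), num_counters)]


def merge_profiles(run_profiles):
    """Combine per-run {event: count} dicts into one profile.

    Naive merge: each event is assumed to be measured in exactly one run.
    Reconciling run-to-run variation is deliberately left out.
    """
    merged = {}
    for profile in run_profiles:
        for event, count in profile.items():
            if event in merged:
                raise ValueError(f"event measured twice: {event}")
            merged[event] = count
    return merged


events = ["cycles", "instructions", "cache-misses", "branches", "branch-misses"]
groups = schedule_events(events, num_counters=2)

# Stand-in for real measurements: one made-up profile per run.
fake_runs = [{name: 1000 * (i + 1) for name in group}
             for i, group in enumerate(groups)]
profile = merge_profiles(fake_runs)
```

With five events and two counters, this schedule needs three executions, and the merged profile holds one count for every event.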
Publisher
Association for Computing Machinery (ACM)
Subject
Hardware and Architecture, Information Systems, Software
Cited by
10 articles.
1. Efficient Cross-platform Multiplexing of Hardware Performance Counters via Adaptive Grouping;ACM Transactions on Architecture and Code Optimization;2024-01-19
2. Strategies and software support for the management of hardware performance counters;Software: Practice and Experience;2023-07-17
3. Profile-driven memory bandwidth management for accelerators and CPUs in QoS-enabled platforms;Real-Time Systems;2022-04-26
4. BayesPerf: minimizing performance monitoring errors using Bayesian statistics;Proceedings of the 26th ACM International Conference on Architectural Support for Programming Languages and Operating Systems;2021-04-17
5. MUCH;Proceedings of the 36th Annual ACM Symposium on Applied Computing;2021-03-22