Affiliation:
1. Digital Equipment Corp., Palo Alto, CA
2. Digital Equipment Corporation, Palo Alto, CA
Abstract
This article describes the Digital Continuous Profiling Infrastructure, a sampling-based profiling system designed to run continuously on production systems. The system supports multiprocessors, works on unmodified executables, and collects profiles for entire systems, including user programs, shared libraries, and the operating system kernel. Samples are collected at a high rate (over 5200 samples/sec. per 333MHz processor), yet with low overhead (1–3% slowdown for most workloads). Analysis tools supplied with the profiling system use the sample data to produce a precise and accurate accounting, down to the level of pipeline stalls incurred by individual instructions, of where time is bring spent. When instructions incur stalls, the tools identify possible reasons, such as cache misses, branch mispredictions, and functional unit contention. The fine-grained instruction-level analysis guides users and automated optimizers to the causes of performance problems and provides important insights for fixing them.
Publisher
Association for Computing Machinery (ACM)
Cited by
111 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. DiagConfig: Configuration Diagnosis of Performance Violations in Configurable Software Systems;Proceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering;2023-11-30
2. TrivialSpy: Identifying Software Triviality via Fine-grained and Dataflow-based Value Profiling;Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis;2023-11-11
3. TEA: Time-Proportional Event Analysis;Proceedings of the 50th Annual International Symposium on Computer Architecture;2023-06-17
4. Precise Event Sampling on AMD Versus Intel: Quantitative and Qualitative Comparison;IEEE Transactions on Parallel and Distributed Systems;2023-05
5. Precise event sampling‐based data locality tools for AMD multicore architectures;Concurrency and Computation: Practice and Experience;2023-04-03