Affiliation:
1. University of California, San Diego
Abstract
In a single second a modern processor can execute billions of instructions. Obtaining a bird's eye view of the behavior of a program at these speeds can be a difficult task when all that is available is cycle by cycle examination. In many programs, behavior is anything but steady state, and understanding the patterns of behavior, at run-time, can unlock a multitude of optimization opportunities.In this paper, we present a unified profiling architecture that can efficiently capture, classify, and predict phase-based program behavior on the largest of time scales. By examining the proportion of instructions that were executed from different sections of code, we can find generic phases that correspond to changes in behavior across many metrics. By classifying phases generically, we avoid the need to identify phases for each optimization, and enable a unified prediction scheme that can forecast future behavior. Our analysis shows that our design can capture phases that account for over 80% of execution using less that 500 bytes of on-chip memory.
Publisher
Association for Computing Machinery (ACM)
Cited by
37 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. A Neural Network-Based Approach to Dynamic Core Morphing for AMPs;2023 IEEE International Symposium on Smart Electronic Systems (iSES);2023-12-18
2. Sieve: Stratified GPU-Compute Workload Sampling;2023 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS);2023-04
3. Production-Run Noise Detection;Performance Analysis of Parallel Applications for HPC;2023
4. Detecting Performance Variance for Parallel Applications Without Source Code;IEEE Transactions on Parallel and Distributed Systems;2022-12-01
5. AppEKG: A Simple Unifying View of HPC Applications in Production;2022 IEEE/ACM International Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems (PMBS);2022-11