Affiliation:
1. Stony Brook University, Stony Brook, NY, USA
Abstract
Most hardware and software venders suggest disabling hardware prefetching in virtualized environments. They claim that prefetching is detrimental to application performance due to inaccurate prediction caused by workload diversity and VM interference on shared cache. However, no comprehensive or quantitative measurements to support this belief have been performed.
This paper is the first to systematically measure the influence of hardware prefetching in virtualized environments. We examine a wide variety of benchmarks on three types of chip-multiprocessors (CMPs) to analyze the hardware prefetching performance. We conduct extensive experiments by taking into account a number of important virtualization factors. We find that hardware prefetching has minimal destructive influence under most configurations. Only with certain application combinations does prefetching influence the overall performance.
To leverage these findings and make hardware prefetching effective across a diversity of virtualized environments, we propose a dynamic prefetching-aware VCPU-core binding approach (PAVCB), which includes two phases - classifying and binding. The workload of each VM is classified into different cache sharing constraint categories based upon its cache access characteristics, considering both prefetch requests and demand requests. Then following heuristic rules, the VCPUs of each VM are scheduled onto appropriate cores subject to cache sharing constraints. We show that the proposed approach can improve performance by 12% on average over the default scheduler and 46% over manual system administrator bindings across different workload combinations in the presence of hardware prefetching.
Publisher
Association for Computing Machinery (ACM)
Subject
Computer Graphics and Computer-Aided Design,Software
Cited by
12 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Reinforcement Learning Based Prefetch-Control Mechanism;2023 IEEE Asia Pacific Conference on Circuits and Systems (APCCAS);2023-11-19
2. CluMP: Clustered Markov Chain for Storage I/O Prefetch;Electronics;2023-07-31
3. Puppeteer: A Random Forest Based Manager for Hardware Prefetchers Across the Memory Hierarchy;ACM Transactions on Architecture and Code Optimization;2022-12-16
4. Machine Learning for Fine-Grained Hardware Prefetcher Control;Proceedings of the 48th International Conference on Parallel Processing;2019-08-05
5. Make the Most out of Last Level Cache in Intel Processors;Proceedings of the Fourteenth EuroSys Conference 2019;2019-03-25