Performance Evaluation of Intel Optane Memory for Managed Workloads-Reference-Cited by-同舟云学术

Performance Evaluation of Intel Optane Memory for Managed Workloads

Published:2021-06 Issue:3 Volume:18 Page:1-26
ISSN:1544-3566
Container-title:ACM Transactions on Architecture and Code Optimization
language:en
Short-container-title:ACM Trans. Archit. Code Optim.

Author:

Akram Shoaib¹

Affiliation:

1. Australian National University, Canberra, Australia

Abstract

Intel Optane memory offers non-volatility, byte addressability, and high capacity. It suits managed workloads that prefer large main memory heaps. We investigate Optane as the main memory for managed (Java) workloads, focusing on performance scalability. As the workload (core count) increases, we note Optane’s performance relative to DRAM. A few workloads incur a slight slowdown on Optane memory, which helps conserve limited DRAM capacity. Unfortunately, other workloads scale poorly beyond a few core counts. This article investigates scaling bottlenecks for Java workloads on Optane memory, analyzing the application, runtime, and microarchitectural interactions. Poorly scaling workloads allocate objects rapidly and access objects in Optane memory frequently. These characteristics slow down the mutator and substantially slow down garbage collection (GC). At the microarchitecture level, load, store, and instruction miss penalties rise. To regain performance, we partition heaps across DRAM and Optane memory, a hybrid that scales considerably better than Optane alone. We exploit state-of-the-art GC approaches to partition heaps. Unfortunately, existing GC approaches needlessly waste DRAM capacity because they ignore runtime behavior. This article also introduces performance impact-guided memory allocation (PIMA) for hybrid memories. PIMA maximizes Optane utilization, allocating in DRAM only if it improves performance. It estimates the performance impact of allocating heaps in either memory type by sampling. We target PIMA at graph analytics workloads, offering a novel performance estimation method and detailed evaluation. PIMA identifies workload phases that benefit from DRAM with high (94.33%) accuracy, incurring only a 2% sampling overhead. PIMA operates stand-alone or combines with prior approaches to offer new performance versus DRAM capacity trade-offs. This work opens up Optane memory to a real-life role as the main memory for Java workloads.

Publisher

Association for Computing Machinery (ACM)

Subject

Hardware and Architecture,Information Systems,Software

Link

https://dl.acm.org/doi/pdf/10.1145/3451342

Reference71 articles.

1. Managing hybrid memories by predicting object write intensity

2. DEP+BURST: Online DVFS Performance Prediction for Energy-Efficient Managed Language Execution

3. Write-rationing garbage collection for hybrid memories

Cited by 10 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Unified Shared Memory: Friend or Foe? Understanding the Implications of Unified Memory on Managed Heaps;Proceedings of the 20th ACM SIGPLAN International Conference on Managed Programming Languages and Runtimes;2023-10-19

2. Carbon-Aware Memory Placement;Proceedings of the 2nd Workshop on Sustainable Computer Systems;2023-07-09

3. Flexible and Effective Object Tiering for Heterogeneous Memory Systems;Proceedings of the 2023 ACM SIGPLAN International Symposium on Memory Management;2023-06-06

4. Analyzing and Improving the Scalability of In-Memory Indices for Managed Search Engines;Proceedings of the 2023 ACM SIGPLAN International Symposium on Memory Management;2023-06-06

5. On the Implications of Heterogeneous Memory Tiering on Spark In-Memory Analytics;2023 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW);2023-05