Hiding the Long Latency of Persist Barriers Using Speculative Execution-Reference-Cited by-同舟云学术

Hiding the Long Latency of Persist Barriers Using Speculative Execution

Published:2017-09-14 Issue:2 Volume:45 Page:175-186
ISSN:0163-5964
Container-title:ACM SIGARCH Computer Architecture News
language:en
Short-container-title:SIGARCH Comput. Archit. News

Author:

Shin Seunghee¹,Tuck James¹,Solihin Yan¹

Affiliation:

1. Dept. of Electrical and Computer Engineering, North Carolina State University, NC, USA

Abstract

Byte-addressable non-volatile memory technology is emerging as an alternative for DRAM for main memory. This new Non-Volatile Main Memory (NVMM) allows programmers to store important data in data structures in memory instead of serializing it to the file system, thereby providing a substantial performance boost. However, modern systems reorder memory operations and utilize volatile caches for better performance, making it difficult to ensure a consistent state in NVMM. Intel recently announced a new set of persistence instructions, clflushopt, clwb, and pcommit. These new instructions make it possible to implement fail-safe code on NVMM, but few workloads have been written or characterized using these new instructions. In this work, we describe how these instructions work and how they can be used to implement write-ahead logging based transactions. We implement several common data structures and kernels and evaluate the performance overhead incurred over traditional non-persistent implementations. In particular, we find that persistence instructions occur in clusters along with expensive fence operations, they have long latency, and they add a significant execution time overhead, on average by 20.3% over code with logging but without fence instructions to order persists. To deal with this overhead and alleviate the performance bottleneck, we propose to speculate past long latency persistency operations using checkpoint-based processing. Our speculative persistence architecture reduces the execution time overheads to only 3.6%.

Publisher

Association for Computing Machinery (ACM)

Link

https://dl.acm.org/doi/pdf/10.1145/3140659.3080240

Reference49 articles.

1. Resistive Random Access Memory (ReRAM) Based on Metal Oxides

2. Unbounded Transactional Memory

3. Write-Aware Management of NVM-based Memory Extensions

Cited by 4 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Dolos: Improving the Performance of Persistent Applications in ADR-Supported Secure Memory;MICRO-54: 54th Annual IEEE/ACM International Symposium on Microarchitecture;2021-10-17

2. Bonsai Merkle Forests: Efficiently Achieving Crash Consistency in Secure Persistent Memory;MICRO-54: 54th Annual IEEE/ACM International Symposium on Microarchitecture;2021-10-17

3. COSPlay: Leveraging Task-Level Parallelism for High-Throughput Synchronous Persistence;MICRO-54: 54th Annual IEEE/ACM International Symposium on Microarchitecture;2021-10-17

4. CCHL: Compression-Consolidation Hardware Logging for Efficient Failure-Atomic Persistent Memory Updates;49th International Conference on Parallel Processing - ICPP;2020-08-17