Affiliation:
1. Huazhong University of Science and Technology, Wuhan, China
2. Microsoft Research, Beijing, China
Abstract
Data persistence is necessary for many in-memory applications. However, disk-based data persistence significantly slows down these applications. Emerging non-volatile memory (NVM) offers an opportunity to achieve in-memory data persistence at DRAM-level performance. Nevertheless, NVM typically requires a software library to operate on NVM data, which brings significant overhead.
This article demonstrates that a hardware-based high-frequency checkpointing mechanism can be used to achieve efficient in-memory data persistence on NVM. To maintain checkpoint consistency, traditional logging and copy-on-write techniques incur excessive NVM writes that impair both performance and endurance of NVM; recent work attempts to solve the issue but requires a large amount of metadata in the memory controller. Hence, we design a new dual-page checkpointing system, which achieves low metadata cost and eliminates most excessive NVM writes at the same time. It breaks the traditional trade-off between metadata space cost and extra data writes. Our solution outperforms the state-of-the-art NVM software libraries by 13.6× in throughput, and leads to 34% less NVM wear-out and 1.28× higher throughput than state-of-the-art hardware checkpointing solutions, according to our evaluation with OLTP, graph computing, and machine-learning workloads.
Funder
National Science Foundation of China
Pre-research Project of Beifang
National Key Research and Development Program
Publisher
Association for Computing Machinery (ACM)
Subject
Hardware and Architecture, Information Systems, Software
Reference
78 articles.
1. Adaptive incremental checkpointing for massively parallel systems
2. Alluxio Open Foundation. 2017. Open Source Memory Speed Virtual Distributed Storage. Retrieved from http://www.alluxio.org/.
3. Spin-transfer torque magnetic random access memory (STT-MRAM)
4. Spark SQL
Cited by
8 articles.
1. Using Logging-on-Write to Improve Non-Volatile Memory Checkpoints via Processing-in-Memory;2023 IEEE 35th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD);2023-10-17
2. DONUTS: An efficient method for checkpointing in non‐volatile memories;Concurrency and Computation: Practice and Experience;2023-01-24
3. UCat: heterogeneous memory management for unikernels;Frontiers of Computer Science;2022-08-08
4. libcrpm;Proceedings of the 59th ACM/IEEE Design Automation Conference;2022-07-10
5. NVOverlay: Enabling Efficient and Scalable High-Frequency Snapshotting to NVM;2021 ACM/IEEE 48th Annual International Symposium on Computer Architecture (ISCA);2021-06