Transformer: An OS-Supported Reconfigurable Hybrid Memory Architecture
-
Published:2022-12-18
Issue:24
Volume:12
Page:12995
-
ISSN:2076-3417
-
Container-title:Applied Sciences
-
language:en
-
Short-container-title:Applied Sciences
Author:
Chi Ye, Liu Haikun, Peng Ganwei, Liao Xiaofei, Jin HaiORCID
Abstract
Non-volatile memories (NVMs) have aroused vast interest in hybrid memory systems due to their promising features of byte-addressability, high storage density, low cost per byte, and near-zero standby energy consumption. However, since NVMs have limited write endurance, high write latency, and high write energy consumption, it is still challenging to directly replace traditional dynamic random access memory (DRAM) with NVMs. Many studies propose to utilize NVM and DRAM in a hybrid memory system, and explore sophisticated memory management schemes to alleviate the impact of slow NVM on the performance of applications. A few studies architected DRAM and NVM in a cache/memory hierarchy. However, the storage and performance overhead of the cache metadata (i.e., tags) management is rather expensive in this hierarchical architecture. Some other studies architected NVM and DRAM in a single (flat) address space to form a parallel architecture. However, the hot page monitoring and migration are critical for the performance of applications in this architecture. In this paper, we propose Transformer, an OS-supported reconfigurable hybrid memory architecture to efficiently use DRAM and NVM without redesigning the hardware architecture. To identify frequently accessed (hot) memory pages for migration, we propose to count the number of page accesses in OSes by sampling the access bit of pages periodically. We further migrate the identified hot pages from NVM to DRAM to improve the performance of hybrid memory system. More importantly, Transformer can simulate a hierarchical hybrid memory architecture while DRAM and NVM are physically managed in a flat address space, and can dynamically shift the logical memory architecture between parallel and hierarchical architectures according to applications’ memory access patterns. Experimental results show that Transformer can improve the application performance by 62% on average (up to 2.7×) compared with an NVM-only system, and can also improve performance by up to 79% and 42% (21% and 24% on average) compared with hierarchical and parallel architectures, respectively.
Funder
National Natural Science Foundation of China National Key Research and Development Program of China
Subject
Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science
Reference27 articles.
1. Malladi, K.T., Shaeffer, I., Gopalakrishnan, L., Lo, D., Lee, B.C., and Horowitz, M. (2012, January 1–5). Rethinking DRAM power modes for energy proportionality. Proceedings of the 2012 45th Annual IEEE/ACM International Symposium on Microarchitecture, Vancouver, BC, Canada. 2. Recent progress of integrated circuits and optoelectronic chips;Hao;Sci. China Inf. Sci.,2021 3. Active low-power modes for main memory with memscale;Deng;IEEE Micro,2012 4. Lee, B.C., Ipek, E., Mutlu, O., and Burger, D. (2009, January 20–24). Architecting phase change memory as a scalable DRAM alternative. Proceedings of the 2009 36th Annual International Symposium on Computer Architecture (ISCA), Austin, TX, USA. 5. Xu, C., Niu, D., Muralimanohar, N., Jouppi, N.P., and Xie, Y. (June, January 29). Understanding the trade-offs in multi-level cell ReRAM memory design. Proceedings of the 2013 50th ACM/EDAC/IEEE Design Automation Conference (DAC), Austin, TX, USA.
|
|