Design Strategies of 40 nm Split-Gate NOR Flash Memory Device for Low-Power Compute-in-Memory Applications
-
Published:2023-09-07
Issue:9
Volume:14
Page:1753
-
ISSN:2072-666X
-
Container-title:Micromachines
-
language:en
-
Short-container-title:Micromachines
Author:
Yook Chan-Gi1ORCID, Kim Jung Nam2ORCID, Kim Yoon2, Shim Wonbo1ORCID
Affiliation:
1. Department of Electrical and Information Engineering, Seoul National University of Science and Technology, Seoul 01811, Republic of Korea 2. School of Electrical and Computer Engineering, University of Seoul, Seoul 02504, Republic of Korea
Abstract
The existing von Neumann architecture for artificial intelligence (AI) computations suffers from excessive power consumption and memory bottlenecks. As an alternative, compute-in-memory (CIM) technology has been emerging. Among various CIM device candidates, split-gate NOR flash offers advantages such as a high density and low on-state current, enabling low-power operation, and benefiting from a high level of technological maturity. To achieve high energy efficiency and high accuracy in CIM inference chips, it is necessary to optimize device design by targeting low power consumption at the device level and surpassing baseline accuracy at the system level. In split-gate NOR flash, significant factors that can cause CIM inference accuracy drop are the device conductance variation, caused by floating gate charge variation, and a low on-off current ratio. Conductance variation generally has a trade-off relationship with the on-current, which greatly affects CIM dynamic power consumption. In this paper, we propose strategies for designing optimal devices by adjusting oxide thickness and other structural parameters. As a result of setting Tox,FG to 13.4 nm, TIPO to 4.6 nm and setting other parameters to optimal points, the design achieves erase on-current below 2 μA, program on-current below 10 pA, and off-current below 1 pA, while maintaining an inference accuracy of over 92%.
Funder
Seoul National University of Science and Technology
Subject
Electrical and Electronic Engineering,Mechanical Engineering,Control and Systems Engineering
Reference26 articles.
1. A multifunctional in-memory inference processor using a standard 6T SRAM array;Kang;IEEE J. Solid-State Circuits,2018 2. Dong, Q., Sinangil, M.E., Erbagci, B., Sun, D., Khwa, W.-S., Liao, H.-J., Wang, Y., and Chang, J. (2020, January 16–20). A 351TOPS/W and 372.4 GOPS compute-in-memory SRAM macro in 7 nm FinFET CMOS for machine-learning applications. Proceedings of the IEEE International Solid-State Circuits Conference (ISSCC), San Francisco, CA, USA. 3. Su, J.-W., Si, X., Chou, Y.-C., Chang, T.-W., Huang, W.-H., Tu, Y.-N., Liu, R., Lu, P.-J., Liu, T.-W., and Wang, J.-H. (2020, January 16–20). 15.2 A 28 nm 64 Kb Inference-Training Two-Way Transpose Multibit 6T SRAM Compute-in-Memory Macro for AI Edge Chips. Proceedings of the 2020 IEEE International Solid- State Circuits Conference—(ISSCC), San Francisco, CA, USA. 4. Monolithically Integrated RRAM- and CMOS-Based In-Memory Computing Optimizations for Efficient Deep Learning;Yin;IEEE Micro,2019 5. Xue, C.-X., Huang, T.-Y., Liu, J.-S., Chang, T.-W., Kao, H.-Y., Wang, J.-H., Liu, T.-W., Wei, S.-Y., Huang, S.-P., and Wei, W.-C. (2020, January 16–20). A 22 nm 2 Mb ReRAM compute-in-memory macro with 121-28TOPS/W for multibit MAC computing for tiny AI edge devices. Proceedings of the IEEE International Solid-State Circuits Conference (ISSCC), San Francisco, CA, USA.
|
|