Power, Area, and Performance Optimization of Standard Cell Memory Arrays Through Controlled Placement

Author:

Teman Adam1,Rossi Davide2,Meinerzhagen Pascal3,Benini Luca4,Burg Andreas5

Affiliation:

1. École Polytechnique Fédérale de Lausanne, Bar-Ilan University, Switzerland

2. University of Bologna, Italy

3. Bar-Ilan University, Ramat Gan, Israel

4. University of Bologna, ETH-Zurich, Italy

5. École Polytechnique Fédérale de Lausanne, Switzerland

Abstract

Embedded memory remains a major bottleneck in current integrated circuit design in terms of silicon area, power dissipation, and performance; however, static random access memories (SRAMs) are almost exclusively supplied by a small number of vendors through memory generators, targeted at rather generic design specifications. As an alternative, standard cell memories (SCMs) can be defined, synthesized, and placed and routed as an integral part of a given digital system, providing complete design flexibility, good energy efficiency, low-voltage operation, and even area efficiency for small memory blocks. Yet implementing an SCM block with a standard digital flow often fails to exploit the distinct and regular structure of such an array, leaving room for optimization. In this article, we present a design methodology for optimizing the physical implementation of SCM macros as part of the standard design flow. This methodology introduces controlled placement, leading to a structured, noncongested layout with close to 100% placement utilization, resulting in a smaller silicon footprint, reduced wire length, and lower power consumption compared to SCMs without controlled placement. This methodology is demonstrated on SCM macros of various sizes and aspect ratios in a state-of-the-art 28nm fully depleted silicon-on-insulator technology, and compared with equivalent macros designed with the noncontrolled, standard flow, as well as with foundry-supplied SRAM macros. The controlled SCMs provide an average 25% reduction in area as compared to noncontrolled implementations while achieving a smaller size than SRAM macros of up to 1Kbyte. Power and performance comparisons of controlled SCM blocks of a commonly found 256 × 32 (1 Kbyte) memory with foundry-provided SRAMs show greater than 65% and 10% reduction in read and write power, respectively, while providing faster access than their SRAM counterparts, despite being of an aspect ratio that is typically unfavorable for SCMs. In addition, the SCM blocks function correctly with a supply voltage as low as 0.3V, well below the lower limit of even the SRAM macros optimized for low-voltage operation. The controlled placement methodology is applied within a full-chip physical implementation flow of an OpenRISC-based test chip, providing more than 50% power reduction compared to equivalently sized compiled SRAMs under a benchmark application.

Funder

Swiss Government Excellence Scholarship

HiPer Consortium under the “Magnet” program of the office of the chief scientist in the Israeli Ministry of Economy

FP7 project PHIDIAS

Nano-Tera.ch with Swiss Confederation financing under the IcySoC project

Publisher

Association for Computing Machinery (ACM)

Subject

Electrical and Electronic Engineering,Computer Graphics and Computer-Aided Design,Computer Science Applications

Reference21 articles.

1. Dual-VT 4kb sub-VT memories with <1 pW/bit leakage in 65 nm CMOS

2. Energy-aware design of embedded memories

3. A 256-kb 65-nm Sub-threshold SRAM Design for Ultra-Low-Voltage Operation

4. Process and design solutions for exploiting FD-SOI technology towards energy efficient SOCs

5. ITRS. 2015. International Technology Roadmap for Semiconductors 2015 Edition. Available at http://www.itrs2.net/itrs-reports.html. ITRS. 2015. International Technology Roadmap for Semiconductors 2015 Edition. Available at http://www.itrs2.net/itrs-reports.html.

Cited by 37 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Energy and Bandwidth Efficient Sparse Programmable Dataflow Accelerator;IEEE Transactions on Circuits and Systems I: Regular Papers;2024-09

2. A novel 1T-1D single ended SRAM cell using FinFET technology for low power applications;i-manager’s Journal on Electronics Engineering;2024

3. SENECA: building a fully digital neuromorphic processor, design trade-offs and challenges;Frontiers in Neuroscience;2023-06-23

4. Reconfigurable spatial-parallel stochastic computing for accelerating sparse convolutional neural networks;Science China Information Sciences;2023-05-17

5. Energy Efficiency of Opportunistic Refreshing for Gain-Cell Embedded DRAM;IEEE Transactions on Circuits and Systems I: Regular Papers;2023-04

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3