Design of Synthesis-time Vectorized Arithmetic Hardware for Tapered Floating-point Addition and Subtraction

Published:2023-03-22 Issue:3 Volume:28 Page:1-35
ISSN:1084-4309
Container-title:ACM Transactions on Design Automation of Electronic Systems
language:en
Short-container-title:ACM Trans. Des. Autom. Electron. Syst.

Author:

Bommana Ashish Reddy¹, Siddamshetty Susheel Ujwal¹, Pudi Dhilleswararao¹, Thumatti K. R. Arvind¹, Boppu Srinivas¹, Sabarimalai Manikandan M¹, Cenkeramaddi Linga Reddy²

Affiliation:

1. Indian Institute of Technology Bhubaneswar, Bhubaneswar, Odisha, India

2. University of Agder, Grimstad, Norway

Abstract

Energy efficiency has become the new performance criterion in this era of pervasive embedded computing; thus, accelerator-rich multi-processor systems-on-chip are commonly used in embedded computing hardware. Machine learning applications, once considered computationally intensive, have gained much traction and are now deployed in many application domains thanks to abundant and cheaply available computational capacity. In addition, there is a growing trend toward developing hardware accelerators for machine learning applications on embedded edge devices, where performance and energy efficiency are critical. Although these hardware accelerators frequently use floating-point operations for accuracy, reduced-width floating-point formats are also used to reduce hardware complexity, and thus power consumption, while maintaining accuracy. Vectorization concepts can also be used to improve performance, energy efficiency, and memory bandwidth. In this article, we propose the design of a vectorized floating-point adder/subtractor that supports arbitrary-length floating-point formats with varying exponent and mantissa widths. Compared with existing designs in the literature, the proposed design is 2.57× more area-efficient and 1.56× more power-efficient, and it supports true vectorization with no restrictions on exponent and mantissa widths.

Publisher

Association for Computing Machinery (ACM)

Subject

Electrical and Electronic Engineering,Computer Graphics and Computer-Aided Design,Computer Science Applications

Link

https://dl.acm.org/doi/pdf/10.1145/3567423

Reference: 32 articles.

1. Aqil M. Azmi and Fabrizio Lombardi. 1989. On a tapered floating point system. In Proceedings of the 9th Symposium on Computer Arithmetic. IEEE, 2–9.

2. High-Precision Floating-Point Arithmetic in Scientific Computation

3. Shifted and squeezed 8-bit floating point format for low-precision training of deep neural networks; Cambier Léopold; arXiv preprint arXiv:2001.05674, 2020

4. Custom arithmetic datapath design for FPGAs using the FloPoCo core generator; Dinechin Florent De; IEEE Des. Test Comput., 2011

5. Jean-Pierre Deschamps, Gery J. A. Bioul, and Gustavo D. Sutter. 2006. Synthesis of Arithmetic Circuits: FPGA, ASIC and Embedded Systems. John Wiley & Sons.