Affiliation:
1. Indian Institute of Technology Bhubaneswar, Bhubaneswar, Odisha, India
2. University of Agder, Grimstad, Norway
Abstract
Energy efficiency has become the new performance criterion in this era of pervasive embedded computing; thus, accelerator-rich multi-processor system-on-chips are commonly used in embedded computing hardware. Once computationally intensive machine learning applications gained much traction, they are now deployed in many application domains due to abundant and cheaply available computational capacity. In addition, there is a growing trend toward developing hardware accelerators for machine learning applications for embedded edge devices where performance and energy efficiency are critical. Although these hardware accelerators frequently use floating-point operations for accuracy, reduced-width floating-point formats are also used to reduce hardware complexity; thus, power consumption while maintaining accuracy. Vectorization concepts can also be used to improve performance, energy efficiency, and memory bandwidth. We propose the design of a vectorized floating-point adder/subtractor that supports arbitrary length floating-point formats with varying exponent and mantissa widths in this article. In comparison to existing designs in the literature, the proposed design is 2.57× area- and 1.56× power-efficient, and it supports true vectorization with no restrictions on exponent and mantissa widths.
Publisher
Association for Computing Machinery (ACM)
Subject
Electrical and Electronic Engineering,Computer Graphics and Computer-Aided Design,Computer Science Applications
Reference32 articles.
1. Aqil M. Azmi and Fabrizio Lombardi. 1989. On a tapered floating point system. In Proceedings of the 9th Symposium on Computer Arithmetic. IEEE, 2–9.
2. High-Precision Floating-Point Arithmetic in Scientific Computation
3. Shifted and squeezed 8-bit floating point format for low-precision training of deep neural networks;Cambier Léopold;arXiv preprint arXiv:2001.05674,2020
4. Custom arithmetic datapath design for FPGAs using the FloPoCo core generator;Dinechin Florent De;IEEE Des. Test Comput.,2011
5. Jean-Pierre Deschamps, Gery J. A. Bioul, and Gustavo D. Sutter. 2006. Synthesis of Arithmetic Circuits: FPGA, ASIC and Embedded Systems. John Wiley & Sons.