Affiliation:
1. The Chair for Processor Design, Center for Advancing Electronics Dresden (CfAED), Technische Universität Dresden, Dresden, Germany
Abstract
Approximate arithmetic operators, such as adders and multipliers, are increasingly used to satisfy the energy and performance requirements of resource-constrained embedded systems. However, most of the available approximate operators have an application-agnostic design methodology, and the efficacy of these operators can only be evaluated by employing them in the applications. Furthermore, the various available libraries of approximate operators do not share any standard approximation-induction policy to design new operators according to an application’s accuracy and performance constraints. These limitations also hinder the utilization of machine learning models to explore and determine approximate operators according to an application’s requirements. In this work, we present a generic design methodology for implementing FPGA-based application-specific approximate arithmetic operators. Our proposed technique utilizes lookup tables and carry-chains of FPGAs to implement approximate operators according to the input configurations. For instance, for an
\( \text{M}\times \text{N} \)
accurate multiplier utilizing
K
lookup tables, our methodology utilizes
K
-bit configurations to design
\( 2^K \)
approximate multipliers. We then utilize various machine learning models to evaluate and select configurations satisfying application accuracy and performance constraints. We have evaluated our proposed methodology for three benchmark applications, i.e., biomedical signal processing, image processing, and ANNs. We report more non-dominated approximate multipliers with better hypervolume contribution than state-of-the-art designs for these benchmark applications with the proposed design methodology.
Funder
German Research Foundation
Publisher
Association for Computing Machinery (ACM)
Subject
Hardware and Architecture,Software
Reference38 articles.
1. Martín Abadi, Paul Barham, Jianmin Chen, Zhifeng Chen, Andy Davis, Jeffrey Dean, Matthieu Devin, Sanjay Ghemawat, Geoffrey Irving, Michael Isard, et al. 2016. Tensorflow: A system for large-scale machine learning. In Proceedings of the 12th \( \lbrace \) USENIX \( \rbrace \) Symposium on Operating Systems Design and Implementation. 265–283.
2. Anonymous. 2021. MNIST-cnn. Retrieved February 2 2021 from https://github.com/integeruser/MNIST-cnn.
3. Data mining with decision trees and decision rules
4. A Two's Complement Parallel Array Multiplication Algorithm
5. A parallel global multiobjective framework for optimization: pagmo
Cited by
17 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献