Understanding Timing Error Characteristics from Overclocked Systolic Multiply–Accumulate Arrays in FPGAs-Reference-Cited by-同舟云学术

Understanding Timing Error Characteristics from Overclocked Systolic Multiply–Accumulate Arrays in FPGAs

Published:2024-01-09 Issue:1 Volume:14 Page:4
ISSN:2079-9268
Container-title:Journal of Low Power Electronics and Applications
language:en
Short-container-title:JLPEA

Author:

Chamberlin Andrew¹,Gerber Andrew¹,Palmer Mason¹,Goodale Tim¹,Gundi Noel Daniel¹^ORCID,Chakraborty Koushik¹,Roy Sanghamitra¹

Affiliation:

1. Bridge Lab, Electrical and Computer Engineering, Utah State University, Logan, UT 84321, USA

Abstract

Artificial Intelligence (AI) hardware accelerators have seen tremendous developments in recent years due to the rapid growth of AI in multiple fields. Many such accelerators comprise a Systolic Multiply–Accumulate Array (SMA) as its computational brain. In this paper, we investigate the faulty output characterization of an SMA in a real silicon FPGA board. Experiments were run on a single Zybo Z7-20 board to control for process variation at nominal voltage and in small batches to control for temperature. The FPGA is rated up to 800 MHz in the data sheet due to the max frequency of the PLL, but the design is written using Verilog for the FPGA and C++ for the processor and synthesized with a chosen constraint of a 125 MHz clock. We then operate the system at a frequency range of 125 MHz to 450 MHz for the FPGA and the nominal 667 MHz for the processor core to produce timing errors in the FPGA without affecting the processor. Our extensive experimental platform with a hardware–software ecosystem provides a methodological pathway that reveals fascinating characteristics of SMA behavior under an overclocked environment. While one may intuitively expect that timing errors resulting from overclocked hardware may produce a wide variation in output values, our post-silicon evaluation reveals a lack of variation in erroneous output values. We found an intriguing pattern where error output values are stable for a given input across a range of operating frequencies far exceeding the rated frequency of the FPGA.

Funder

National Science Foundation

Publisher

MDPI AG

Link

https://www.mdpi.com/2079-9268/14/1/4/pdf

Reference34 articles.

1. Artificial Intelligence Image Recognition Method Based on Convolutional Neural Network Algorithm;Tian;IEEE Access,2020

2. Methodologic Guide for Evaluating Clinical Performance and Effect of Artificial Intelligence Technology for Medical Diagnosis and Prediction;Park;Radiology,2018

3. Amberkar, A., Awasarmol, P., Deshmukh, G., and Dave, P. (2018, January 1–3). Speech Recognition using Recurrent Neural Networks. Proceedings of the ICCTCT, Coimbatore, India.

4. Roelke, A., Zhang, R., Mazumdar, K., Wang, K., Skadron, K., and Stan, M.R. (2017, January 5–8). Pre-RTL Voltage and Power Optimization for Low-Cost, Thermally Challenged Multicore Chips. Proceedings of the 2017 IEEE International Conference on Computer Design (ICCD), Boston, MA, USA.

5. Swaminathan, K., Chandramoorthy, N., Cher, C.Y., Bertran, R., Buyuktosunoglu, A., and Bose, P. (2017, January 4–8). BRAVO: Balanced Reliability-Aware Voltage Optimization. Proceedings of the 2017 IEEE International Symposium on High Performance Computer Architecture (HPCA), Austin, TX, USA.