Understanding Timing Error Characteristics from Overclocked Systolic Multiply–Accumulate Arrays in FPGAs
-
Published:2024-01-09
Issue:1
Volume:14
Page:4
-
ISSN:2079-9268
-
Container-title:Journal of Low Power Electronics and Applications
-
language:en
-
Short-container-title:JLPEA
Author:
Chamberlin Andrew1, Gerber Andrew1, Palmer Mason1, Goodale Tim1, Gundi Noel Daniel1ORCID, Chakraborty Koushik1, Roy Sanghamitra1
Affiliation:
1. Bridge Lab, Electrical and Computer Engineering, Utah State University, Logan, UT 84321, USA
Abstract
Artificial Intelligence (AI) hardware accelerators have seen tremendous developments in recent years due to the rapid growth of AI in multiple fields. Many such accelerators comprise a Systolic Multiply–Accumulate Array (SMA) as its computational brain. In this paper, we investigate the faulty output characterization of an SMA in a real silicon FPGA board. Experiments were run on a single Zybo Z7-20 board to control for process variation at nominal voltage and in small batches to control for temperature. The FPGA is rated up to 800 MHz in the data sheet due to the max frequency of the PLL, but the design is written using Verilog for the FPGA and C++ for the processor and synthesized with a chosen constraint of a 125 MHz clock. We then operate the system at a frequency range of 125 MHz to 450 MHz for the FPGA and the nominal 667 MHz for the processor core to produce timing errors in the FPGA without affecting the processor. Our extensive experimental platform with a hardware–software ecosystem provides a methodological pathway that reveals fascinating characteristics of SMA behavior under an overclocked environment. While one may intuitively expect that timing errors resulting from overclocked hardware may produce a wide variation in output values, our post-silicon evaluation reveals a lack of variation in erroneous output values. We found an intriguing pattern where error output values are stable for a given input across a range of operating frequencies far exceeding the rated frequency of the FPGA.
Funder
National Science Foundation
Reference34 articles.
1. Artificial Intelligence Image Recognition Method Based on Convolutional Neural Network Algorithm;Tian;IEEE Access,2020 2. Methodologic Guide for Evaluating Clinical Performance and Effect of Artificial Intelligence Technology for Medical Diagnosis and Prediction;Park;Radiology,2018 3. Amberkar, A., Awasarmol, P., Deshmukh, G., and Dave, P. (2018, January 1–3). Speech Recognition using Recurrent Neural Networks. Proceedings of the ICCTCT, Coimbatore, India. 4. Roelke, A., Zhang, R., Mazumdar, K., Wang, K., Skadron, K., and Stan, M.R. (2017, January 5–8). Pre-RTL Voltage and Power Optimization for Low-Cost, Thermally Challenged Multicore Chips. Proceedings of the 2017 IEEE International Conference on Computer Design (ICCD), Boston, MA, USA. 5. Swaminathan, K., Chandramoorthy, N., Cher, C.Y., Bertran, R., Buyuktosunoglu, A., and Bose, P. (2017, January 4–8). BRAVO: Balanced Reliability-Aware Voltage Optimization. Proceedings of the 2017 IEEE International Symposium on High Performance Computer Architecture (HPCA), Austin, TX, USA.
|
|