A Hardware Design Framework for Computer Vision Models Based on Reconfigurable Devices

Published: 2024-01-15 Issue: 1 Volume: 17 Page: 1-31
ISSN: 1936-7406
Container-title: ACM Transactions on Reconfigurable Technology and Systems
Language: en
Short-container-title: ACM Trans. Reconfigurable Technol. Syst.

Author:

Fan Zimeng¹, Hu Wei¹, Liu Fang², Xu Dian¹, Guo Hong¹, He Yanxiang³, Peng Min³

Affiliation:

1. Wuhan University of Science and Technology, China

2. Wuhan University and Wuhan Institute of City, China

3. Wuhan University, China

Abstract

In computer vision, algorithms and computing hardware develop jointly and cannot be separated. Models and algorithms evolve constantly, and hardware designs must adapt to new or updated algorithms. Reconfigurable devices are recognized as important platforms for computer vision applications because of their reconfigurability. There are two typical design approaches: customized design and overlay design. However, existing work cannot achieve both efficient performance and the scalability needed to adapt to a wide range of models. To address both considerations, we propose a design framework based on reconfigurable devices that provides unified support for computer vision models. It provides software-programmable modules while leaving unit design space for problem-specific algorithms. Based on the proposed framework, we design a model mapping method and a hardware architecture with two processor arrays to enable dynamic and static reconfiguration, thereby relieving redesign pressure. In addition, resource consumption and efficiency can be balanced by adjusting the hyperparameter. In experiments on CNN, vision Transformer, and vision MLP models, our work improves throughput by 18.8x–33.6x compared to CPU and by 1.4x–2.0x compared to GPU. Compared to other designs on the same platform, accelerators based on our framework better balance resource consumption and efficiency.

Funder

National Natural Science Foundation of China

Publisher

Association for Computing Machinery (ACM)

Subject

General Computer Science

Link

https://dl.acm.org/doi/pdf/10.1145/3635157

References: 55 articles.

1. Mohamed S. Abdelfattah, David Han, Andrew Bitar, Roberto DiCecco, Shane O’Connell, Nitika Shanker, Joseph Chu, Ian Prins, Joshua Fender, Andrew C. Ling, and Gordon R. Chiu. 2018. DLA: Compiler and FPGA overlay for neural network inference acceleration. In 2018 28th International Conference on Field Programmable Logic and Applications (FPL’18). IEEE, 411–4117.

2. FFConv: An FPGA-based accelerator for fast convolution layers in convolutional neural networks; Ahmad Afzal; ACM Transactions on Embedded Computing Systems (TECS), 2020

3. Aman Arora, Zhigang Wei, and Lizy K. John. 2020. Hamamu: Specializing FPGAs for ML applications by adding hard matrix multiplier blocks. In 2020 IEEE 31st International Conference on Application-specific Systems, Architectures and Processors (ASAP’20). IEEE, 53–60.

4. Layer normalization; Ba Jimmy Lei; arXiv preprint arXiv:1607.06450, 2016

5. Mohammed Bahoura and Chan-Wang Park. 2011. FPGA-implementation of high-speed MLP neural network. In 2011 18th IEEE International Conference on Electronics, Circuits, and Systems. IEEE, 426–429.

Cited by 2 articles.

1. An Image-Retrieval Method Based on Cross-Hardware Platform Features; Applied System Innovation; 2024-07-23

2. ViT Hybrid Channel Fit Pruning Algorithm for Co-optimization of Hardware and Software for Edge Device; Lecture Notes in Computer Science; 2024