Author:
Haijoub Abdelilah,Hatim Anas,Arioua Mounir,Hammia Slama,Eloualkadi Ahmed,Guerrero-González Antonio
Abstract
The implementation of CNN FPGA is of increasing importance due to the growing demand for low-power and high-performance edge AI applications. This paper presents a comprehensive survey and research on the topic, with a focus on comparing and evaluating the performance of two main FPGA architectures, streaming and single unit computing. The study includes a detailed evaluation of the state-of-the-art CNNs, LeNet-5 and YOLOv2, on both FPGA architectures. The results provide useful insights into the trade-offs involved, limitations, challenges, and the complexity of implementing CNNs on FPGAs. The paper highlights the difficulties and intricacies involved in implementing CNNs on FPGAs and provides potential solutions for improving performance and efficiency.
Reference22 articles.
1. A High-Throughput and Power-Efficient FPGA Implementation of YOLO CNN for Object Detection
2. Redmon J., Farhadi A., Yolo9000: Better, faster, stronger. 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2017)
3. Wang E., PYNQ Classification-Python on Zynq FPGA for Neural Networks, Imperial College London, Final Year Project Report, (2017)
4. Object Detection in 20 Years: A Survey
5. Sharma A., Singh V., Rani A., Implementation of CNN on Zynq based FPGA for realtime object detection. 2019 10th International Conference on Computing, Communication and Networking Technologies (ICCCNT) (2019)
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献