Energy-Performance Considerations for Data Offloading to FPGA-Based Accelerators Over PCIe

Authors:

Mbakoyiannis Dimitrios 1, Tomoutzoglou Othon 1, Kornaros George 1 (ORCID)

Affiliation:

1. Technological Educational Institute of Crete, Crete, Greece

Abstract

Modern data centers increasingly employ FPGA-based heterogeneous acceleration platforms as a result of their great potential for continued performance and energy efficiency. Today, FPGAs provide more hardware parallelism than is possible with GPUs or CPUs, while C-like programming environments shorten development time to nearly software cycles. In this work, we address the limitations and overheads of accessing and transferring data to accelerators over common CPU-accelerator interconnects such as PCIe. We present three different FPGA accelerator dispatching methods for streaming applications (e.g., multimedia, vision computing). The first uses zero-copy data transfers and on-chip scratchpad memory (SPM) for energy efficiency; the second also uses zero-copy transfers but shares copy engines among different accelerator instances and relies on local external memory. The third uses the processor’s memory management unit to acquire the physical addresses of user pages and performs scatter-gather data transfers with SPM. Even though all techniques exhibit advantages in terms of scalability and relieve the processor of control overheads by using integrated schedulers, the first method delivers the most energy-efficient acceleration for streaming applications.
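To make the third dispatching method more concrete, the following is a minimal, hypothetical Linux-driver sketch of how a user buffer could be resolved through the MMU, pinned, and exposed to an FPGA DMA engine over PCIe as a scatter-gather list. The helper name fpga_map_user_buffer and the error-handling layout are illustrative assumptions, not the driver interface used in the paper; only standard kernel primitives (get_user_pages_fast, sg_alloc_table_from_pages, dma_map_sg) are assumed.

#include <linux/kernel.h>
#include <linux/mm.h>
#include <linux/scatterlist.h>
#include <linux/dma-mapping.h>
#include <linux/slab.h>

/* Pin the pages backing a user buffer and build a DMA-mapped
 * scatter-gather table that an FPGA copy engine can consume. */
static int fpga_map_user_buffer(struct device *dev, unsigned long uaddr,
                                size_t len, struct sg_table *sgt)
{
        unsigned int n_pages = DIV_ROUND_UP(offset_in_page(uaddr) + len,
                                            PAGE_SIZE);
        struct page **pages;
        int pinned, ret;

        pages = kcalloc(n_pages, sizeof(*pages), GFP_KERNEL);
        if (!pages)
                return -ENOMEM;

        /* Resolve the physical pages through the MMU and pin them. */
        pinned = get_user_pages_fast(uaddr, n_pages, FOLL_WRITE, pages);
        if (pinned != (int)n_pages) {
                ret = -EFAULT;
                goto out_put;
        }

        /* Merge physically contiguous pages into scatter-gather entries. */
        ret = sg_alloc_table_from_pages(sgt, pages, n_pages,
                                        offset_in_page(uaddr), len,
                                        GFP_KERNEL);
        if (ret)
                goto out_put;

        /* Produce bus addresses the accelerator's DMA engine can use
         * directly, with no intermediate kernel bounce buffer (zero copy). */
        if (!dma_map_sg(dev, sgt->sgl, sgt->nents, DMA_BIDIRECTIONAL)) {
                sg_free_table(sgt);
                ret = -EIO;
                goto out_put;
        }

        kfree(pages);
        return 0;

out_put:
        while (pinned-- > 0)
                put_page(pages[pinned]);
        kfree(pages);
        return ret;
}

On completion the driver would unmap the table with dma_unmap_sg and release the pinned pages; the scratchpad-backed variants described in the abstract differ mainly in where the accelerator stages the streamed data, not in this host-side mapping step.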

Publisher

Association for Computing Machinery (ACM)

Subject

Hardware and Architecture, Information Systems, Software

Cited by 10 articles.

1. Flexible Updating of Internet of Things Computing Functions through Optimizing Dynamic Partial Reconfiguration; ACM Transactions on Embedded Computing Systems; 2024-03-18

2. Fair Resource Allocation in Virtualized O-RAN Platforms; Proceedings of the ACM on Measurement and Analysis of Computing Systems; 2024-02-16

3. Theoretical Validation and Hardware Implementation of Dynamic Adaptive Scheduling for Heterogeneous Systems on Chip; Journal of Low Power Electronics and Applications; 2023-10-17

4. Virtualizing a Post-Moore’s Law Analog Mesh Processor: The Case of a Photonic PDE Accelerator; ACM Transactions on Embedded Computing Systems; 2023-01-24

5. Portrait: A holistic computation and bandwidth balanced performance evaluation model for heterogeneous systems; Sustainable Computing: Informatics and Systems; 2022-09
