Efficient custom computing of fully-streamed lattice Boltzmann method on tightly-coupled FPGA cluster

Published: 2013-12-18 Issue: 5 Volume: 41 Page: 47-52
ISSN: 0163-5964
Container-title: ACM SIGARCH Computer Architecture News
Language: en
Short-container-title: SIGARCH Comput. Archit. News

Author:

Sano Kentaro¹, Kono Yoshiaki¹, Suzuki Hayato¹, Chiba Ryotaro¹, Ito Ryo¹, Ueno Tomohiro¹, Koizumi Kyo¹, Yamamoto Satoru¹

Affiliation:

1. Tohoku University, Sendai, Japan

Abstract

This paper presents the detailed design of a custom computing machine for fully-streamed lattice Boltzmann method (LBM) computation on multiple FPGAs, and evaluates its efficiency with a prototype implementation. We design a unit for completely streamed computation, including boundary treatment with a newly introduced cell attribute. Experimental results demonstrate that the proposed machine achieves high utilization of processing elements (PEs), 99% of the peak performance, for one and two FPGAs computing a large lattice. This is due to our fully-streamed design, which allows all arithmetic units to be efficiently utilized with a constant memory bandwidth, and to the architecture, which exploits a low-latency accelerator domain network (ADN) of a tightly-coupled FPGA cluster for scalable computation.

Publisher

Association for Computing Machinery (ACM)

Link

https://dl.acm.org/doi/pdf/10.1145/2641361.2641369

Cited by 7 articles.

1. FPGA-Dedicated Network vs. Server Network for Pipelined Computing with Multiple FPGAs;International Symposium on Highly-Efficient Accelerators and Reconfigurable Technologies;2022-06-09

2. BurstZ+: Eliminating The Communication Bottleneck of Scientific Computing Accelerators via Accelerated Compression;ACM Transactions on Reconfigurable Technology and Systems;2022-01-31

3. Zynq SoC based acceleration of the lattice Boltzmann method;Concurrency and Computation: Practice and Experience;2019-02-07

4. FPGA-based Stream Computing for High-Performance N-Body Simulation using Floating-Point DSP Blocks;Proceedings of the 8th International Symposium on Highly Efficient Accelerators and Reconfigurable Technologies;2017-06-07

5. Stream Computation of Shallow Water Equation Solver for FPGA-based 1D Tsunami Simulation;ACM SIGARCH Computer Architecture News;2016-04-22