Affiliation:
1. Tsinghua University, China
2. University of Waterloo, Canada
Abstract
With the unique feature of fine-grained parallelism,
field-programmable gate arrays
(FPGAs) show great potential for streaming algorithm acceleration. However, the lack of a design framework, restrictions on FPGAs, and ineffective tools impede the utilization of FPGAs in practice. In this study, we provide a design paradigm to support streaming algorithm acceleration on FPGAs. We first propose an abstract model to describe streaming algorithms with homogeneous sub-functions (HSF) and stable data dependency (SDD), which we call the HSF-SDD model. Using this model, we then develop an FPGA framework, PE-Ring, that has the advantages of (1) fully exploiting algorithm parallelism to achieve high performance, (2) leveraging block RAM to serve large scale parameters, and (3) enabling flexible parameter adjustments. Based on the proposed model and framework, we finally implement a specific converter to generate the register-transfer level representation of the PE-Ring. Experimental results show that our method outperforms ordinary FPGA design tools by one to two orders of magnitude. Experiments also demonstrate the scalability of the PE-Ring.
Funder
973 project
Huawei Technologies Co. Ltd
National Natural Science Foundation of China
Joint fund of Equipment pre-Research and Ministry of Education
Publisher
Association for Computing Machinery (ACM)
Subject
Hardware and Architecture,Software
Reference31 articles.
1. Mergeable summaries
2. Transaction processing on confidential data using cipherbase
3. The DataPath system
4. A gentle tutorial of the EM algorithm and its application to parameter estimation for Gaussian mixture and hidden Markov models;Bilmes Jeff A.;Int. Comput. Sci. Inst.,1998
5. Querying and mining of time series data