Affiliation:
1. Department of Computer Science, ETH Zurich, Zurich, Switzerland
Abstract
We present a hardware generator for computations with regular structure including the fast Fourier transform (FFT), sorting networks, and others. The input of the generator is a high-level description of the algorithm; the output is a token-based, synchronized design in the form of RTL-Verilog. Building on prior work, the generator uses several layers of domain-specific languages (DSLs) to represent and optimize at different levels of abstraction to produce a RAM- and area-efficient hardware implementation. Two of these layers and DSLs are novel. The first one allows the use and domain-specific optimization of state-of-the-art streaming permutations. The second DSL enables the automatic pipelining of a streaming hardware dataflow and the synchronization of its data-independent control signals. The generator including the DSLs are implemented in Scala, leveraging its type system, and uses concepts from lightweight modular staging (LMS) to handle the constraints of streaming hardware. Particularly, these concepts offer genericity over hardware number representation, including seamless switching between fixed-point arithmetic and FloPoCo generated IEEE floating-point operators, while ensuring type-safety. We show benchmarks of generated FFTs, sorting networks, and Walsh-Hadamard transforms that outperform prior generators.
Publisher
Association for Computing Machinery (ACM)
Reference53 articles.
1. Jacques Hadamard. 1893. Résolution d’une question relative aux déterminants. Bulletin des sciences mathématiques 17 (1893) 240--246. Jacques Hadamard. 1893. Résolution d’une question relative aux déterminants. Bulletin des sciences mathématiques 17 (1893) 240--246.
2. Architecture-oriented regular algorithms for discrete sine and cosine transforms
3. Sorting networks and their applications
4. Parallel Processing with the Perfect Shuffle
5. Váaclav Edvard Beneš. 1965. Mathematical Theory of Connecting Networks and Telephone Traffic. Academic Press. Váaclav Edvard Beneš. 1965. Mathematical Theory of Connecting Networks and Telephone Traffic. Academic Press.
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. FPT: A Fixed-Point Accelerator for Torus Fully Homomorphic Encryption;Proceedings of the 2023 ACM SIGSAC Conference on Computer and Communications Security;2023-11-15
2. Optimizing data reshaping operations in functional IRs for high-level synthesis;Proceedings of the 23rd ACM SIGPLAN/SIGBED International Conference on Languages, Compilers, and Tools for Embedded Systems;2022-06-14
3. Memory-Aware Functional IR for Higher-Level Synthesis of Accelerators;ACM Transactions on Architecture and Code Optimization;2022-01-31