Affiliation:
1. Massachusetts Institute of Technology
Abstract
As multicore architectures enter the mainstream, there is a pressing demand for high-level programming models that can effectively map to them. Stream programming offers an attractive way to expose coarse-grained parallelism, as streaming applications (image, video, DSP, etc.) are naturally represented by independent filters that communicate over explicit data channels.In this paper, we demonstrate an end-to-end stream compiler that attains robust multicore performance in the face of varying application characteristics. As benchmarks exhibit different amounts of task, data, and pipeline parallelism, we exploit all types of parallelism in a unified manner in order to achieve this generality. Our compiler, which maps from the StreamIt language to the 16-core Raw architecture, attains a 11.2x mean speedup over a single-core baseline, and a 1.84x speedup over our previous work.
Publisher
Association for Computing Machinery (ACM)
Reference41 articles.
1. Raza Microelectronics Inc. http://www.razamicroelectronics.com/products/xlr.htm.]] Raza Microelectronics Inc. http://www.razamicroelectronics.com/products/xlr.htm.]]
2. StreamIt Language Specification. http://cag.lcs.mit.edu/streamit/papers/streamit-lang-spec.pdf.]] StreamIt Language Specification. http://cag.lcs.mit.edu/streamit/papers/streamit-lang-spec.pdf.]]
3. Optimizing stream programs using linear state space analysis
4. Xbox 360 System Architecture
5. Partitioning and pipelining for performance-constrained hardware/software systems
Cited by
151 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. A computational framework based on the dynamic pipeline approach;Journal of Logical and Algebraic Methods in Programming;2024-06
2. BLQ: Light-Weight Locality-Aware Runtime for Blocking-Less Queuing;Proceedings of the 33rd ACM SIGPLAN International Conference on Compiler Construction;2024-02-17
3. Phloem: Automatic Acceleration of Irregular Applications with Fine-Grain Pipeline Parallelism;2023 IEEE International Symposium on High-Performance Computer Architecture (HPCA);2023-02
4. Hyperion;Proceedings of the 20th ACM Conference on Embedded Networked Sensor Systems;2022-11-06
5. Piper: Pipelining OpenMP Offloading Execution Through Compiler Optimization For Performance;2022 IEEE/ACM International Workshop on Performance, Portability and Productivity in HPC (P3HPC);2022-11