Affiliation:
1. University of Pennsylvania, Philadelphia
Abstract
How do we develop programs that are easy to express, easy to reason about, and able to achieve high performance on massively parallel machines? To address this problem, we introduce GraphStep, a domain-specific compute model that captures algorithms that act on static, irregular, sparse graphs. In GraphStep, algorithms are expressed directly without requiring the programmer to explicitly manage parallel synchronization, operation ordering, placement, or scheduling details. Problems in the sparse graph domain are usually highly concurrent and communicate along graph edges. Exposing concurrency and communication structure allows scheduling of parallel operations and management of communication that is necessary for performance on a spatial computer. We study the performance of a semantic network application, a shortest-path application, and a max-flow/min-cut application. We introduce a language syntax for GraphStep applications. The total speedup over sequential versions of the applications studied ranges from a factor of 19 to a factor of 15,000. Spatially-aware graph optimizations (e.g., node decomposition, placement and route scheduling) delivered speedups from 3 to 30 times over a spatially-oblivious mapping.
Publisher
Association for Computing Machinery (ACM)
Subject
Software,Computer Science (miscellaneous),Control and Systems Engineering
Reference42 articles.
1. Implementation of a portable nested data-parallel language
2. Brook Project. 2004. Brook project web page. http://brook.sourceforge.net. Brook Project. 2004. Brook project web page. http://brook.sourceforge.net.
3. Improved algorithms for hypergraph bipartitioning
Cited by
14 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Applications and Techniques for Fast Machine Learning in Science;Frontiers in Big Data;2022-04-12
2. Forebody shock control devices for drag and aero-heating reduction: A comprehensive survey with a practical perspective;Progress in Aerospace Sciences;2020-01
3. GraVF-M;ACM Transactions on Reconfigurable Technology and Systems;2019-11-27
4. An FPGA framework for edge-centric graph processing;Proceedings of the 15th ACM International Conference on Computing Frontiers;2018-05-08
5. Accelerating Graph Analytics on CPU-FPGA Heterogeneous Platform;2017 29th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD);2017-10