Affiliation:
1. University of Delaware, Newark, DE
2. IBM TJ Watson Research Center, Howthorne, NY
Abstract
Efficient fine-grain synchronization is extremely important to effectively harness the computational power of many-core architectures. However, designing and implementing finegrain synchronization in such architectures presents several challenges, including issues of synchronization induced overhead, storage cost, scalability, and the level of granularity to which synchronization is applicable. This paper proposes the
Synchronization State Buffer
(
SS
B), a scalable architectural design for fine-grain synchronization that efficiently performs synchronizations between concurrent threads. The design of SSB is motivated by the following observation:
at any instance during the parallel execution only a small fraction of memory locations are actively participating in synchronization.
Based on this observation we present a fine-grain synchronization design that records and manages the states of frequently synchronized data using modest hardware support. We have implemented the SSB design in the context of the 160-core IBM Cyclops-64 architecture. Using detailed simulation, we present our experience for a set of benchmarks with different workload characteristics.
Publisher
Association for Computing Machinery (ACM)
Reference44 articles.
1. HPC challenge benchmark. http://icl.cs.utk.edu/hpcc/. HPC challenge benchmark. http://icl.cs.utk.edu/hpcc/.
2. Meet Larrabee Intel's answer to a GPU. http://www.theinquirer.net/default.aspx?article=37548. Meet Larrabee Intel's answer to a GPU. http://www.theinquirer.net/default.aspx?article=37548.
3. Sparcle: an evolutionary processor design for large-scale multiprocessors
4. The Tera computer system
Cited by
33 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献