Synchronization state buffer-Reference-Cited by-同舟云学术

Synchronization state buffer

Published:2007-06-09 Issue:2 Volume:35 Page:35-45
ISSN:0163-5964
Container-title:ACM SIGARCH Computer Architecture News
language:en
Short-container-title:SIGARCH Comput. Archit. News

Author:

Zhu Weirong¹,Sreedhar Vugranam C²,Hu Ziang¹,Gao Guang R.¹

Affiliation:

1. University of Delaware, Newark, DE

2. IBM TJ Watson Research Center, Howthorne, NY

Abstract

Efficient fine-grain synchronization is extremely important to effectively harness the computational power of many-core architectures. However, designing and implementing finegrain synchronization in such architectures presents several challenges, including issues of synchronization induced overhead, storage cost, scalability, and the level of granularity to which synchronization is applicable. This paper proposes the Synchronization State Buffer ( SS B), a scalable architectural design for fine-grain synchronization that efficiently performs synchronizations between concurrent threads. The design of SSB is motivated by the following observation: at any instance during the parallel execution only a small fraction of memory locations are actively participating in synchronization. Based on this observation we present a fine-grain synchronization design that records and manages the states of frequently synchronized data using modest hardware support. We have implemented the SSB design in the context of the 160-core IBM Cyclops-64 architecture. Using detailed simulation, we present our experience for a set of benchmarks with different workload characteristics.

Publisher

Association for Computing Machinery (ACM)

Link

https://dl.acm.org/doi/pdf/10.1145/1273440.1250668

Reference44 articles.

1. HPC challenge benchmark. http://icl.cs.utk.edu/hpcc/. HPC challenge benchmark. http://icl.cs.utk.edu/hpcc/.

2. Meet Larrabee Intel's answer to a GPU. http://www.theinquirer.net/default.aspx?article=37548. Meet Larrabee Intel's answer to a GPU. http://www.theinquirer.net/default.aspx?article=37548.

3. Sparcle: an evolutionary processor design for large-scale multiprocessors

4. The Tera computer system

Cited by 33 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. DynAMO: Improving Parallelism Through Dynamic Placement of Atomic Memory Operations;Proceedings of the 50th Annual International Symposium on Computer Architecture;2023-06-17

2. Adaptive Contention Management for Fine-Grained Synchronization on Commodity GPUs;ACM Transactions on Architecture and Code Optimization;2022-09-16

3. Highly Parallel Multi-FPGA System Compilation from Sequential C/C++ Code in the AWS Cloud;ACM Transactions on Reconfigurable Technology and Systems;2022-08-08

4. Role of Big Data in Internet of Things Networks;Research Anthology on Big Data Analytics, Architectures, and Applications;2022

5. Hardware support for thread synchronisation in an experimental manycore system;International Journal of Grid and Utility Computing;2020