An integrated pseudo-associativity and relaxed-order approach to hardware transactional memory
-
Published:2013-01
Issue:4
Volume:9
Page:1-26
-
ISSN:1544-3566
-
Container-title:ACM Transactions on Architecture and Code Optimization
-
language:en
-
Short-container-title:ACM Trans. Archit. Code Optim.
Author:
Yan Zhichao1,
Jiang Hong2,
Tan Yujuan3,
Feng Dan1
Affiliation:
1. Huazhong University of Science and Technology, China
2. University of Nebraska - Lincoln
3. Chongqing University, China
Abstract
Our experimental study and analysis reveal that the bottlenecks of existing hardware transactional memory systems are largely rooted in the extra data movements in version management and in the inefficient scheduling of conflicting transactions in conflict management, particularly in the presence of high-contention and coarse-grained applications. In order to address this problem, we propose an integrated Pseudo-Associativity and Relaxed-Order approach to hardware Transactional Memory, called PARO-TM. It exploits the extra pseudo-associative space in the data cache to hold the new value of each transactional modification, and maintains the mappings between the old and new versions via an implicit pseudo-associative hash algorithm (i.e., by inverting the specific bit of the SET index). PARO-TM can branch out the speculative version from the old version upon each transactional modification on demand without a dedicated hardware component to hold the uncommitted data. This means that it is able to automatically access the proper version upon the transaction's commit or abort. Moreover, PARO-TM augments multi-version support in a chained directory to schedule conflicting transactions in a relaxed-order manner to further reduce their overheads. We compare PARO-TM with the state-of-the-art LogTM-SE, TCC, DynTM, and SUV-TM systems and find that PARO-TM consistently outperforms these four representative HTMs. This performance advantage of PARO-TM is far more pronounced under the high-contention and coarse-grained applications in the STAMP benchmark suite, for which PARO-TM is motivated and designed.
Funder
Ministry of Science and Technology of the People's Republic of China
National Natural Science Foundation of China
Division of Information and Intelligent Systems
Central Universities Fundamental Research Foundation
Division of Computer and Network Systems
Division of Computing and Communication Foundations
Publisher
Association for Computing Machinery (ACM)
Subject
Hardware and Architecture,Information Systems,Software
Reference58 articles.
1. Adl-Tabatbai A.-R. Shpeisman T. and Gottsclich J. 2011. Draft specification of transactional language constructs for c++. http://www.open-std.org/Jtc1/sc22/wg14/www/docs/n1613.pdf. Adl-Tabatbai A.-R. Shpeisman T. and Gottsclich J. 2011. Draft specification of transactional language constructs for c++. http://www.open-std.org/Jtc1/sc22/wg14/www/docs/n1613.pdf.
2. Column-associative caches
3. Unbounded Transactional Memory
4. Lee-TM: A Non-trivial Benchmark Suite for Transactional Memory
5. Using a Reconfigurable L1 Data Cache for Efficient Version Management in Hardware Transactional Memory
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. A priority scheduling for TM pathologies;The Journal of Supercomputing;2014-12-23