Revisiting the Complexity of Hardware Cache Coherence and Some Implications

Author:

Komuravelli Rakesh1,Adve Sarita V.1,Chou Ching-Tsun2

Affiliation:

1. University of Illinois at Urbana-Champaign, Urbana, IL, USA

2. Intel Corporation, Juliette Lane Santa Clara, CA, USA

Abstract

Cache coherence is an integral part of shared-memory systems but is also widely considered to be one of the most complex parts of such systems. Much prior work has addressed this complexity and the verification techniques to prove the correctness of hardware coherence. Given the new multicore era with increasing number of cores, there is a renewed debate about whether the complexity of hardware coherence has been tamed or whether it should be abandoned in favor of software coherence. This article revisits the complexity of hardware cache coherence by verifying a publicly available, state-of-the-art implementation of the widely used MESI protocol, using the Murφ model checking tool. To our surprise, we found six bugs in this protocol, most of which were hard to analyze and took several days to fix. To compare the complexity, we also verified the recently proposed DeNovo protocol, which exploits disciplined software programming models. We found three relatively easy to fix bugs in this less mature protocol. After fixing these bugs, our verification experiments showed that, compared to DeNovo, MESI had 15X more reachable states leading to a 20X increase in verification (model checking) time. Although we were eventually successful in verifying the protocols, the tool required making several simplifying assumptions (e.g., two cores, one address). Our results have several implications: (1) they indicate that hardware coherence protocols remain complex; (2) they reinforce the need for protocol designers to embrace formal verification tools to demonstrate correctness of new protocols and extensions; (3) they reinforce the need for formal verification tools that are both scalable and usable by non-expert; and (4) they show that a system based on hardware-software co-design can offer a simpler approach for cache coherence, thus reducing the overall verification effort and allowing verification of more detailed models and protocol extensions that are otherwise limited by computing resources.

Funder

University of Illinois at Urbana-Champaign

Division of Computing and Communication Foundations

Defense Advanced Research Projects Agency

Semiconductor Research Corporation

Microelectronics Advanced Research Corporation

Qualcomm

Intel and Microsoft through the Universal Parallel Computing Research Center (UPCRC) at Illinois

Center for Future Architectures Research

Publisher

Association for Computing Machinery (ACM)

Subject

Hardware and Architecture,Information Systems,Software

Cited by 19 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. A local search approach to protocol verification;Theoretical Computer Science;2024-09

2. Determining the Minimum Number of Virtual Networks for Different Coherence Protocols;2024 ACM/IEEE 51st Annual International Symposium on Computer Architecture (ISCA);2024-06-29

3. Ditty: Directory-based Cache Coherence for Multicore Safety-critical Systems;2023 Design, Automation & Test in Europe Conference & Exhibition (DATE);2023-04

4. A Case for Fine-grain Coherence Specialization in Heterogeneous Systems;ACM Transactions on Architecture and Code Optimization;2022-08-22

5. QEI: Query Acceleration Can be Generic and Efficient in the Cloud;2021 IEEE International Symposium on High-Performance Computer Architecture (HPCA);2021-02

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3