Affiliation:
1. University of Illinois at Urbana-Champaign, Urbana, IL, USA
2. Intel Corporation, Juliette Lane Santa Clara, CA, USA
Abstract
Cache coherence is an integral part of shared-memory systems but is also widely considered to be one of the most complex parts of such systems. Much prior work has addressed this complexity and the verification techniques to prove the correctness of hardware coherence. Given the new multicore era with increasing number of cores, there is a renewed debate about whether the complexity of hardware coherence has been tamed or whether it should be abandoned in favor of software coherence. This article revisits the complexity of hardware cache coherence by verifying a publicly available, state-of-the-art implementation of the widely used MESI protocol, using the Murφ model checking tool. To our surprise, we found six bugs in this protocol, most of which were hard to analyze and took several days to fix. To compare the complexity, we also verified the recently proposed DeNovo protocol, which exploits disciplined software programming models. We found three relatively easy to fix bugs in this less mature protocol. After fixing these bugs, our verification experiments showed that, compared to DeNovo, MESI had 15X more reachable states leading to a 20X increase in verification (model checking) time. Although we were eventually successful in verifying the protocols, the tool required making several simplifying assumptions (e.g., two cores, one address). Our results have several implications: (1) they indicate that hardware coherence protocols remain complex; (2) they reinforce the need for protocol designers to embrace formal verification tools to demonstrate correctness of new protocols and extensions; (3) they reinforce the need for formal verification tools that are both scalable and usable by non-expert; and (4) they show that a system based on hardware-software co-design can offer a simpler approach for cache coherence, thus reducing the overall verification effort and allowing verification of more detailed models and protocol extensions that are otherwise limited by computing resources.
Funder
University of Illinois at Urbana-Champaign
Division of Computing and Communication Foundations
Defense Advanced Research Projects Agency
Semiconductor Research Corporation
Microelectronics Advanced Research Corporation
Qualcomm
Intel and Microsoft through the Universal Parallel Computing Research Center (UPCRC) at Illinois
Center for Future Architectures Research
Publisher
Association for Computing Machinery (ACM)
Subject
Hardware and Architecture,Information Systems,Software
Cited by
19 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. A local search approach to protocol verification;Theoretical Computer Science;2024-09
2. Determining the Minimum Number of Virtual Networks for Different Coherence Protocols;2024 ACM/IEEE 51st Annual International Symposium on Computer Architecture (ISCA);2024-06-29
3. Ditty: Directory-based Cache Coherence for Multicore Safety-critical Systems;2023 Design, Automation & Test in Europe Conference & Exhibition (DATE);2023-04
4. A Case for Fine-grain Coherence Specialization in Heterogeneous Systems;ACM Transactions on Architecture and Code Optimization;2022-08-22
5. QEI: Query Acceleration Can be Generic and Efficient in the Cloud;2021 IEEE International Symposium on High-Performance Computer Architecture (HPCA);2021-02