Affiliation:
1. MIT CSAIL, Cambridge, MA
2. Harvard University, Cambridge, MA
Abstract
What opportunities for multicore scalability are latent in software interfaces, such as system call APIs? Can scalability challenges and opportunities be identified even before any implementation exists, simply by considering interface specifications? To answer these questions, we introduce the scalable commutativity rule:
whenever interface operations commute, they can be implemented in a way that scales.
This rule is useful throughout the development process for scalable multicore software, from the interface design through implementation, testing, and evaluation.
This article formalizes the scalable commutativity rule. This requires defining a novel form of commutativity,
SIM commutativity
, that lets the rule apply even to complex and highly stateful software interfaces.
We also introduce a suite of software development tools based on the rule. Our
Commuter
tool accepts high-level interface models, generates tests of interface operations that commute and hence could scale, and uses these tests to systematically evaluate the scalability of implementations. We apply
Commuter
to a model of 18 POSIX file and virtual memory system operations. Using the resulting 26,238 scalability tests,
Commuter
highlights Linux kernel problems previously observed to limit application scalability and identifies previously unknown bottlenecks that may be triggered by future workloads or hardware.
Finally, we apply the scalable commutativity rule and
Commuter
to the design and implementation sv6, a new POSIX-like operating system. sv6’s novel file and virtual memory system designs enable it to scale for 99% of the tests generated by
Commuter
. These results translate to linear scalability on an 80-core x86 machine for applications built on sv6’s commutative operations.
Funder
VMware
National Science Foundation
Quanta Computer
Microsoft Research
Alfred P. Sloan Foundation
Google
Publisher
Association for Computing Machinery (ACM)
Cited by
41 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. TCSA: Efficient Localization of Busy-Wait Synchronization Bugs for Latency-Critical Applications;IEEE Transactions on Parallel and Distributed Systems;2024-02
2. Diciclo: Flexible User-level Services for Efficient Multitenant Isolation;ACM Transactions on Computer Systems;2023-12-30
3. Beyond isolation: OS verification as a foundation for correct applications;Proceedings of the 19th Workshop on Hot Topics in Operating Systems;2023-06-22
4. CFS: Scaling Metadata Service for Distributed File System via Pruned Scope of Critical Sections;Proceedings of the Eighteenth European Conference on Computer Systems;2023-05-08
5. In-Situ Concolic Testing of JavaScript;2023 IEEE International Conference on Software Analysis, Evolution and Reengineering (SANER);2023-03