Abstract
This paper presents Nezha, a high-performance consensus protocol that cloud tenants can deploy without support from cloud providers. Nezha bridges the gap between protocols such as Multi-Paxos and Raft, which can be readily deployed, and protocols such as NOPaxos and Speculative Paxos, which provide better performance but require technologies such as programmable switches and in-network prioritization that cloud tenants cannot access.
Nezha uses a new multicast primitive called deadline-ordered multicast (DOM). DOM uses high-accuracy software clock synchronization to synchronize sender and receiver clocks. Senders tag messages with deadlines in synchronized time; receivers process messages in deadline order, on or after their deadline.
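The receiver-side rule described above (hold each message until its deadline, then release in deadline order) can be illustrated with a small sketch. This is not the authors' implementation: the names `ManualClock`, `Sender`, `Receiver`, and `latency_bound` are hypothetical, the clock is a hand-advanced stand-in for synchronized time, and the tie-break on sender id and sequence number is an assumption added so that equal deadlines sort deterministically. Messages that arrive after their deadline are not treated specially here.

```python
import heapq
import itertools
from dataclasses import dataclass, field


class ManualClock:
    """Stand-in for a synchronized clock; advanced by hand in this sketch."""

    def __init__(self):
        self.now = 0.0

    def __call__(self):
        return self.now


@dataclass(order=True)
class Message:
    deadline: float                       # deadline in synchronized time
    sender_id: str                        # assumed tie-breaker for equal deadlines
    seq: int                              # per-sender sequence number
    payload: str = field(compare=False)   # not part of the ordering key


class Sender:
    def __init__(self, sender_id, clock, latency_bound):
        self.sender_id = sender_id
        self.clock = clock
        self.latency_bound = latency_bound  # hypothetical per-sender delay estimate
        self._seq = itertools.count()

    def tag(self, payload):
        # Deadline = current synchronized time plus the assumed latency bound.
        deadline = self.clock() + self.latency_bound
        return Message(deadline, self.sender_id, next(self._seq), payload)


class Receiver:
    def __init__(self, clock):
        self.clock = clock
        self._pending = []  # min-heap ordered by (deadline, sender_id, seq)

    def receive(self, msg):
        heapq.heappush(self._pending, msg)

    def release(self):
        """Return messages whose deadline has passed, in deadline order."""
        now = self.clock()
        ready = []
        while self._pending and self._pending[0].deadline <= now:
            ready.append(heapq.heappop(self._pending))
        return ready


clock = ManualClock()
a = Sender("a", clock, latency_bound=5.0)
b = Sender("b", clock, latency_bound=3.0)
rx = Receiver(clock)

m1 = a.tag("x")      # tagged at t=0.0, deadline 5.0
clock.now = 1.0
m2 = b.tag("y")      # tagged at t=1.0, deadline 4.0
rx.receive(m1)       # arrives first, but has the later deadline
rx.receive(m2)

clock.now = 4.5
first = rx.release()   # only "y" has reached its deadline
clock.now = 5.0
second = rx.release()  # now "x" is released
```

Because every receiver orders by the same deadline key rather than by arrival order, two receivers that see `m1` and `m2` in different arrival orders still release them in the same sequence in this sketch.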
We compare Nezha with Multi-Paxos, Fast Paxos, Raft, (optimized) NOPaxos, and two recent protocols that use synchronized clocks, Domino and TOQ-EPaxos. In throughput, Nezha outperforms all baselines by a median of 5.4X (range: 1.9–20.9X). In latency, Nezha outperforms five baselines by a median of 2.3X (range: 1.3–4.0X), with one exception: in one test, it sacrifices 33% in latency compared with our optimized NOPaxos. We also prototype two applications on top of Nezha, a key-value store and a fair-access stock exchange, to show that Nezha only modestly reduces their performance relative to an unreplicated system.
Publisher
Association for Computing Machinery (ACM)
Subject
General Earth and Planetary Sciences; Water Science and Technology; Geography, Planning and Development
Cited by
3 articles.
1. Low-Latency Consensus with Weak-Leader Using Timestamp by Synchronized Clocks;Lecture Notes in Computer Science;2024
2. Improving Raft Performance with Bulk Transfers;2023 Eleventh International Symposium on Computing and Networking Workshops (CANDARW);2023-11-27
3. Performance evaluation of portable time synchronization method using eBPF;Concurrency and Computation: Practice and Experience;2023-11-16