Affiliation:
1. Cornell Univ., Ithaca, NY
Abstract
The design and correctness of a communication facility for a distributed computer system are reported on. The facility provides support for
fault-tolerant process groups
in the form of a family of reliable multicast protocols that can be used in both local- and wide-area networks. These protocols attain high levels of concurrency, while respecting application-specific delivery ordering constraints, and have varying cost and performance that depend on the degree of ordering desired. In particular, a protocol that enforces causal delivery orderings is introduced and shown to be a valuable alternative to conventional asynchronous communication protocols. The facility also ensures that the processes belonging to a fault-tolerant process group will observe consistent orderings of events affecting the group as a whole, including process failures, recoveries, migration, and dynamic changes to group properties like member rankings. A review of several uses for the protocols in the ISIS system, which supports fault-tolerant resilient objects and bulletin boards, illustrates the significant simplification of higher level algorithms made possible by our approach.
Publisher
Association for Computing Machinery (ACM)
Cited by
445 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Detecting causality in the presence of Byzantine processes: The case of synchronous systems;Information and Computation;2024-12
2. Approaches to Conflict-free Replicated Data Types;ACM Computing Surveys;2024-09-09
3. Decentagram: Highly-Available Decentralized Publish/Subscribe Systems;2024 54th Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN);2024-06-24
4. Invited Paper: Causal Mutual Byzantine Broadcast;Proceedings of the 2024 Workshop on Advanced Tools, Programming Languages, and PLatforms for Implementing and Evaluating algorithms for Distributed systems;2024-06-17
5. An in-depth and insightful exploration of failure detection in distributed systems;Computer Networks;2024-06