Deterministic Fault-Tolerant Distributed Computing in Linear Time and Communication

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

We develop deterministic algorithms for the problems of consensus, gossiping and checkpointing with nodes prone to failing. Distributed systems are modeled as synchronous complete networks. Failures are represented either as crashes or Byzantine faults with authentication. The algorithmic goal is to have both linear running time and linear amount of communication for as large an upper bound t on the number of faults as possible, with respect to the number of nodes n. For crash failures, these bounds of optimality are t = O(n / (log n)) for consensus and t = O(n / (log2 n)) for gossiping and checkpointing. For the model of Byzantine faults with authentication, we show how to reach consensus in both linear running time and communication for t = O(√n). We show how to implement the consensus algorithm for crash failures in the single-port model such as to preserve the range of t for which both the running time and communication are optimal. We prove a lower bound ω (t+log n) on the running time of algorithms for each of the considered problems.

Original languageEnglish (US)
Title of host publicationPODC 2023 - Proceedings of the 2023 ACM Symposium on Principles of Distributed Computing
PublisherAssociation for Computing Machinery
Pages344-354
Number of pages11
ISBN (Electronic)9798400701214
DOIs
StatePublished - Jun 19 2023
Event42nd ACM SIGACT-SIGOPS Symposium on Principles of Distributed Computing, PODC 2023 - Orlando, United States
Duration: Jun 19 2023Jun 23 2023

Publication series

NameProceedings of the Annual ACM Symposium on Principles of Distributed Computing

Conference

Conference42nd ACM SIGACT-SIGOPS Symposium on Principles of Distributed Computing, PODC 2023
Country/TerritoryUnited States
CityOrlando
Period6/19/236/23/23

Keywords

  • authentication
  • byzantine fault
  • checkpointing
  • consensus
  • crash failure
  • gossiping
  • lower bound
  • ramanujan graph

ASJC Scopus subject areas

  • Software
  • Hardware and Architecture
  • Computer Networks and Communications

Fingerprint

Dive into the research topics of 'Deterministic Fault-Tolerant Distributed Computing in Linear Time and Communication'. Together they form a unique fingerprint.

Cite this