14

1 Dealing with Byzantine Faults CS 686 Final Project brought to you by Chris Sosa

Handling Byzantine Faults

Download PPT Report

Upload
awesomesos
View
938
Download
0

Embed Size (px)

DESCRIPTION

My presentation on handling byzantine faults in distributed systems given for my graduate dependability course

Citation preview

Page 1: Handling Byzantine Faults

1

Dealing with Byzantine Faults

CS 686 Final Projectbrought to you by Chris Sosa

Page 2: Handling Byzantine Faults

2

Overview

Motivation in Dependable Systems

Common Types of Byzantine Faults

Solutions in Real Systems

Page 3: Handling Byzantine Faults

3

The Myths Hardware cannot be

“traitorous”! Anthropomorphic model Any system with

consensus is susceptible It’s never happened

before Often misclassified Legionnaire's Disease

Page 4: Handling Byzantine Faults

4

The Awful Truth Time-Triggered Architecture

Radioactive Fault injection to one node

Messed up timing protocol (SOS) Formed Cliques until system failed

Quad Redundant Control System No message exchange Lots of redundancy One fault propagated to look like

many

Professor Knight’s Computer

Page 5: Handling Byzantine Faults

5

Trends in Dependable Systems

1. Device Physics• Smaller and faster not always

better• Cosmic Rays, etc.

2. Movement to Distributed Topologies

3. Usage of Commercial off-the-shelf (COTS) Technology

Page 6: Handling Byzantine Faults

6

Common Types of Observed Faults1. Value

• Issues related to digital values being the extreme of analog

• Propagation2. Temporal

• Different observations at same time• Synchronization doesn’t help very much

3. Value + Temporal

Page 7: Handling Byzantine Faults

7

Solutions?

Page 8: Handling Byzantine Faults

8

Solutions (1) Full Exchange

Uses classical Byzantine agreement SPIDER – bus (ROBUS) design

Page 9: Handling Byzantine Faults

9

Solutions (2) Hierarchical

Uses hierarchy of different fault tolerant techniques including Byzantine Agreement

Seen with Fail-Stop processors SAFEbus

Communication backplane for Boeing 777 Uses two buses which are themselves dual

redundant –different forms of parity detect errors

Uses self-checking pairs on top of buses

Page 10: Handling Byzantine Faults

10

Solutions (3)

Filtering Targets propagation of Byzantine faults Tries to either

Mask faults by forcing output to some straight value (removes value-type faults)

Segments system into Fault Containment Regions (FCR’s) where we put protections to stop propagation

Page 11: Handling Byzantine Faults

11

Ignorance is not Bliss

Can invalidate failure model Propagation of one fault can be

disastrous No amount of redundancy can help

Large Economic Factor Possible costs of recall and redeployment

Page 12: Handling Byzantine Faults

12

Conclusions

Byzantine faults are real! Problems with Ignoring them No amount of Redundancy can

tolerate them w/out message exchange

Three categories of solutions to deal with them

Page 13: Handling Byzantine Faults

13

Questions?

Page 14: Handling Byzantine Faults

14

BGP Quick Review Algorithm is expensive:

Each processor has to broadcast its values for many rounds

Chooses majority value Requires n > 3f where f is # of failures

and n is the # of processors With signed messages

Can tolerate more failures Still expensive

Abhainn Ash’s Byzantine Candy Cane Captive Inverted Round Celtic Byzantine BoxBox (2 colors) Byzantine Byzantine (2 colors) Byzee Bugs Cloud Cover Celtic

Abhainn Ash’s Byzantine Candy Cane Captive Inverted Round Celtic Byzantine BoxBox (2 colors) Byzantine Byzantine (2 colors) Byzee Bugs Cloud Cover Celtic

Documents

Tolerating Byzantine Faults in Database Systems using Commit Barrier Scheduling

Tolerating Byzantine Faults in Database Systems using Commit Barrier Scheduling

Documents

istanbul byzantine circular - The Byzantine City of Amorium

istanbul byzantine circular - The Byzantine City of Amorium

Documents

Byzantine Art in Post-Byzantine Southern Italy

Byzantine Art in Post-Byzantine Southern Italy

Documents

Resource-efficient Byzantine Fault Tolerance · dling such arbitrary faults in a generic fashion requires Byzantine fault tolerance (BFT). In the past, BFT systems have mainly been

Resource-efficient Byzantine Fault Tolerance · dling such arbitrary faults in a generic fashion requires Byzantine fault tolerance (BFT). In the past, BFT systems have mainly been

Documents

Byzantine Ordered Consensus without Byzantine Oligarchy...Byzantine OrderedConsensus without Byzantine Oligarchy Yunhao Zhang†, Srinath Setty *, Qi Chen*, LidongZhouand Lorenzo Alvisi†

Byzantine Ordered Consensus without Byzantine Oligarchy...Byzantine OrderedConsensus without Byzantine Oligarchy Yunhao Zhang†, Srinath Setty *, Qi Chen*, LidongZhouand Lorenzo Alvisi†

Documents

ZZ: Cheap Practical BFT using Virtualizationlass.cs.umass.edu/projects/virtualization/files/zz.old.pdf · approach to tolerate byzantine faults, demonstrating its feasibility through

ZZ: Cheap Practical BFT using Virtualizationlass.cs.umass.edu/projects/virtualization/files/zz.old.pdf · approach to tolerate byzantine faults, demonstrating its feasibility through

Documents

Making Byzantine Fault Tolerant Systems Tolerate Byzantine Faults

Making Byzantine Fault Tolerant Systems Tolerate Byzantine Faults

Documents

Byzantine Lutheranism? Byzantine Lutheranism! - … Byzantine Lutheranism? Byzantine Lutheranism! The Divine Liturgy of the Ukrainian Evangelical Church of the Augsburg Confession

Byzantine Lutheranism? Byzantine Lutheranism! - … Byzantine Lutheranism? Byzantine Lutheranism! The Divine Liturgy of the Ukrainian Evangelical Church of the Augsburg Confession

Documents

Application-Level Fault Tolerance for MPI Programstullio/SCD/2006/Materiale/Distributed_Snapshot.pdf · – Byzantine: arbitrary failures • Our focus: – Fail-Stop Faults ... •

Application-Level Fault Tolerance for MPI Programstullio/SCD/2006/Materiale/Distributed_Snapshot.pdf · – Byzantine: arbitrary failures • Our focus: – Fail-Stop Faults ... •

Documents

CheapBFT: Resource-efficient Byzantine Fault Tolerance · Handling such arbitrary faults in a generic fashion re-quires Byzantine fault tolerance (BFT). ... different workloads and

CheapBFT: Resource-efficient Byzantine Fault Tolerance · Handling such arbitrary faults in a generic fashion re-quires Byzantine fault tolerance (BFT). ... different workloads and

Documents

Modeling faults - cs.cornell.edu · failure models Crash Arbitrary failures with message authentication Arbitrary (Byzantine) failures Send Omission General Omission Receive Omission

Modeling faults - cs.cornell.edu · failure models Crash Arbitrary failures with message authentication Arbitrary (Byzantine) failures Send Omission General Omission Receive Omission

Documents

Local Tolerance to Unbounded Byzantine Faults

Local Tolerance to Unbounded Byzantine Faults

Documents

1 The Case for Byzantine Fault Detection. 2 Challenge: Byzantine faults Distributed systems are subject to a variety of failures and attacks Hacker break-in

1 The Case for Byzantine Fault Detection. 2 Challenge: Byzantine faults Distributed systems are subject to a variety of failures and attacks Hacker break-in

Documents

Practical Byzantine Fault Tolerance (The Byzantine Generals Problem)

Practical Byzantine Fault Tolerance (The Byzantine Generals Problem)

Documents

Handling Common Faults and Alarms on the RTN Network-20110711-A

Handling Common Faults and Alarms on the RTN Network-20110711-A

Documents

The Byzantine Empire The Byzantine Empire Ch. 21 The Byzantine Empire

The Byzantine Empire The Byzantine Empire Ch. 21 The Byzantine Empire

Documents

Rise of Byzantine Empire Justinian Byzantine Ruler

Rise of Byzantine Empire Justinian Byzantine Ruler

Documents

Bridging the Gap: Byzantine Faults and Self-stabilization

Bridging the Gap: Byzantine Faults and Self-stabilization

Documents

Yarnell Research House Data for Air Handling Faults · Yarnell Research House Data for Air Handling Faults 1. Recorded Data 1. Zip File Yarnell 7_15.zip contains: 1. High speed sample

Yarnell Research House Data for Air Handling Faults · Yarnell Research House Data for Air Handling Faults 1. Recorded Data 1. Zip File Yarnell 7_15.zip contains: 1. High speed sample

Documents

Secure Network Provenance - NetDB@Penn · [Software]: Operating Systems—Reliability General Terms Algorithms, Design, Reliability, Security Keywords Accountability, Byzantine faults,

Secure Network Provenance - NetDB@Penn · [Software]: Operating Systems—Reliability General Terms Algorithms, Design, Reliability, Security Keywords Accountability, Byzantine faults,

Documents

BYZANTINE & POST-BYZANTINE ART: CROSSING BORDERS

BYZANTINE & POST-BYZANTINE ART: CROSSING BORDERS

Documents

Handling Software Faults with Redundancysoftware.imdea.org/~alessandra.gorla/papers/... · to tolerate development faults as well as manufacturing faults in circuits [5{7]. An analogous

Handling Software Faults with Redundancysoftware.imdea.org/~alessandra.gorla/papers/... · to tolerate development faults as well as manufacturing faults in circuits [5{7]. An analogous

Documents

The Byzantine Generals Problem - Cornell University...Two Generals Problem Reaching Agreement in the Presence of Faults The Byzantine Generals Problem Talk Overview Byzantine Generals

The Byzantine Generals Problem - Cornell University...Two Generals Problem Reaching Agreement in the Presence of Faults The Byzantine Generals Problem Talk Overview Byzantine Generals

Documents

Recognizing faults Practice with thrust faults and normal faults Practice with thrust faults and normal faults

Recognizing faults Practice with thrust faults and normal faults Practice with thrust faults and normal faults

Documents

Anish Arora Ohio State University Mikhail Nesterenko Kent State University Local Tolerance to Unbounded Byzantine Faults

Anish Arora Ohio State University Mikhail Nesterenko Kent State University Local Tolerance to Unbounded Byzantine Faults

Documents

Windows Password Handling and Security Faults Nate Prosser Lenny Calabrese Travis Stitt

Windows Password Handling and Security Faults Nate Prosser Lenny Calabrese Travis Stitt

Documents

Byzantine Ordered Consensus without Byzantine Oligarchy...Byzantine Ordered Consensus without Byzantine Oligarchy Yunhao Zhang,†Srinath Setty, ⋆Qi Chen, Lidong Zhou,⋆and Lorenzo

Byzantine Ordered Consensus without Byzantine Oligarchy...Byzantine Ordered Consensus without Byzantine Oligarchy Yunhao Zhang,†Srinath Setty, ⋆Qi Chen, Lidong Zhou,⋆and Lorenzo

Documents

Practical Byzantine Fault Tolerancecastro/thesis.pdfByzantine faults such as software bugs, operator mistakes, and malicious attacks are the major cause of service interruptions. This

Practical Byzantine Fault Tolerancecastro/thesis.pdfByzantine faults such as software bugs, operator mistakes, and malicious attacks are the major cause of service interruptions. This

Documents

Dynamic Adaptation of Byzantine Fault Tolerant Protocolsler/reports/carloscarvalhomsc.pdf · occur and the network is stable. On the other hand, when faults frequently occur, Aardvark

Dynamic Adaptation of Byzantine Fault Tolerant Protocolsler/reports/carloscarvalhomsc.pdf · occur and the network is stable. On the other hand, when faults frequently occur, Aardvark

Documents