layerxcom / safety-oracle Goto Github PK

License: MIT License

safety-oracle's Introduction

Safety Oracle

This document is a comprehensive summary of safety oracles, finality detection mechanisms of CBC Casper.
(See this slide for a general introduction of CBC Casper and this post about finality detection in CBC Casper.)
We introduce various existing proposals on safety oracles, describe the algorithms, and compare them. The goal of this project is here.

Definitions

t: Byzantine (or equivocation) fault tolerance.

CAN_ESTIMATE: The candidate estimate.

ADV_ESTIMATE: The estimate that the adversary wants to finalize.

MessageDAG

MessageDAG is a DAG (directed acyclic graph) of messages where edges are directed from a message to the messages in its justification.

Message types

Example

Notations

V: The number of validators.

M: The number of messages(vertices) in the MessageDAG.

J: The number of edges in the MessageDAG.

In practice, we can assume that V <= M <= J <= MV.

Lobbying Graph

We construct a lobbying graph G(V, E) as follows.

Let V be a set of validators that estimates CAN_ESTIMATE in their latest message and G be a directed graph with a set of vertices V.
An edge is directed from v1 to v2 that satisfies the following conditions.
- v1 ≠ v2
- The justification of the latest message of v1 includes a message of v2
- The justification of the latest message of v2 includes a message of v1
- The latest message of v2 in the justification of the latest message of v1 does not conflict with CAN_ESTIMATE
- The latest message of v1 in the justification of the latest message of v2 does not conflict with CAN_ESTIMATE
- No message conflicts with CAN_ESTIMATE among the messages of v2 that v1 have not seen yet
- No message conflicts with CAN_ESTIMATE among the messages of v1 that v2 have not seen yet

Example

The lobbying graph constructed from the above MessageDAG is:

Complexity

	Construct	Update for a new message
Time	O(V^2 + VM)	O(V)
Space	O(V^2)	-

In 2, it requires O(1) time on average to check if the justification of a message includes another validator using a hash table. For each validator, it requires O(M) to get the latest messages of the other validators (to check if the latest message conflicts with CAN_ESTIMATE). Therefore, the overall complexity is O(VM).

However, if you keep the lobbying graph and update every time you receive a message, this process can be improved. It only requires O(V) to update the graph in receiving a message because the number of newly connected edges in the MessageDAG is at most V for a message.

The space complexity is O(V^2) because E is O(V^2) for any graph. Of course, the space complexity of the MessageDAG is O(J). However, a validator must always have it, so we do not consider it's space in safety oracles.

Clique Oracle

The clique oracle is a family of algorithms to find a clique of validators.

2-round Clique Oracle

This oracle is the most naive clique oracle which tries to find a maximal weight clique.

Algorithm

Construct the lobbying graph or get it.
Convert the lobbying graph to an undirected graph by connecting vertices (validators) connected bidirectionally.
Find the maximum weighted clique C of G in exponential time.
Byzantine fault tolerance is t = ceil(W(C) - W(V)/2) - 1. (q > n/2 + t)

See: https://en.wikipedia.org/wiki/Clique_problem#Finding_maximum_cliques_in_arbitrary_graphs

Metrics

	From scratch	For a new message
Time complexity	exponential	exponential
Space complexity	-	-

Finding a clique requires O*(2^V) time. Even the fastest algorithm requires O*(1.1888^V). ( O*(f(k)) = O(f(k) poly(x)) )

Why q > n/2 + t?

q > n/2 + t, so t < q - n/2.

In the above case, n = 8, q = 7, t < 7 - 8/2 = 3.

This result means that up to t-1=2 equivocation failures can be tolerated.

2 equivocating validators:

3 equivocating validators:

Therefore, the clique oracle fault tolerant threshold formula is not q > n/2 + t/2.

When satisfying the formula q > n/2 + t/2, t < 2q - n = 14 - 8 = 6.

Turán Oracle

Turán theorem

This theorem gives a lower bound on the size of a clique in graphs. If a graph has many edges, it contains a large clique.

Let G be any graph with n vertices, such that G is K_{r+1}-free graph that does not contain (r+1)-vertex clique.

The upper bound of the number of edges is

Let E be the set of edges in G.

r is a lower bound on the size of a clique in graphs with n vertices and |E| edges.

Algorithm

Construct the lobbying graph or get it.
Convert the lobbying graph to an undirected graph by connecting edges (validators) connected bidirectionally.
Calculate the minimum size of the maximum weighted clique using the above formula in O(1) time.
Calculate the maximum weighted clique C.
t = ceil(W(C) - W(V)/2) - 1. (q > n/2 + t)

Metrics

	From scratch	For a new message
Time complexity	O(V^2 + VM)	O(1)
Space complexity	O(V^2)	-

Finality Inspector

Simple Inspector

The Simple Inspector is a generalization of 2-round clique oracle.

Algorithm

Construct the lobbying graph G or get it.
Let q > n/2 + t.
Compute outdegree of vertices in G.
C = V
Look for the vertice with outdegree of q or less in C, remove it from C, and update the outdegree counters.
Repeat 5.
If W(C) >= q, the property is considered finalized.

Metrics

	From scratch	For a new message
Time complexity	O(V^2 + VM)	O(V^2)
Space complexity	O(V^2)	-

Inspector

Algorithm

See: https://hackingresear.ch/cbc-inspector/

For some q (<= V), the algorithm needs to find the maximum t.

This image is the contour plot about t where n = 100.

Computing the levels takes V times at worst, and the computation runs in O(J) time.

Therefore the total time complexity is O(V * V * J) = O(V^2J).

Metrics

	From scratch	For a new message
Time complexity	O(V^2J)	O(V^2J)
Space complexity	O(J)	-

Adversary Oracle

Adversary oracles is a family of algorithms based on the simulation of an adversary.

Adversary Oracle (straightforward)

Ref: ethereum/cbc-casper

This oracle is a simple simulation-based algorithm.

Algorithm

Construct the lobbying graph or get it.
From view, we can see that there are validators from which the estimate of the latest messages is CAN_ESTIMATE or ADV_ESTIMATE or unknown.

If the estimate is unknown, we assume that it is ADV_ESTIMATE in O(V^2) time.

 C = set()
 for v in V:
     if v in view.latest_messages and view.latest_messages[v].estimate == CAN_ESTIMATE:
         C.add(v)

Build viewables with the following pseudocode in O(V^2) time.

 for v1 in V:
     for v2 in V:
         justification = view.latest_messages[v1].justification
         if v2 not in justification:
             viewables[v1][v2] = ADV_ESTIMATE
         elif (there is a v2 message that estimate ADV_ESTIMATE and have seen from v1):
             viewables[v1][v2] = ADV_ESTIMATE
         else:
             viewables[v1][v2] = (message estimate that the hash is justification[v2])

Assume that all validator estimates that may change from CAN_ESTIMATE to ADV_ESTIMATE are ADV_ESTIMATE.

 progress_mode = True
 while progress_mode:
     progress_mode = False

     to_remove = set()
     fault_tolerance = -1
     for v in C:
         can_weight = sum(v2.weight for v2 in viewables[v] if viewables[v2] == CAN_ESTIMATE and v2 in C))
         adv_weight = sum(v2.weight for v2 in viewables[v] if viewables[v2] == ADV_ESTIMATE or v2 not in C))

         if can_weight <= adv_weight:
             to_remove.add(v)
             progress_mode = True
         else:
             if fault_tolerance == -1:
                 fault_tolerance = can_weight - adv_weight
             fault_tolerance = ceil(min(fault_tolerance, can_weight - adv_weight) / 2) - 1
         
     C.difference_update(to_remove)
 
 if (total weight of CAN_ESTIMATE) > (total weight of ADV_ESTIMATE):
     the property is finalized

If (total weight of CAN_ESTIMATE) > (total weight of ADV_ESTIMATE) is finally satisfied, the property is finalized.
t = ceil(min_{v in C}{(can_weight of v) - (adv_weight of v)} / 2) - 1.

N.B. The original ethereum/casper's fault tolerance t is the minimum validator weight in C, but we think that it is min{can_weight - adv_weight}/2

Metrics

	From scratch	For a new message
Time complexity	O(V^3 + VM)	O(V^3)
Space complexity	O(V^2)	-

Adversary Oracle with Priority Queue (WIP)

We improve the above algorithm using a priority queue.

Metrics

	From scratch	For a new message
Time complexity	O(V^2 + VM)	O(V^2)
Space complexity	O(V^2)	-

Ideal Oracle

We can construct an oracle which is equivalent to the necessary and sufficient conditions of finality by simulating all the possible state transitions. If the consensus value does not change in any reachable future states, the property is considered finalized. Although this is the best about the completeness, it would be extraordinary inefficient, so we omit here about the detail.

Comparison

Complexity

Detection from scratch

	Clique Oracle	Turán Oracle	Simple Inspector	Inspector	Adversary Oracle	Adversary Oracle with Priority Queue
Time	exponential	O(V^2 + VM)	O(V^2 + VM)	O(V^2J)	O(V^3 + VM)	O(V^2 + VM)
Space	-	O(V^2)	O(V^2)	O(J)	O(V^2)	O(V^2)

Detection for a new message

	Clique Oracle	Turán Oracle	Simple Inspector	Inspector	Adversary Oracle	Adversary Oracle with Priority Queue
Time	exponential	O(1)	O(V^2)	O(V^2J)	O(V^3)	O(V^2)

Fault tolerance and quorum

This image shows the relationship between Byzantine fault tolerance (for both safety and liveness) and the quorum of safety oracles. The q = n - t line is the maximum number of honest validators. For liveness, the quorum must be less than or equal to this line. The safety oracles whose quorum condition is q > n/2 + t achieve n/4 Byzantine fault tolerance. On the other hand, safety oracles that use q > n/2 + t/2 achieve n/3.

Example 1

	Clique Oracle	Turán Oracle	Simple Inspector	Inspector	Adversary Oracle
t	1	1	1	1	1

Example 2

	Clique Oracle	Turán Oracle	Simple Inspector	Inspector	Adversary Oracle
t	1	1	1	2	1

In this sample, the Inspector fault tolerance threshold is:

t = ceil((1-2^(-2))(2q - n)) - 1 = ceil((3/4)*(8-4)) - 1 = 2.

Example 3

	Clique Oracle	Turán Oracle	Simple Inspector	Inspector	Adversary Oracle
t	N/A	N/A	1	1	1

In this case, the clique oracle and the Turán oracle have not yet detected finality.

{A,B,C,D,E} is the quorum set and q = 4, so fault tolerance threshold of the Simple Inspector and the Inspector t = ceil(q - n/2) - 1 = ceil(4 - 5/2) - 1 = 1.

The t of the adversary oracle is also ceil((4-1)/2) - 1 = 1.

Example 4

	Clique Oracle	Turán Oracle	Simple Inspector	Inspector	Adversary Oracle
t	2	N/A	2	2	2

In the Turán oracle, n^2/(n^2-2|E|) = 8^2 / (8^2 - 7*6) = 64/22 = 2.9... < n/2.

Conclusion

The Simple Inspector is better than the 2-round clique oracle and the original adversary oracle.
The adversary oracle is equivalent to the Simple Inspector for detecting finality.
The Inspector is just a little slow algorithm but can achieve its fault tolerance threshold of up to n/3.

safety-oracle's People

Contributors

Stargazers

Watchers

Forkers

minaminao

safety-oracle's Issues

Infinite layer for n/3 Byzantine fault tolerance is not because of computational efficiency, but fundamental

Redefine `q` with `F(σ)`

This is necessary for formal verification

Add intro of CBC Casper, safety oracle

This is not in the main scope of this project but helpful for some readers.

It'd be nice to put some links like https://hackingresear.ch/cbc-finality/

Add conclusion

Claim 1: Simple Inspector is better than 2-round clique oracle and the original adversary oracle
- The simple inspector is much better than 2-round clique oracle in both efficiency and completeness
- The simple inspector is better than the original adversary oracle in efficiency
  - Is the completeness the same? (#13 )

etc...

Explain Turan oracle requires more than `q` of validators

Add the definition of the "k-round" clique oracle

Explain the rationale of "higher-round" oracles

Explain why we want equivocations to be observable, which in turn explains why CBC Casper has justification.
Also, note the trade-off between higher rounds and latency to finality detection for comparison.

Related to #3

Initial detection of the latest finalized block

For a given blockchain and a set of messages, how efficiently detect the latest finalized block?
To start from the genesis block seems suboptimal.

E.g. Perform a binary search to find the highest block supported by a majority of validators

Is the fault tolerance for the original adversary oracle correct?

the minimum validator weight in C. !?

Inspector is strictly better than clique oracle

Make sure Inspector is the best at both completeness (#27) and computational efficiency
Modify the conclusion to focus this claim

Can the Inspector be improved from O(V^2J)?

It seems that the Inspector can be improved from O(V^2J) to O(VJ log V). #14

[PM] Goal

Goal

Phase 1

Summarize
- Algorithm and data structures to detect finality
- Algorithm and data structures to update when a message comes in or when a validator is added or deleted
Compare existing safety oracles
- Clique Oracle
- Turán Oracle
- Adversary Oracle
- The Inspector
- Ideal Oracle (Necessary and sufficient oracle)
Metrics
- Threshold: (Number of validators required to form the DAG, which decides the fault tolerance for plausible liveness)
- Time to detection
- Time complexity
- Space complexity (Storage, Memory)

Phase 2

Modified Adversary Oracle
Simulation

Why >3 round clique oracle? (not inspector?)

Also, the threshold od 2-round clique oracle is n/2 + t/2 ?

`t` should be the number of equivocations which the DAG can tolerate

I.e. It takes more than t equivocations to revert the finality

Also, q = floor(n/2 + t) + 1 is clearer

Inspector needs to find a maximum `q`

Theorem: CBC Casper needs at least 3 rounds to have `n/3` fault tolerance

Also, explain this is not because of the whole CBC but "no in-protocol threshold" philosophy, or strictly speaking, "decoupling of safety/liveness threshold"

Liveness optimistic safety oracle

There seem to be various techniques like Turan oracle which detects t-finality in lucky cases, sometimes requiring more (than the corresponding q of) validators and denser message DAG.

This compromises fault tolerance for liveness and requires stronger synchrony assumption but might be good in practice about computational complexity (compared to clique oracle) or time to detection (compared to k-level inspector).

(If an algorithm to find that a lower bound of a clique is >q need >q + α of validators, the fault tolerance for plausible liveness is reduced proportionally to α)

Ideas:

One of such ideas is Turan oracle with pre-processing.
- E.g. First, remove all the validators who are not connected to more than q of validators (This needs O(n^2)) and if there remained more than q validators then apply the Turan theorem
Turan oracle for 3-rounds clique oracle with observable equivocations #3

3-round Clique oracle with "observable" equivocations

Make sure and prove that this threshold is correct for 3 round clique oracle

T(σ) = n/2 + t/2 - Fo(σ)

Here, Fo(σ) is a number of equivocations which is observable for any member of the given clique

https://scrapbox.io/layerx/Casper_CBC:_Observable_equivocations

Naming of clique oracle

"Clique oracle is exponential" is a bit unfair since the clique is not necessarily the maximum sized clique.

Should we redefine "clique oracle" as a family of oracles which detects finality from a clique of validators and call the current clique oracle "find-max-weight-clique" or something?
(Also, make it include Turan oracle.)

Adversary oracle with observable equivocations

Questions

Can we improve the current adversary oracle by considering the observable equivocations?
At least, can this oracle detect all of the finality in #3?
- If so, this oracle is only the oracle which is polynomial and the fault tolerance for plausible liveness is n/3
  - Is it really polynomial?
Comparison with Inspector
- Hyp. Worse in computational complexity but can detect more finality (and hence better at "time to finality")
- Does this improve the "V is much bigger than V'" case of Inspector?

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.
Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
TensorFlow

An Open Source Machine Learning Framework for Everyone
Django

The Web framework for perfectionists with deadlines.
Laravel

A PHP framework for web artisans
D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
web

Some thing interesting about web. New door for the world.
server

A server is a program made to process requests and deliver data to clients.
Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Visualization

Some thing interesting about visualization, use data art
Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.
Microsoft

Open source projects and samples from Microsoft.
Google

Google ❤️ Open Source for everyone.
Alibaba

Alibaba Open Source for everyone
D3

Data-Driven Documents codes.
Tencent

China tencent open source team.

layerxcom / safety-oracle Goto Github PK

safety-oracle's Introduction

Safety Oracle

Table of Contents

Definitions

MessageDAG

Message types

Example

Notations

Lobbying Graph

Example

Complexity

Clique Oracle

2-round Clique Oracle

Algorithm

Metrics

Why q > n/2 + t?

Turán Oracle

Turán theorem

Algorithm

Metrics

Finality Inspector

Simple Inspector

Algorithm

Metrics

Inspector

Algorithm

Metrics

Adversary Oracle

Adversary Oracle (straightforward)

Algorithm

Metrics

Adversary Oracle with Priority Queue (WIP)

Metrics

Ideal Oracle

Comparison

Complexity

Detection from scratch

Detection for a new message

Fault tolerance and quorum

Example 1

Example 2

Example 3

Example 4

Conclusion

safety-oracle's People

Contributors

Stargazers

Watchers

Forkers

safety-oracle's Issues

Goal

Phase 1

Phase 2

Recommend Projects

Recommend Topics

Recommend Org