
CHORD: A Scalable Peer-to-Peer Lookup Service for Internet Applications

Ion Stoica, Robert Morris, David Karger, M. Frans Kaashoek, Hari Balakrishnan

MIT Laboratory for Computer Science

Adapted for presentation in 527-07, UBC

Outline
- Background
- Chord
  - Naming
  - Searching
  - Storing data
  - Node Join
  - Node Leave
  - Fault Tolerance
  - Load Balancing
  - Simulation and Experimental Results
- Summary

P2P Background

What is P2P?
- An application-level Internet on top of the Internet
- Every participating node acts as both a client and a server ("servent")
- Every node "pays" for its participation by providing access to (some of) its resources

Properties:

- No central coordination
- No central database
- No peer has a global view of the system; global behavior emerges from local interactions
- All existing data and services are accessible from any peer
- Dynamic: peers and connections are unreliable

P2P applications
- File sharing and storage systems
- Peer information retrieval and P2P web search
- Peer data management and peer query reformulation

P2P Network Requirements

- Efficient data location and routing (scalability)
- Adaptable to changes (data and query)
- Self-organizing, ad hoc participation
- Robust and fault tolerant

Question: how to find the data? An efficient index mechanism: the DHT.

Hash Table
- A data structure that maps "keys" to "buckets"
- An essential building block in software systems

Distributed Hash Table (DHT)
- Similar, but spread across the Internet
- Relies on hashing to achieve load balancing

Interface: every DHT node supports two operations:
- lookup(key): given a key as input, route messages toward the node holding the key
- insert(key, value): store the value at the node responsible for the key
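A minimal sketch of this interface in Python; the method names follow the slide, while the class and type hints are illustrative, not from any real DHT library:

```python
# Minimal sketch of the DHT node interface described above.
# lookup/insert follow the slide; everything else is illustrative.
from abc import ABC, abstractmethod

class DHTNode(ABC):
    @abstractmethod
    def lookup(self, key: bytes) -> "DHTNode":
        """Route the request toward the node responsible for `key`
        and return (a reference to) that node."""

    @abstractmethod
    def insert(self, key: bytes, value: bytes) -> None:
        """Store `value` at the node returned by lookup(key)."""
```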

DHT Design Goals
- Low diameter: theoretical search bound
- Low degree: limited number of neighbours
- Local routing decisions: greedy; nodes incrementally calculate a path to the target
- Load balance: a distributed hash function spreads keys evenly over the nodes
- Robustness: operational even under partial failure
- Availability: the system automatically adjusts its internal tables so that the node responsible for a key can always be found

An Example DHT: Chord

Consistent Hashing
- A scheme that provides hash-table functionality such that the addition or removal of one bucket does not significantly change the mapping of keys to buckets
- A consistent hashing function maps both the buckets and the keys into the same range; the consistent hash of a key is the bucket whose image is closest to the key's
- When buckets join or leave, only nearby buckets are affected

Chord is a distributed hash table built on consistent hashing.
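A minimal consistent-hashing sketch in Python, assuming SHA-1 identifiers truncated to M bits and the Chord convention that "closest" means the first bucket clockwise from the key; the names are illustrative:

```python
# Consistent hashing sketch: buckets and keys are hashed onto the same ring,
# and a key is assigned to the first bucket at or after it (clockwise).
import hashlib
from bisect import bisect_left

M = 32                                   # identifier bits: ring of size 2**M

def ring_id(name: str) -> int:
    return int.from_bytes(hashlib.sha1(name.encode()).digest(), "big") % (2 ** M)

class ConsistentHash:
    def __init__(self, buckets):
        # Bucket identifiers, kept sorted so we can binary-search the ring.
        self.ids = sorted(ring_id(b) for b in buckets)
        self.by_id = {ring_id(b): b for b in buckets}

    def bucket_for(self, key: str) -> str:
        # First bucket id >= the key's id, wrapping around the ring.
        i = bisect_left(self.ids, ring_id(key)) % len(self.ids)
        return self.by_id[self.ids[i]]

ch = ConsistentHash(["node-a", "node-b", "node-c"])
print(ch.bucket_for("some-key"))   # adding/removing a bucket only moves nearby keys
```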

Chord Naming

- Key identifier = SHA-1(key)
- Node identifier = SHA-1(IP address)
- Both are uniformly distributed
- Both exist in the same ID space
- How to map key IDs to node IDs?

Figure: a circular 7-bit identifier space with nodes N32, N90, and N105 and keys K5, K20, and K80 placed on the ring.

Figure: an identifier circle with identifiers 0-7 (m = 3), nodes 0, 1, and 3, and keys 1, 2, and 6.

Successor Nodes

successor(1) = 1

successor(2) = 3
successor(6) = 0

A key is stored at its successor: the node with the same ID or the next higher ID.

Note: successor(key) is different from successor(node)!
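As a sanity check of the successor rule, here is a tiny sketch over the 3-bit example ring above (nodes 0, 1, 3); the helper itself is illustrative:

```python
# successor(k): first node whose identifier is >= k, wrapping around the ring.
NODES = [0, 1, 3]        # node identifiers on the m = 3 ring (identifiers 0..7)

def successor(key_id: int) -> int:
    for n in NODES:      # NODES is sorted
        if n >= key_id:
            return n
    return NODES[0]      # wrapped past the largest identifier

assert successor(1) == 1
assert successor(2) == 3
assert successor(6) == 0   # key 6 wraps around and is stored at node 0
```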

CHORD Search: Basic lookup

Figure: basic lookup on a ring with nodes N10, N32, N60, N90, N105, and N120. A node asks "Where is key 80?"; the query is forwarded from successor to successor until it reaches the node responsible for the key, which replies "N90 has K80".
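To make the figure concrete, here is a sketch of this successor-only lookup (class and helper names are illustrative, not the paper's pseudocode). Because each node knows only its successor, a query can take O(N) hops, which motivates the finger tables introduced next:

```python
# Basic lookup: walk the ring via successor pointers until the key's
# position falls between a node and its successor.
def on_arc(x: int, left: int, right: int) -> bool:
    """True if x lies in the half-open ring interval (left, right]."""
    if left < right:
        return left < x <= right
    return x > left or x <= right          # interval wraps past zero

class SimpleNode:
    def __init__(self, node_id: int):
        self.id = node_id
        self.successor: "SimpleNode" = self   # set once the ring is built

    def find_successor(self, key_id: int) -> "SimpleNode":
        node = self
        while not on_arc(key_id, node.id, node.successor.id):
            node = node.successor             # one hop per node in the worst case
        return node.successor
```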

CHORD Search: Acceleration of Lookups
Each node n maintains a routing table with (at most) m entries, called the finger table:
- finger[i].start = (n + 2^(i-1)) mod 2^m, for 1 <= i <= m
- finger[i].interval = [finger[i].start, finger[i+1].start)
- finger[i].node: the first node >= finger[i].start
- The successor is the next node on the identifier ring, i.e. finger[1].node
- The predecessor is the previous node on the identifier ring

Figure: finger tables for the example ring (m = 3; nodes 0, 1, 3; keys 1, 2, 6):

Node 0: start 1, 2, 4; interval [1,2), [2,4), [4,0); finger 1, 3, 0; stores key 6
Node 1: start 2, 3, 5; interval [2,3), [3,5), [5,1); finger 3, 3, 0; stores key 1
Node 3: start 4, 5, 7; interval [4,5), [5,7), [7,3); finger 0, 0, 0; stores key 2

CHORD Search: Acceleration of Lookups


- Each node stores information about only a small number of other nodes, and knows more about nodes closely following it than about nodes farther away
- A node's finger table generally does not contain enough information to determine the successor of an arbitrary key k (e.g. node 3 looking up key 1)
- Repeatedly querying nodes that immediately precede the given key eventually leads to the key's successor

The algorithm
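The slide refers to the scalable lookup from the Chord paper (find_successor, find_predecessor, closest_preceding_finger). Below is a Python rendering of that pseudocode as a local simulation, with no RPC layer and 0-based finger indexing:

```python
M = 7   # identifier bits in the running example (ring of size 128)

def in_open(x, a, b):        # x in (a, b) on the ring
    return (a < x < b) if a < b else (x > a or x < b)

def in_half_open(x, a, b):   # x in (a, b] on the ring
    return (a < x <= b) if a < b else (x > a or x <= b)

class Node:
    def __init__(self, node_id: int):
        self.id = node_id
        self.successor = self
        self.predecessor = None
        # finger[i] points to successor((id + 2**i) mod 2**M); filled at join time.
        self.finger = []
        self.store = {}      # local <key, value> pairs this node is responsible for

    def find_successor(self, key_id: int) -> "Node":
        return self.find_predecessor(key_id).successor

    def find_predecessor(self, key_id: int) -> "Node":
        node = self
        while not in_half_open(key_id, node.id, node.successor.id):
            node = node.closest_preceding_finger(key_id)
        return node

    def closest_preceding_finger(self, key_id: int) -> "Node":
        for f in reversed(self.finger):          # try the farthest finger first
            if f is not None and in_open(f.id, self.id, key_id):
                return f
        return self
```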

CHORD Search: An example

Figure: an example lookup, Lookup(K19), in a 7-bit identifier space with nodes N5, N10, N20, N32, N60, N80, N99, and N110. Each hop forwards the query via the finger-table entry whose interval contains key 19, until it reaches N10; since 19 belongs to (10, 20], N10 is the predecessor of key 19 and its successor N20 holds K19. Lookups take O(log N) hops.

CHORD: Storing Data
- Search for the node responsible for holding the key
- Insert the key (and its value) into that node
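A sketch of both steps on top of the Node class from the lookup sketch above; key_hash, store, and the function names are assumptions for this local simulation, not the paper's API:

```python
import hashlib

def key_hash(key: str) -> int:
    # Same SHA-1-based identifier used for nodes, truncated to M bits.
    return int.from_bytes(hashlib.sha1(key.encode()).digest(), "big") % (2 ** M)

def chord_insert(start: "Node", key: str, value: bytes) -> None:
    owner = start.find_successor(key_hash(key))   # step 1: find the responsible node
    owner.store[key] = value                      # step 2: store the pair there

def chord_lookup(start: "Node", key: str):
    owner = start.find_successor(key_hash(key))
    return owner.store.get(key)
```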

CHORD Node Joins

Figure: node 6 joins the example ring of nodes 0, 1, and 3. Node 6 builds its own finger table (start 7, 0, 2; interval [7,0), [0,2), [2,6); finger 0, 0, 3), key 6 is transferred to it from node 0, and finger entries at nodes 0, 1, and 3 that should now point to node 6 are updated.

Node 6 wants to join

1. Initialize fingers and predecessor: O(log^2 N)

2. Transferring keys: O(1)

3. Updating fingers of existing nodes: O(log^2 N)

Total cost: O(log^2 N) (a simplified sketch of these steps follows below)

Each node keeps a predecessor pointer
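A simplified join sketch over the same local simulation (Node, M, in_half_open, key_hash from the earlier sketches). The real protocol performs these steps with remote calls, and the third step can also be left to the periodic stabilization described later; the helper names and the eager pointer updates here are assumptions:

```python
def join(n: "Node", existing: "Node") -> None:
    # 1. Initialize fingers and predecessor by asking a node already in the ring.
    #    (Assumes predecessors are already set in the existing ring.)
    n.finger = [existing.find_successor((n.id + 2 ** i) % 2 ** M) for i in range(M)]
    n.successor = n.finger[0]
    n.predecessor = n.successor.predecessor
    n.successor.predecessor = n

    # 2. Transfer the keys in (predecessor, n] from the successor to n.
    for k in list(n.successor.store):
        if in_half_open(key_hash(k), n.predecessor.id, n.id):
            n.store[k] = n.successor.store.pop(k)

    # 3. Fingers of existing nodes that should now point to n get updated
    #    (here: lazily, via stabilize/fix_fingers; the paper also gives an
    #    eager update_others routine).
```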

CHORD Node Departures

Figure: node 1 leaves the ring of nodes 0, 1, 3, and 6. Its key (key 1) is transferred to its successor, node 3, and finger entries at the remaining nodes that pointed to node 1 are updated to point to node 3.

Node 1 wants to leave

1. Transferring keys: O(1)

2. Remove the node: O(1)

3. Updating fingers and predecessors of affected nodes: O(log^2 N)

Total cost: O(log^2 N)

CHORD: Fault Tolerance

A basic "stabilization" protocol is used to keep nodes' successor pointers up to date, which is sufficient to guarantee correctness of lookups.

Every node runs stabilize periodically to find newly joined nodes.

Each node also periodically fixes its fingers: finger[i].node = find_successor(finger[i].start), as sketched below.
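A rendering of these routines in the style of the paper's stabilization pseudocode, again over the local Node simulation above; picking a random finger to refresh is one common way to schedule fix_fingers:

```python
import random

def stabilize(n: "Node") -> None:
    # Ask the successor for its predecessor; adopt it if it sits between us.
    x = n.successor.predecessor
    if x is not None and in_open(x.id, n.id, n.successor.id):
        n.successor = x
    notify(n.successor, n)

def notify(n: "Node", candidate: "Node") -> None:
    # candidate believes it might be n's predecessor.
    if n.predecessor is None or in_open(candidate.id, n.predecessor.id, n.id):
        n.predecessor = candidate

def fix_fingers(n: "Node") -> None:
    # Refresh one finger entry per round: finger[i] = find_successor(finger[i].start).
    if len(n.finger) < M:
        n.finger = [n.successor] * M       # lazy init for a freshly joined node
    i = random.randrange(M)
    n.finger[i] = n.find_successor((n.id + 2 ** i) % 2 ** M)
```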

CHORD: Fault Tolerance – Stabilization after Join

Figure: before n joins, succ(np) = ns and pred(ns) = np.

n joins:
- n sets predecessor = nil
- n acquires ns as its successor via some node n'
- n notifies ns that it is the new predecessor
- ns acquires n as its predecessor

np runs stabilize:
- np asks ns for its predecessor (now n)
- np acquires n as its successor
- np notifies n
- n acquires np as its predecessor

All predecessor and successor pointers are now correct.

Figure: after stabilization, pred(ns) = n and succ(np) = n.

CHORD: Fault Tolerance – Failure Recovery

The key step in failure recovery is maintaining correct successor pointers (in the worst case, lookups fall back to the simple successor-based lookup).

To help achieve this, each node maintains a successor list of its r nearest successors on the ring (sketched below).

If node n notices that its successor has failed, it replaces it with the first live entry in its successor list.

stabilize will then correct finger-table entries and successor-list entries that point to the failed node.

Performance is sensitive to the frequency of node joins and leaves versus the frequency at which the stabilization protocol is invoked
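A sketch of the successor-list failover described above; r is small (e.g. on the order of log N), and is_alive plus the successor_list attribute are assumptions for illustration, not part of the paper's pseudocode:

```python
R = 3   # size of the successor list kept by each node

def is_alive(node: "Node") -> bool:
    # Stand-in for a real liveness probe (ping / timed-out RPC).
    return getattr(node, "alive", True)

def repair_successor(n: "Node") -> "Node":
    # successor_list holds n's R nearest successors on the ring, in order.
    for s in n.successor_list:
        if is_alive(s):
            n.successor = s          # first live entry becomes the new successor
            return s
    raise RuntimeError("all R successors appear to have failed")
```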

CHORD: Fault Tolerance – Replication

Chord can store replicas of the data associated with a key at the k nodes succeeding the key.
(+) Increases data availability
(-) Larger <key, value> database at each node

CHORD: Load Balancing
Even though hashing is used, the number of keys stored on each node may vary widely, because the node identifiers do not uniformly cover the entire identifier space.

Solution (sketched below):
- Associate keys with virtual nodes
- Map multiple virtual nodes (e.g. log N of them, as suggested in the consistent hashing paper) to each real node
- This does not affect the worst-case performance: O(log(N log N)) = O(log N)
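A sketch of the virtual-node idea; deriving virtual identifiers with a "#i" suffix is an assumption for illustration, the point is only that each real node appears at roughly log N positions on the ring:

```python
import hashlib
import math

def virtual_ids(ip: str, n_physical: int, m_bits: int):
    """Ring identifiers at which one physical node appears."""
    v = max(1, round(math.log2(n_physical)))        # ~log N virtual nodes per real node
    return [
        int.from_bytes(hashlib.sha1(f"{ip}#{i}".encode()).digest(), "big") % (2 ** m_bits)
        for i in range(v)
    ]

print(virtual_ids("10.0.0.1", n_physical=1024, m_bits=32))   # 10 ring positions
```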

Experimental Results – Load Balance

Figure 2: The 1st and the 99th percentiles of the number of keys per node as a function of virtual nodes mapped to a real node.

Figure 1: The mean and 1st and 99th percentiles of the number of keys stored per node in a 10^4 node network.

Number of nodes: N = 10^4

Number of keys: K = 10^5 to 10^6

Experimental Results – Path Length

Number of nodes: N = 2^k
Number of keys: K = 100 * 2^k
k varies from 3 to 14

Experimental Results – Simultaneous Node Failures

Number of nodes: N = 10^4

Number of keys: K = 10^5 to 10^6

CHORD Summary

Scalability
- Search: O(log N) w.h.p.
- Update requires a search, thus O(log N) w.h.p.
- Construction: O(log^2 N) when a new node joins

Robustness
- Replication can be used by storing replicas at successor nodes

Global knowledge
- Mapping of IP addresses and data keys to a common key space

Autonomy
- Storage and routing: none
- Nodes have, by virtue of their IP address, a specific role

Search types
- Only equality (exact-match lookups)

Q & A
