Overview on ZHT - IIT-Computer Scienceiraicu/teaching/CS554-F13/lecture03_ZHT-overview.… ·...

Overview on ZHT

General terms Overview to NoSQL dabases and key-value

stores Introduction to ZHT CS554 projects

Relational databases Query with SQL

DB2, MySQL, Oracle, SQL Server

CS 425, 525 NoSQL databses

Loose consistency model

Simpler design

High performance

Distributed design

Key-Value store

ZHT, Dynamo, Memcached, Cassandra, Chord

Document Oriented Databases

MongoDB, Couchbase

Graph databases

Neo4J, Allegro, Virtuoso

Another name for Distributed Hash Table

1 Node

Client 1 … n

Value j

Replica

Value j

Replica

Value j

Replica

Value k

Replica

1 Value k

Replica

Value k

Replica

Updating membership tables

Planed nodes join and leave: strong consistency

Nodes fail: eventual consistency

Updating replicas

Configurable

Strong consistency: consistent, reliable

Eventual consistency: fast, availability

Many DHTs: Chord, Kademlia, Pastry, Cassandra, C-MPI, Memcached, Dynamo ...

Why another?

Name Impl.

Routing Time

Persistence Dynamic

membership

Append Operation

Cassandra Java Log(N) Yes Yes No

C-MPI C Log(N) No No No

Dynamo Java 0 to

Log(N) Yes Yes No

Memcached C 0 No No No

ZHT C++ 0 to 2 Yes Yes Yes

ZHT Bench: Benchmarking mainstream NoSQL databases

ZHT Cons: Eventual consistency support for ZHT ZHT DMHDFS: Distributed Metadata

Management for the Hadoop File System ZHT Graph: Design and implement a graph

database on ZHT ZHT OHT: Hierarchical Distributed Hash Tables ZHT ZST: Enhance ZHT through Range Queries

and Iterators

IBM Blue Gene/P supercomputer

Up to 8192 nodes

32768 instance deployed

Commodity Cluster

Up to 64 node

Amazon EC2

M1.medium and Cc2.8xlarge

96 VMs, 768 ZHT instances deployed

Familiar with Linux and it’s command line Shell scripting language (eg. Bash, zsh…) Programming skills in C++/C (except

benchmark) GCC compiler No object oriented skill needed

Goal: Extensively benchmarking NoSQL databases and analysis performance data.

ZHT, MongoDB, Cassandra Neo4J (experiment for Graph) And others… Metrics

Latency and its distribution , throughput Parameters

Message size Scales Key Distributions

Goal 1: allow replicas serve read operation Goal 2: maintain eventual consistency

between replicas Goal 3: make it scale (pretty hard!)

Optional goal: allow replicas serve write

requests and maintain consistency (applying Paxos protocol, even harder)

What is metadata? Goal: improve HDFS performance by adding

distributed metadata service Requirement: experience with Hadoop and

HDFS; strong programming skill in both Java and C++

1 2 4 8 16 32 64 128 256 512

Number of Nodes

Fusionfs

Goal: build a graph databases on top of ZHT How: construct a mapping from key-value

store interface to graph interface

Goal: adding a proxy level to ZHT architecture so to reduce concurrency stress to each server

Easy: make it work and scale Hard: handle failures

Goal: design and implement new interface methods to ZHT

Iterator: next/previous operation

Range get/put: given a range of key, return a series of results in one request loop

Sorted map

B+ tree (bold!)

Communication: come and talk to me (by appointment)

Make good use of Google Fail quick, fail early, fail cheap. Fast iteration: very small but frequent

progress

Why bother? 80% points from projects!

Welcome abroad and enjoy!

Tonglin Li

tli13@hawk.iit.edu http://datasys.cs.iit.edu/projects/ZHT/

Overview on ZHT - IIT-Computer Scienceiraicu/teaching/CS554-F13/lecture03_ZHT-overview.… ·...

Documents

A role-based access control model and ... - Binghamtonkang/teaching/cs554/rbac-implement.pdf · implementations, market analysis, and observations made by Jansen [1988] and Hoffman

Parallel Numerical Algorithmssolomon2.web.engr.illinois.edu/teaching/cs554/slides/... · 2019. 11. 21. · Krylov Methods Other Methods Parallel Numerical Algorithms Chapter 5 –

Policies & Procedures, ZHT, SG, 2014

Cs554 Introduction to Rational Rose 12219

CloudKon: a CLOUD-enabled distributed tasK executiON …iraicu/teaching/CS554-F13/lecture03_CloudKon-overview.pdf•Design and implement simple yet effective distributed task execution

Pompes centrifuges Modèle ZHT / ZHB / ZHS / TH / THK / DUO

Final Presentation [CS554] Designs for Software and Systems Supreme Design Khalil Mezyaoui, Jun Ho Yi Jongmin Lee, Taeju Park

Project 2 Presentations CS554 – Designs for Software and Systems

h noeoda nÌH$ma Zht h¡ Am¡a Zm hr h mao nmg nÌH$m[aVm H$m ...golalariya.com/pdf/biodata/dec,11.pdf · _mh - {Xgå~a 2011 6 h_ noeoda nÌH$ma Zht h¡ Am¡a Zm hr h_mao nmg nÌH$m[aVm

ZHT: A Light-weight Reliable Persistent Dynamic Scalable ...iraicu/research/publications/2013_IPDPS13_ZHT.pdf · Abstract— This paper presents ZHT, a zero-hop distributed hash table,

CS554 ï¿½ Introduction to Rational Rose

MAESTRO ZHT - Axion Biosystems

Parallel Numerical Algorithmssolomon2.web.engr.illinois.edu/teaching/cs554/slides/slides_08.pdf · Parallel Numerical Algorithms Chapter 4 – Sparse Linear Systems Section 4.1 –

ZHT: A Light-weight Reliable Persistent Dynamic Scalable ...datasys.cs.iit.edu/publications/2013_IPDPS13_ZHT.pdfZHT: A Light-weight Reliable Persistent Dynamic Scalable Zero-hop Distributed

ZHT: a Zero-hop DHT for High-End Computing …datasys.cs.iit.edu/reports/2012_Qual-IIT_ZHT.pdfIllinois Institute of Technology Department of Computer Science ZHT: a Zero-hop DHT for

OHT: Hierarchical Distributed Hash Tablesiraicu/teaching/CS554-F13/best-reports/...OHT: Hierarchical Distributed Hash Tables Kun Feng, Tianyang Che, Tonglin Li, Ioan Raicu Department

CloudKon: a CLOUD-enabled distributed tasK executiON …iraicu/teaching/CS554-F13/lecture03_CloudKon... · CloudKon: a CLOUD-enabled distributed tasK executiON framework Iman Sadooghi

ZHT 1 A Fast, Reliable and Scalable Zero-hop Distributed Hash Table

Introduction & Motivation Problem Statementiraicu/teaching/CS554-S15/lecture06... · 2015. 2. 11. · •Introduction & Motivation • Problem Statement • Proposed Work • Evaluation

The Fusion Distributed File System - IIT-Computer Scienceiraicu/teaching/CS554-S15/lecture04-FusionFS.pdf · Illinois Institute of Technology Department of Computer Science Dongfang