21
NoSQL Overview Reporter: Haiping Wang WAMDM Cloud Group Mail:[email protected]

Reporter: Haiping Wang WAMDM Cloud Group Mail:[email protected]

  • View
    246

  • Download
    8

Embed Size (px)

Citation preview

Page 1: Reporter: Haiping Wang WAMDM Cloud Group Mail:lulang1022@yahoo.com.cn

NoSQL Overview

Reporter: Haiping WangWAMDM Cloud Group

Mail:[email protected]

Page 2: Reporter: Haiping Wang WAMDM Cloud Group Mail:lulang1022@yahoo.com.cn

Outline Why NoSQL?

Four trendsHistory

What is NoSQL?Definition Three fundamental theories

NoSQL categories RDBMS vs. NoSQL

Page 3: Reporter: Haiping Wang WAMDM Cloud Group Mail:lulang1022@yahoo.com.cn

Trend1:data set sizeRapid Increase of Data

57% every year (IDC2007) Double every 1.5 years988EB (1EB=1024PB) data will be produced in

2010 (IDC) 18 million times of all info in books

Page 4: Reporter: Haiping Wang WAMDM Cloud Group Mail:lulang1022@yahoo.com.cn

Trend2:Information connectivity

Page 5: Reporter: Haiping Wang WAMDM Cloud Group Mail:lulang1022@yahoo.com.cn

Trend3:Semi-structure Individualization of content!

In the salary lists of the 1970s, all elements had exactly one job

In the salary lists of the 2000s, we need 5 job columns! Or 8? Or 15?

Trend accelerated by the decentralization of content generation that is the hallmark of the age of participation (“web 2.0”)

Page 6: Reporter: Haiping Wang WAMDM Cloud Group Mail:lulang1022@yahoo.com.cn

RDBMS performance

Page 7: Reporter: Haiping Wang WAMDM Cloud Group Mail:lulang1022@yahoo.com.cn

Trend4:architecture changes

Page 8: Reporter: Haiping Wang WAMDM Cloud Group Mail:lulang1022@yahoo.com.cn

NoSQL history The term NoSQL was first used in 1998Reintroduced in early 2009 by Eric EvansHot in 2009

Page 9: Reporter: Haiping Wang WAMDM Cloud Group Mail:lulang1022@yahoo.com.cn

Outline Why NoSQL?

Four trendsHistory

What is NoSQL?Definition Three fundamental theories

NoSQL categories RDBMS vs. NoSQL

Page 10: Reporter: Haiping Wang WAMDM Cloud Group Mail:lulang1022@yahoo.com.cn

Definition From http://nosql-database.org/

Original intention modern web-scale databases

Characteristics non-relational, Distributed  open-source horizontal scalable schema-free easy replication support simple API eventually consistent / BASE

(not ACID) Others…

From Wikipedialoosely defined class of

non-relational data storesnot require fixed table

schemasAvoid join operationsScale horizontally

NoSQL is NOT Only SQL

Page 11: Reporter: Haiping Wang WAMDM Cloud Group Mail:lulang1022@yahoo.com.cn

Fundamental theories CAPBASE

APEventual consistency

Causal consistencyRead-your-writes consistencySession consistencyMonotonic read consistencyMonotonic write consistency

Page 12: Reporter: Haiping Wang WAMDM Cloud Group Mail:lulang1022@yahoo.com.cn

Outline Why NoSQL?

Four trendsHistory

What is NoSQL?Definition Three fundamental theories

NoSQL categories RDBMS vs. NoSQL

Page 13: Reporter: Haiping Wang WAMDM Cloud Group Mail:lulang1022@yahoo.com.cn

NoSQL categories Key-value stores

Based on DHTs / Amazon's Dynamo paper Data model: (global) collection of K-V pairs Example: Dynomite, Voldemort, Tokyo

BigTable clones Based on Google's BigTable paper Data model: big table, column families Example: Hbase, Hypertable

Page 14: Reporter: Haiping Wang WAMDM Cloud Group Mail:lulang1022@yahoo.com.cn

NoSQL categories Document databases

Inspired by Lotus Notes Data model: collections of K-V collectionsExample: CouchDB, MongoDB

Graph databasesInspired by Euler & graph theory Data model: nodes, rels, K-V on both Example: AllegroGraph, VertexDB, Neo4j

Page 15: Reporter: Haiping Wang WAMDM Cloud Group Mail:lulang1022@yahoo.com.cn

Key-value stores

Key Value

... name_€#_Stellamood_€#_Happ

ybirthdate%/// 135465645)

dog_12

Page 16: Reporter: Haiping Wang WAMDM Cloud Group Mail:lulang1022@yahoo.com.cn

Bigtable clones

Page 17: Reporter: Haiping Wang WAMDM Cloud Group Mail:lulang1022@yahoo.com.cn

Document databases

Key document

dog_12

{type: “Dog”,

name: “Stella”, mood: “Happy”, birthdate: 2007-

04-01}

Page 18: Reporter: Haiping Wang WAMDM Cloud Group Mail:lulang1022@yahoo.com.cn

Graph databases

Page 19: Reporter: Haiping Wang WAMDM Cloud Group Mail:lulang1022@yahoo.com.cn

RDBMS vs. NoSQLStrong consistency vs. Eventual consistency Big dataset vs. HUGE datasets Scaling is possible vs. Scaling is easySQL vs. Map-ReduceGood availability vs. Very high availability

Page 20: Reporter: Haiping Wang WAMDM Cloud Group Mail:lulang1022@yahoo.com.cn
Page 21: Reporter: Haiping Wang WAMDM Cloud Group Mail:lulang1022@yahoo.com.cn

Thank you!!!