Column-Oriented Database

Preview:

DESCRIPTION

Column-Oriented Database. Yiqun Xie (Ian) & Yingbin Liang (Ben) Group 9. Row-based RDBMS vs. Column-oriented DBMS. Row-based, the best solution?. Big Data OLAP-Online Analytical Processing … …. Traditional row-based RDBMS. - PowerPoint PPT Presentation

Citation preview

Column-Oriented Database

Yiqun Xie (Ian) & Yingbin Liang (Ben)Group 9

Row-based RDBMS vs. Column-oriented

DBMS

Row-based, the best solution?• Big Data

• OLAP-Online Analytical Processing

• … …

Traditional row-based RDBMS• Optimal write time and abundant reading overhead for retrieval of

subset queries

Traversing all the headers!Going through all attributes.

varchar

From row to column…

SELECT c1 FROM bigtable; …

…ideal

Column-oriented DBMS• Optimal read time for subset retrieval queries

SELECT c1 FROM bigtable;

100GB1GB 10GB

20GB

19GB

30GB

20GB

col6

col1 col2col3 col4

col5

RDBMS

Strengths

SQL Example:• SELECT month, store

FROM sales, productsWHERE productType = ‘technology’AND products.id = sales.productIDGROUP BY month, store

• Easier to implement data compression algorithms (e.g. Run-length encoding)

Only read columns needed ! (7 columns)

Fruit (3 Apples, 2 Pears, 1 Banana, 2 Oranges, 1 Apple)

Column-oriented DBMS• Bad write performance

• Advanced column-oriented storage system: HBase, MongoDB, MapReduce

INSERT INTO bigtable VALUES (Rn1, Rn2, Rn3, Rn4)

Weakness

Relationship With Course

References[1] http://www.cs.yale.edu/homes/dna/talks/Column_Store_Tutorial _VLDB09.pdf[2] http://www.spatial.cs.umn.edu/Courses/Fall11/5707/class_no tes/G5.ppt[3] http://www.slideshare.net/arangodb/introduction-to-column- oriented-databases[4] Column-oriented database speeds predictive criminal monitoring program

Questions?

Recommended