column-oriented database
DESCRIPTION
Column-Oriented Database. Yiqun Xie (Ian) & Yingbin Liang (Ben) Group 9. Row-based RDBMS vs. Column-oriented DBMS. Row-based, the best solution?. Big Data OLAP-Online Analytical Processing … …. Traditional row-based RDBMS. - PowerPoint PPT PresentationTRANSCRIPT
Column-Oriented Database
Yiqun Xie (Ian) & Yingbin Liang (Ben)Group 9
Row-based RDBMS vs. Column-oriented
DBMS
Row-based, the best solution?• Big Data
• OLAP-Online Analytical Processing
• … …
Traditional row-based RDBMS• Optimal write time and abundant reading overhead for retrieval of
subset queries
Traversing all the headers!Going through all attributes.
varchar
From row to column…
SELECT c1 FROM bigtable; …
…ideal
Column-oriented DBMS• Optimal read time for subset retrieval queries
SELECT c1 FROM bigtable;
100GB1GB 10GB
20GB
19GB
30GB
20GB
col6
col1 col2col3 col4
col5
RDBMS
Strengths
SQL Example:• SELECT month, store
FROM sales, productsWHERE productType = ‘technology’AND products.id = sales.productIDGROUP BY month, store
• Easier to implement data compression algorithms (e.g. Run-length encoding)
Only read columns needed ! (7 columns)
Fruit (3 Apples, 2 Pears, 1 Banana, 2 Oranges, 1 Apple)
Column-oriented DBMS• Bad write performance
• Advanced column-oriented storage system: HBase, MongoDB, MapReduce
INSERT INTO bigtable VALUES (Rn1, Rn2, Rn3, Rn4)
Weakness
Relationship With Course
References[1] http://www.cs.yale.edu/homes/dna/talks/Column_Store_Tutorial _VLDB09.pdf[2] http://www.spatial.cs.umn.edu/Courses/Fall11/5707/class_no tes/G5.ppt[3] http://www.slideshare.net/arangodb/introduction-to-column- oriented-databases[4] Column-oriented database speeds predictive criminal monitoring program
Questions?