1 1 apache hadoop and the emergence of the enterprise data hub eli collins, chief technologist...

Post on 19-Jan-2016

218 Views

Category:

Documents

2 Downloads

Preview:

Click to see full reader

TRANSCRIPT

1 ©2014 Cloudera, Inc. All rights reserved.1

Apache Hadoop and the Emergence of the Enterprise Data HubEli Collins, Chief Technologist

2 ©2014 Cloudera, Inc. All rights reserved.2

The Enterprise Data Warehouse

Flat Files

Operational Store

Data Sources

Staging

Reporting

Analysis

MiningOperational

Store Metadata

Summary

Facts & Dimensions

EDW

Archive

Data marts

3 ©2014 Cloudera, Inc. All rights reserved.3

The Enterprise Data Hub

imageslogs

binaryDB dumps

1. Inexpensive storage2. Flexible storage3. Co-located compute4. Multiple compute engines

MR, Pig/Hive, SQL, Spark, SAS, R, Search, Graph..

4 ©2014 Cloudera, Inc. All rights reserved.4

So it’s Like a Data Warehouse?

..but can store more data, more kinds of data, and do more flexible analysis. It’s open source and runs on industry standard hardware so it’s more economical.

5 ©2014 Cloudera, Inc. All rights reserved.5

An Analogy

6 ©2014 Cloudera, Inc. All rights reserved.6

What changed?

• The need?• Convenience? Cost?

7

Take and share good photos

8

Data Warehouse vs. Data Hub

©2014 Cloudera, Inc. All Rights Reserved.

Enterprise Data Warehouse Enterprise Data Hub

9 ©2014 Cloudera, Inc. All Rights Reserved.

An Operating System

APP

SCHEDULER

FILE SYSTEM

MG

TSERVICES

APP

LIB

APP 3rd PARTY APP

10 ©2014 Cloudera, Inc. All Rights Reserved.

An Enterprise Data Hub

BATCHPROCESSING

ANALYTICSQL

SEARCHENGINE

MACHINELEARNING

STREAMPROCESSING

3RD PARTYAPPS

WORKLOAD MANAGEMENT

STORAGE FOR ANY TYPE OF DATAUNIFIED, ELASTIC, RESILIENT, SECURE

DATAM

ANAG

EMEN

TSYSTEM

MAN

AGEM

ENT

Filesystem Online NoSQL

11 ©2014 Cloudera, Inc. All rights reserved.11

Data Warehousing with an EDH

Flat Files

Operational Store

Data SourcesEDH

Reporting AnalysisMining

Operational Store

EDW

1. Stage, transform, archive

3. Exploratory, Discovery,Search, ML..

2. Reporting,Mining,Analysis

12 ©2014 Cloudera, Inc. All rights reserved.12

What about data warehousing on the enterprise data hub?

13 ©2014 Cloudera, Inc. All rights reserved.13

14 ©2014 Cloudera, Inc. All Rights Reserved.

Data Warehousing in Cloudera’s EDH

15 ©2014 Cloudera, Inc. All rights reserved.15

16 ©2014 Cloudera, Inc. All rights reserved.

top related