hi - kerry osborne’s oracle blogkerryosborne.oracle-guy.com/papers/hadoop meets exadata 2.pdf ·...


Page 1: Hi - Kerry Osborne’s Oracle Blogkerryosborne.oracle-guy.com/papers/Hadoop Meets Exadata 2.pdf · Hadoop Meets Exadata DW Global Leaders Program Presented by: Kerry Osborne Hi Decemeber,

Hadoop Meets Exadata

DW Global Leaders Program

Presented by: Kerry Osborne

Hi

December 2012

whoami –

Never Worked for Oracle
Worked with Oracle DB Since 1982 (V2)
Working with Exadata since early 2010
Work for Enkitec (www.enkitec.com)
(Enkitec owns an Exadata Half Rack – V2/X2)
(Enkitec owns an Oracle Big Data Appliance)
Exadata Book (recently translated to Chinese)

Blog: kerryosborne.oracle-guy.com
Twitter: @KerryOracleGuy

Hadoop Aficionado

What’s the Point?

Data Volumes are Increasing Rapidly
Cost of Processing / Storing is High
Scalability is Big Concern

And …

Hadoop Is A Virus

* Stolen from Orbitz

Google Trends

Google Trends

Google Trends

Disjointed Presentation ???

Architectures
Integration Approaches
Oracle Products
Exadoop Case Study

Traditional RDBMS Architecture

[Diagram labels: DB Server, Compute, work, Plumbing, Storage]

Traditional Oracle Architecture

[Diagram labels: RAC, Cache (SGA), work, workers, dbwr lgwr etc…, Block Mapper (ASM), Storage]

HDFS/Hadoop Architecture

[Diagram labels: Name Node, Job Tracker, work, HA ?, and two nodes each showing tasktracker, workers, datanode, Storage]

HDFS/Hadoop Architecture

[Diagram labels: Block Mapper (namenode), Job Tracker, work, HA ?, and two nodes each showing tasktracker, workers, datanode, Storage]
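The "Block Mapper (namenode)" label can be pictured with a toy model: the namenode holds only the map from file blocks to datanodes, while the datanodes hold the data. The sketch below is illustrative only. The block size, the replication factor, the node names, and the round-robin placement are all assumptions that do not come from the slides, and real HDFS placement is rack-aware rather than round-robin.

```python
BLOCK_SIZE = 64 * 1024 * 1024  # assumed block size (64 MB), not stated on the slide
REPLICATION = 3                # assumed replication factor, not stated on the slide


def map_blocks(file_size, datanodes, block_size=BLOCK_SIZE, replication=REPLICATION):
    """Return {block_index: [datanode, ...]} for a file of file_size bytes."""
    if replication > len(datanodes):
        raise ValueError("need at least as many datanodes as replicas")
    n_blocks = -(-file_size // block_size)  # ceiling division
    block_map = {}
    for b in range(n_blocks):
        # Rotate the starting node so blocks spread across the cluster.
        # This round-robin rule is a stand-in for the real placement policy.
        block_map[b] = [datanodes[(b + r) % len(datanodes)] for r in range(replication)]
    return block_map


# Hypothetical four-node cluster and a 200 MB file.
nodes = ["dn1", "dn2", "dn3", "dn4"]
layout = map_blocks(200 * 1024 * 1024, nodes)
for block, replicas in layout.items():
    print(block, replicas)
```

The point of the analogy with ASM is that both components answer "where does this block live?" and stay out of the data path themselves.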

Exadata Architecture

[Diagram labels: RAC, Block Mapper (ASM), Cache, work, workers, and two Storage Nodes each showing workers, Storage]

HDFS/Hadoop Architecture

[Diagram labels: Block Mapper (namenode), Job Tracker, work, HA ?, and two nodes each showing tasktracker, workers, datanode, Storage]

Oracle + Hadoop Integration

Obligatory Marketing Slide

Oracle Big Data Appliance

Prebuilt Hadoop Stack in a Rack
Engineered System
Open Source Software
Includes Cloudera Distribution

Oracle Big Data Appliance

BDA Software

Top Secret Feature of BDA

Integration Options

Many Ways to Skin the Cat

•  Fuse
•  Sqoop
•  Oracle Big Data Connectors

Fuse – External Tables

Sqoop (SQL-to-Hadoop)

•  Graduated from Incubator Status in March 2012
•  Slower (no direct path?)
•  Quest has a plug-in (oraoop)
•  Bi-Directional
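As a rough illustration of the "Bi-Directional" point, the sketch below builds the command line for a Sqoop import (Oracle to HDFS) and a Sqoop export (HDFS to Oracle). The flags used are standard Sqoop options. The host, service name, user, table names, and HDFS paths are hypothetical placeholders, not values from the presentation.

```python
import shlex


def sqoop_command(direction, jdbc_url, username, table, hdfs_dir, mappers=4):
    """Build the argv for a Sqoop import (Oracle -> HDFS) or export (HDFS -> Oracle)."""
    if direction == "import":
        dir_flag = "--target-dir"   # HDFS directory Sqoop writes to
    elif direction == "export":
        dir_flag = "--export-dir"   # HDFS directory Sqoop reads from
    else:
        raise ValueError("direction must be 'import' or 'export'")
    return [
        "sqoop", direction,
        "--connect", jdbc_url,
        "--username", username,
        "-P",                       # prompt for the password instead of embedding it
        "--table", table,
        dir_flag, hdfs_dir,
        "--num-mappers", str(mappers),
    ]


# Hypothetical connection details, for illustration only.
url = "jdbc:oracle:thin:@//dbhost.example.com:1521/ORCL"
print(shlex.join(sqoop_command("import", url, "SCOTT", "CDR", "/data/cdr")))
print(shlex.join(sqoop_command("export", url, "SCOTT", "CDR_SUMMARY", "/data/cdr_summary")))
```

Returning a list rather than a single string keeps the arguments safe to hand to `subprocess.run` without shell quoting problems.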

Oracle Big Data Connectors

Oracle Loader for Hadoop - OLH

Oracle Direct Connector for HDFS - ODCH

Oracle R Connector for Hadoop – ORCH

Oracle Data Integrator Application Adapter for Hadoop

Note:

All Connectors are One Way

Oracle Data Integrator Application Adapter for Hadoop

ODIAAH ?

Oracle R Connector for Hadoop (ORCH)

•  Provides ability to pull data from Oracle RDBMS
•  Provides ability to pull data from HDFS
•  Provides access to local file system
•  Not really a loader tool
•  Most useful for analysts

Oracle Loader for Hadoop (OLH)

•  Implemented as a MapReduce job (oraloader.jar)
•  Saves CPU on DB Server
•  Can convert to Oracle datatypes
•  Can partition data and optionally sort it
•  Online – direct into Oracle tables
   •  Can load into Oracle via JDBC or OCI Direct Path
•  Offline – generate preprocessed files in HDFS (DP format)

Oracle Direct Connector for HDFS (ODCH)

•  Uses External Tables
•  Fastest - 12 TB per hour
•  Can load DP files preprocessed by OLH
•  Allows Oracle SQL to query HDFS data
•  Doesn’t require loading into Oracle
•  Downside – uses DB CPUs
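To put the 12 TB per hour figure in perspective, the conversion below is a back-of-the-envelope sketch. It assumes decimal terabytes and a sustained rate, neither of which the slide states, and the 30 TB data set is purely hypothetical.

```python
TB = 10**12  # decimal terabyte in bytes (assumption; the slide does not say)
GB = 10**9


def bytes_per_second(tb_per_hour):
    """Convert a load rate in TB/hour to bytes/second."""
    return tb_per_hour * TB / 3600


def hours_to_load(data_tb, tb_per_hour=12):
    """Hours needed to move data_tb terabytes at a sustained tb_per_hour."""
    return data_tb / tb_per_hour


rate = bytes_per_second(12)
print(f"{rate / GB:.2f} GB/s")                        # 3.33 GB/s
print(f"{hours_to_load(30):.1f} hours for 30 TB")     # 2.5 hours for 30 TB
```

In other words, the quoted rate works out to roughly 3.3 GB/s flowing through the database servers, which fits the slide's caveat that this path uses DB CPU.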

Exadoop

* Mad Scientist Project

Exadoop

Unusual Situation!
Exadata Half Rack with 4 Spare Storage Servers
Company Playing with “Big Data” Technology
Exadata Cells Very Similar to BDA Servers
4 Cells ≈ Mini BDA! (happy face)

Exadoop Layout

[Diagram labels]
Exa Half Rack:
- 4 Compute Nodes
- 7 Storage Nodes (252TB)
Exadoop:
- Exa Compute Nodes
- Exa Storage Nodes (108TB raw)
- Hadoop Cluster (144TB raw)
Big Fat Pipe
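The capacities on the layout slide line up with the "4 spare storage servers" mentioned earlier, as the quick check below shows. The per-cell figure is derived from the slide's numbers. The replication factor of 3 is an assumption (it is the usual HDFS default, but the slides do not state what was used), so the usable-capacity line is only indicative.

```python
HALF_RACK_CELLS = 7        # storage nodes in the half rack (from the slide)
HALF_RACK_RAW_TB = 252     # raw capacity of those 7 cells (from the slide)
EXA_STORAGE_RAW_TB = 108   # raw capacity left to Exadata (from the slide)
HADOOP_RAW_TB = 144        # raw capacity given to Hadoop (from the slide)
HDFS_REPLICATION = 3       # assumption: not stated in the presentation

tb_per_cell = HALF_RACK_RAW_TB // HALF_RACK_CELLS    # 36 TB raw per cell
exa_cells = EXA_STORAGE_RAW_TB // tb_per_cell        # 3 cells stay with Exadata
hadoop_cells = HADOOP_RAW_TB // tb_per_cell          # 4 cells become the Hadoop cluster
hadoop_usable_tb = HADOOP_RAW_TB / HDFS_REPLICATION  # 48 TB if every block is stored 3 times

print(f"{tb_per_cell} TB raw per cell")
print(f"{exa_cells} Exadata cells + {hadoop_cells} Hadoop cells = {exa_cells + hadoop_cells}")
print(f"about {hadoop_usable_tb:.0f} TB usable in HDFS at replication {HDFS_REPLICATION}")
```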

Exadoop Applications

Telecom Company
Call Detail Records Dumped by Switches
Loaded into HDFS via Flume

Exadoop – Proposed Architecture

[Diagram. Legend: Exa Compute, Exa Storage, Hadoop Cluster. Components: SIP Server, Packet Sniffer, Flume Agent, CDR, HDFS, HBase, Error Codes, Apex App, Java App]

Exadoop Applications

Wrap Up

Is Hadoop the right tool for the job?

Maybe

All the Cool Kids Are Doing It!

Questions?

Contact Information: Kerry Osborne

[email protected]
kerryosborne.oracle-guy.com

www.enkitec.com