data is the constraint

113
Data is the Constraint Blog: kylehailey.com 20 years working w/ Oracle Oracle Ace Oaktable Conferences Training days Tech journal articles Sales meetings

Upload: kyle-hailey

Post on 27-Jan-2015

114 views

Category:

Technology


2 download

DESCRIPTION

 

TRANSCRIPT

Page 1: Data is the Constraint

Data is the ConstraintBlog: kylehailey.com• 20 years working w/ Oracle• Oracle Ace• Oaktable• Conferences• Training days• Tech journal articles• Sales meetings

Page 2: Data is the Constraint

Main blogs

• Personal blog : http://kylehailey.com

• Linkedin www.linkedin.com/in/kylehailey

• Twitter @kylehhailey• Facebook

facebook.com/kyle.hailey.73

Page 3: Data is the Constraint

Top Blog Posts

• http://www.kylehailey.com/delphix/– What is Delphix – Benefits in Brief – High Performance Delphix– Jonathan Lewis Explains Delphix– Delphix vs Netapp and EM 12c

Page 4: Data is the Constraint

Data is the constraint

I. Data Tax strains infrastructure

II. Data Tax price is hugeIII. Companies unaware of Data

Tax

Three points

Page 5: Data is the Constraint

I. Data Taxes your business

If you can’t satisfy the business demands then your process is broken.

Page 6: Data is the Constraint

II. Data Tax hits the bottom line hard

Page 7: Data is the Constraint

III. Companies don’t think there is a Data Tax problem

Page 8: Data is the Constraint

I. Data Tax

– Moving data is hard

– Triple tax

– Data Floods infrastructure

Page 9: Data is the Constraint

I. Data Tax : moving data is hard

– Storage & Systems : capital resources– Personnel : operation expenditure– Time : delayed projects

Page 10: Data is the Constraint

I. Data Tax : Typical Architecture

Production Dev, QA, UAT

Instance InstanceInstance Instance

Database

File systemFile system

Database

File systemFile system

InstanceInstance

Database

File systemFile system

InstanceInstance

Database

File systemFile system

Reporting Backup

Instance Instance

Database

File system

File systemFile systemFile systemFile system

Instance

Page 11: Data is the Constraint

I. Data Tax: triple data tax

Production

Development

Business Intelligence

Backup

QA

UAT

Tape

Sandbox

Forensics

Analytics

Reporting

Recovery

Page 12: Data is the Constraint

I. Data Tax : Copying data floods infrastructure

Page 13: Data is the Constraint

I. Data Tax : Data Tax floods company infrastructure

92% of the cost of business — the financial services business —

is “data” www.wsta.org/resources/industry-articles

Most companies have 2-9% IT spending http://uclue.com/?xq=1133

Data management is the largest portion of IT expense

Page 14: Data is the Constraint

Part II. Data Tax is Huge

Page 15: Data is the Constraint

Part II. Data Tax is Huge

• Four Areas data tax hits

1. IT Capital resources 2. IT Operations personnel 3. Application Development ***4. Business

• How big is the data tax?– Measure before and after installing

Delphix

Page 16: Data is the Constraint

II. Data Tax is huge : 1. IT Capital

• Hardware– Servers– Storage– Network– Data center floor space, power, cooling

• Example– Some customers have over 1 Petabyte

duplicate data– (1000 TB, ie 1,000,000 GB )

Page 17: Data is the Constraint

II. Data Tax is huge : 2. IT Operations

• Involves many people– DBAs– SYS Admin– Storage Admin– Backup Admin – Network Admin

• 1000s of hours annually just for DBAs – not including all the other personnel required to supply the

infrastructure necessary

• Data center efforts costly and difficult– Consolidation– Migrations– Move to cloud

Page 18: Data is the Constraint

II. Data Tax is Huge : 3. App Dev quality and speed

• Inefficient QA: Higher costs of QA• QA Delays : Greater re-work of code• Sharing DB Environments :

Bottlenecks• Using DB Subsets: More bugs in Prod• Slow Environment Builds: Delays

“if you can't measure it you can’t manage it”

Five examples of application Data Tax impact

Page 19: Data is the Constraint

II. Data Tax is Huge : 3. App Dev

Build Time

QA Test

96% of QA time was building environment$.04/$1.00 actual testing vs. setup

Build

Inefficient QA : Long Build times

Page 20: Data is the Constraint

II. Data Tax is Huge : 3. App Dev

Build QA Env QA Build QA Env QA

Sprint 1 Sprint 2 Sprint 3

Bug CodeX

1 2 3 4 5 6 70

10203040506070

Delay in Fixing the bug

Cost ToCorrect

Software Engineering Economics – Barry Boehm (1981)

QA Delays: bugs found late = more code re-work

Page 21: Data is the Constraint

II. Data Tax is Huge : 3. App Dev full copies cause bottlenecks

Frustration Waiting

Old Unrepresentative Data

Page 22: Data is the Constraint

II. Data Tax is Huge : 3. App Dev subsets cause bugs

Page 23: Data is the Constraint

Production

II. Data Tax is Huge : 3. App Dev subsets cause bugs

Classic problem is that queries that run fast on subsets hit the wall in production.

Developers are unable to test against all data

The Production ‘Wall’

Page 24: Data is the Constraint

II. Data Tax is Huge : 3. App Dev Slow Environment Builds:

Developer Asks for DB Get Access

Manager approves

DBA Request system

Setup DB

System Admin

Requeststorage

Setup machine

Storage Admin

Allocate storage (take snapshot)

3-6 Months to Deliver Data

Page 25: Data is the Constraint

II. Data Tax is Huge : 3. App Dev Slow Environment Builds Why are hand offs so expensive?

Page 26: Data is the Constraint

II. Data Tax is Huge : 3. App Dev Slow Environment Builds: culture of no

DBA Developer

Page 27: Data is the Constraint

II. Data Tax is Huge : 3. App DevSlow Environment Builds

Never enough environments

Page 28: Data is the Constraint

II. Data Tax is Huge : 3. App Dev What We’ve Seen

• Inefficient QA: Higher costs• QA Delays : Increased re-work• Sharing DB : Bottlenecks• Subset DB : Bugs• Slow Environment Builds: Delays

Five examples of application Data Tax impact

Page 29: Data is the Constraint

II. Data Tax is Huge : 4. Business

Ability to capture revenue

• BusinessIntelligence – ETLs or data warehouse refreshes

• Old data = less intelligence• Less Intelligence = Missed

Revenue

• Business Applications – Delays on getting applications that

generate the revenue

Page 30: Data is the Constraint

II. Data Tax is Huge : 4. Business Intelligence

1pm 10pm 8am noon20112012201320142015

Page 31: Data is the Constraint

II. Data Tax is Huge : 4. Business

Page 32: Data is the Constraint

II. Data Tax is Huge : 4. Business

Magnitude of business impact

• $27 Billion Revenue• $1 Billion IT spend

• $850M development staff• $110M IT Ops• $ 40M storage

Storage

IT Ops

Dev

Revenue

0 5000 10000 15000 20000 25000 30000

Page 33: Data is the Constraint

II. Data Tax is Huge : 4. Business

Storage

IT Ops

Dev

Revenue

0 5000 10000 15000 20000 25000 30000

More obvious

More important

Page 34: Data is the Constraint

II. Data Tax is Huge : How Big is the Data tax?

Measure before and after Delphix w/ Fortune 500 :

Median App Dev throughput increase by 2x

Page 35: Data is the Constraint

II. Data Tax is Huge : How big is the data tax ?• 10 x Faster Financial Close

– Facebook 21 days down to 2

• 9x Faster BI refreshes– Bank of West from 3 weeks per refresh to 3x a week – Weekly refreshes down to 3 times a day

• 2x faster Projects – Informatica– KLA-Tencor (5 x increase)– Presbyterian Health– NY Life

• 20 % less bugs– Stubhub

Page 36: Data is the Constraint

II. Data Tax is Huge : Public Customer Quotes

• "Delphix allowed us to shrink our project schedule from 12 months to 6 months.”– BA Scott, NYL VP App Dev

• "It used to take 50-some-odd days to develop an insurance product, … Now we can get a product to the customer in about 23 days.”– Presbyterian Health

• “Can't imagine working without Delphix”– Ramesh Shrinivasan CA Department of General Services

Page 37: Data is the Constraint

Part III. Companies unaware of the Data Tax

Page 38: Data is the Constraint

III. Companies unaware of the Data Tax

Why do I need an iPhone ?

Isn’t there technology that does that? Why do I need Delphix?

Page 39: Data is the Constraint

III. Companies unaware of the Data TaxNobody does what we do

• NetBackup• Flashback• DataGuard• GoldenGate • SnapClone• FlexClone

=

Page 40: Data is the Constraint

Three Core Parts

Production

File System

Instance

DevelopmentStorage

21 3

Copy Sync SnapshotsTime FlowPurge

Clone (snapshot)CompressShare CacheStorage Agnostic

Mount, recover, renameSelf Service, Roles & Security Rollback & Refresh Branch & Tag

Instance

Page 41: Data is the Constraint

Snap Manager

SnapManagerRepository

Protection Manager

Snap Drive

Snap Manager

Snap Mirror

Flex Clone

RMANRepository

Production

Development

DBA

Storage Admin

1 tr-3761.pdf

Netapp

Netapp Frankenstein

Page 42: Data is the Constraint

Oracle EM 12c Snap Clone3

EM 12c

Agents instance

• Register Netapp or ZFS with Storage Credentials• Install agents on a LINUX machine to manage the Netapp or ZFS storage. • Register test master database• Enable Snap Clone for the test master database• Set up a zone – set max CPU and Memory and the roles that can see these zones• Set up a pool – a pool is a set of machines where databases can be provisioned• Set up a profile – a source database that can be used for thin cloning• Set up a service template – init.ora values

Test Master

Instance

Source

? instance

CloneLinux

ZFS orNetApp

PoolProfile

ZoneTemplate

Oracle Frankenstein

Page 43: Data is the Constraint

EM 12c: Snap Clone

Production Development

Flexclone Flexclone

Netapp Snap Manager for Oracle

Other technology?• Prove it• Bake off• Customer refs

Page 44: Data is the Constraint

III. Companies unaware of the Data Tax

#1 Biggest Enemy IT departments believe– have best processes that exist– have latest and greatest technology– nothing better exists – Just the way it is– Data management is a drop in the bucket of

overall issues

“The status quo is pre-ordained failure”

Page 45: Data is the Constraint

The Phoenix Project

• “Any improvement not made at the constraint is an illusion.”• What is the constraint ?

• The DBAs and environments. • “One of the most powerful things that IT can do is get

environments to development and QA when they need it”

• IT bottlenecks• Setting Priorities• Company Goals• Defining Metrics• Fast Iterations

IT version of

“The Goal”

by E. Goldratt

Page 46: Data is the Constraint

III. Companies unaware of the Data Tax

• Ask Questions– Delphix: we provision environments in

minutes for almost not extra storage.– Customer: We already do that – Delphix: How long does it take a

developer to get an environment after they ask ?

– Customer: 2-3 weeks– Delphix: we do it in 2-3 minutes

No other product does what we do

Page 47: Data is the Constraint

III. Companies unaware of the Data Tax

• How to enlighten companies ? Ask for metrics– Batch window size for ETL – How new (old) is their BI data?– Number: app projects per year– How long does it take a developer to get a DB

copy?– How long does it take QA to setup an environment

• How long to rollback• How long to refresh• How many time do they run a QA cycle

– How old is data in QA and DEV

Page 48: Data is the Constraint

Summary

I. Companies pay a Data TaxII. Data Tax is hugeIII. Companies unaware of the

Data Tax

Page 49: Data is the Constraint

Life with Delphix

Page 50: Data is the Constraint

Typical Architecture

Production Dev, QA, UAT

Instance InstanceInstance Instance

Database

File systemFile system

Database

File systemFile system

InstanceInstance

Database

File systemFile system

InstanceInstance

Database

File systemFile system

Reporting Backup

Instance Instance

Database

File system

File systemFile systemFile systemFile system

Instance

Page 51: Data is the Constraint

With Delphix

Database

Production

Instance

Database

Dev & QA

Instance

Database

Reporting

Instance

Database

Backup

Instance Instance Instance

File system

Database

InstanceInstance

Database

InstanceInstance

Page 52: Data is the Constraint

DevOps With Delphix

1. Efficient QA: Low cost, high utilization2. Quick QA : Fast Bug Fix3. Every Dev gets DB: Parallelized Dev4. Full DB : Less Bugs5. Fast Builds: Culture of Yes

Page 53: Data is the Constraint

Impact on bottom line

• IT Capital expense– 90 % less storage– 30 % less licenses, floor space, servers , power

• IT Operational & Personnel– 1000s of hours down to 10

• Application Development– 2x project output at higher quality

• Business – 9x more Fresh BI data– 2x Faster time to market and higher quality– Increase revenue

Page 54: Data is the Constraint

Summary

• Data tax IS the big gorilla. • Delphix Agile data is small & fast• With Delphix , Deliver projects

– Half the Time– Higher Quality– Increase Revenue

Page 55: Data is the Constraint

kylehailey.com/delphix

Use Cases

What is Delphix

Competition

Page 56: Data is the Constraint

What is Delphix

Page 57: Data is the Constraint

Three Physical CopiesThree Virtual Copies

Delphix

Page 58: Data is the Constraint

Install Delphix on x86 hardware

Intel hardware

Page 59: Data is the Constraint

Allocate Any Storage to Delphix

Allocate StorageAny type

Pure Storage + DelphixBetter Performance for 1/10 the cost

Page 60: Data is the Constraint

One time backup of source database

Database

Production

File system

File systemUpcoming

Supports

InstanceInstanceInstance

Page 61: Data is the Constraint

DxFS (Delphix) Compress Data

Database

Production

Data is compressed typically 1/3 size

File system

InstanceInstanceInstance

Page 62: Data is the Constraint

Incremental forever change collection

Database

Production

File system

Changes

• Collected incrementally forever• Old data purged

File system Time Window

Production

InstanceInstanceInstance

Page 63: Data is the Constraint

Source Full Copy Source backup from SCN 1

Page 64: Data is the Constraint

Snapshot 1Snapshot 2

Page 65: Data is the Constraint

Snapshot 1Snapshot 2

Backup from SCN

Page 66: Data is the Constraint

Snapshot 1Snapshot 2

Snapshot 3

Page 67: Data is the Constraint

DropSnapshot 1

Snapshot 2Snapshot 3

Page 68: Data is the Constraint

Cloning

Database

Production

Instance

File systemFile system Time Window

Database

InstanceInstance

InstanceInstance

Page 69: Data is the Constraint

Typical Architecture

Production Development QA UAT

Instance Instance Instance InstanceInstance Instance Instance Instance

Database

File systemFile system

Database

File systemFile system

Database

File system

Database

File systemFile systemFile systemFile system

Page 70: Data is the Constraint

With Delphix

Database

Production

Instance

Database

Development

Instance

Database

QA

Instance

Database

UAT

InstanceInstance Instance Instance Instance

File system

Page 71: Data is the Constraint

Delphix Use Cases

1. Fast, Fresh, Full2. Free3. Branching4. Federated5. Self Serve

Page 72: Data is the Constraint

Fast, Fresh, Full

Instance

Time Window

Instance

Development VDB

Source

Page 73: Data is the Constraint

Free

gif by Steve Karam

Instance

Instance

Instance

Instance

Source

Page 74: Data is the Constraint

Branching

Instance Instance

Instance

Source

Source Dev

QA branched from Dev

checkout

branching

bookmark

Page 75: Data is the Constraint

Federated Cloning

Page 76: Data is the Constraint
Page 77: Data is the Constraint

Federated

Instance

Instance

Instance

Instance

Source1

Source2

Source1

Page 78: Data is the Constraint

“I looked like a hero”Tony Young, CIO Informatica

Page 79: Data is the Constraint

Self Service

Page 80: Data is the Constraint

Use Cases

1. Development Acceleration

2. Quality

3. BI

Page 81: Data is the Constraint

DevOps

Page 82: Data is the Constraint

DevOps With Delphix

1. Efficient QA: Low cost, high utilization2. Quick QA : Fast Bug Fix3. Every Dev gets DB: Parallelized Dev4. Full DB : Less Bugs5. Fast Builds: Culture of Yes

Page 83: Data is the Constraint

1. Efficient QA: Lower cost

Build Time

QA Test

1% of QA time was building environment$.99/$1.00 actual testing vs. setup

Build Time

QA Test

Build

Page 84: Data is the Constraint

Rapid QA via Branching

Page 85: Data is the Constraint

2. QA Immediate: bugs found fast and fixed

Sprint 1 Sprint 2 Sprint 3

Bug CodeX

QA QA

Build QA Env QA Build QA Env Q

A

Sprint 1 Sprint 2 Sprint 3

Bug CodeX

Page 86: Data is the Constraint

3. Private Copies: Parallelize

gif by Steve Karam

Page 87: Data is the Constraint

4. Full Size DB : Eliminate bugs

Production

Page 88: Data is the Constraint

5. Self Service: Fast, Efficient. Culture of Yes!

Page 89: Data is the Constraint

Quality

1. Forensics2. Testing 3. Recovery

Page 90: Data is the Constraint

1. Forensics: Investigate Production Bugs

Instance

Time Window

Instance

Development

Anomaly on ProdPossible code bugAt noon yesterday

Spin up VDB of Prod as it was during anomaly

Page 91: Data is the Constraint

2. Testing : Rewind for patch and QA testing

Instance

Time Window

Instance

Development

Time Window

Prod

Page 92: Data is the Constraint

2. Testing: A/B

Instance

Time Window

Instance

Instance

• Keep tests for compare• Production vs Virtual

– invisible index on Prod– Creating index on virtual

• Flashback vs Virtual

Test A with Index 1

Test B with Index 2

Page 93: Data is the Constraint

3. Recovery: Surgical recover of Production

Instance Instance

Development

Time Window

Spin VDB up Before drop

Problem on ProdDropped Table Accidently

Source

Page 94: Data is the Constraint

Time Window

3. Recovery Surgical or Full Recovery on VDB

Instance

Instance

Dev1 VDB

Time Window

Dev1 VDB

InstanceSource

Source

Dev2 VDB Branched

Page 95: Data is the Constraint

3. Recovery: Virtual to Physical

Instance Instance

VDB

Source

Time Window

Spin VDB up Before drop

Corruption

Page 96: Data is the Constraint

3. Recovery

Page 97: Data is the Constraint

Business Intelligence

Page 98: Data is the Constraint

ETL and Refresh Windows

1pm 10pm 8am noon

Page 99: Data is the Constraint

ETL and DW refreshes taking longer

1pm 10pm 8am noon20112012201320142015

Page 100: Data is the Constraint

ETL and Refresh Windows

20112012201320142015

1pm 10pm 8am noon

10pm 8am noon 9pm

6am 8am 10pm

Page 101: Data is the Constraint

ETL and DW Refreshes

Instance

Prod

Instance

DW & BI

Data Guard – requires full refresh if usedActive Data Guard – read only, most reports don’t work

Page 102: Data is the Constraint

Fast Refreshes

• Collect only Changes• Refresh in minutes

Instance Instance

DW

Prod

Instance

BI

ETL24x7

Page 103: Data is the Constraint

Temporal Data

Page 104: Data is the Constraint

Oracle 12c

Page 105: Data is the Constraint

80MB buffer cache ?

Page 106: Data is the Constraint

200GBCache

Page 107: Data is the Constraint

5000

Tnxs

/ m

inLa

tenc

y

300 ms

1 5 10 20 30 60 100 200

with

1 5 10 20 30 60 100 200Users

Page 108: Data is the Constraint

8000

Tnxs

/ m

inLa

tenc

y

600 ms

1 5 10 20 30 60 100 200Users

1 5 10 20 30 60 100 200

Page 109: Data is the Constraint

$1,000,000

$6,000

Page 110: Data is the Constraint

Data Center Migration : clone migration

5x Source Data Copy < 1x Source Data Copy

Page 111: Data is the Constraint

Data Center Migration : clone migration + source

5x Source Data Copy < 2 x Source Data Copy

S S

Page 112: Data is the Constraint

Data Center Migration : clone migration + source

5x Source Data Copy < 1 x Source Data Copy

S SC C C C V V V V

Page 113: Data is the Constraint

Consolidation

Active Active IdleActive Active

Idle

Without Delphix With Delphix