evolution of dedupe


Upload: rammotive

Post on 07-Dec-2014

Page 1: Evolution Of Dedupe

Evolution of Data Deduplication

(c) Druva Software 2010 February 11

Page 2: Evolution Of Dedupe

Druva inSync – Overview and Advantage

What is Deduplication?

• Specialized data compression technique

• Eliminates coarse-grained redundant data


• Improves storage (and bandwidth) utilization.

Page 3: Evolution Of Dedupe


Why is it so Important?


[Chart: PC Data Vs Available Bandwidth*, 2000 to 2010. Series: Data (MB), Bandwidth (KB/Sec)]

• Data doubling every 18 months

• Bandwidth has grown only 10X in the last 10 years

• Data duplication

▫ 80% across PCs

▫ 45% across servers
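The gap between the two growth rates can be worked out with a few lines of arithmetic. This is only a sketch using the slide's own figures (doubling every 18 months, 10X bandwidth over 10 years); the variable names are illustrative.

```python
# Compare data growth and bandwidth growth over the same 10-year window.
MONTHS = 10 * 12
DOUBLING_PERIOD_MONTHS = 18          # "data doubling every 18 months"

data_growth = 2 ** (MONTHS / DOUBLING_PERIOD_MONTHS)   # roughly 100X
bandwidth_growth = 10                                   # "10X" from the slide

gap = data_growth / bandwidth_growth
print(f"data ~{data_growth:.0f}X, bandwidth {bandwidth_growth}X, gap ~{gap:.0f}X")
```

Under these assumptions data outgrows bandwidth by about an order of magnitude per decade, which is the motivation for sending less data rather than waiting for faster links.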

Page 4: Evolution Of Dedupe


Data Reduction at Target


• Duplicates removed at secondary storage to save space

Page 5: Evolution Of Dedupe


Synchronous/Inline Target Data Reduction


• Real-time (synchronous) duplicate removal

• Improved storage capacity management

• Slower than asynchronous deduplication
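A minimal sketch of inline target-side deduplication follows. It is a toy model, not any vendor's implementation: the class name `InlineDedupeStore`, the fixed 4-byte block size, and the use of SHA-256 as the block fingerprint are all assumptions made for illustration.

```python
import hashlib

class InlineDedupeStore:
    """Toy target-side store that removes duplicates while data is written."""

    def __init__(self, block_size=4096):
        self.block_size = block_size
        self.blocks = {}   # digest -> block bytes; each unique block kept once
        self.files = {}    # file name -> ordered list of digests

    def write(self, name, data):
        digests = []
        for i in range(0, len(data), self.block_size):
            block = data[i:i + self.block_size]
            digest = hashlib.sha256(block).hexdigest()
            # The index lookup sits on the write path: this is the inline cost.
            self.blocks.setdefault(digest, block)
            digests.append(digest)
        self.files[name] = digests

    def read(self, name):
        return b"".join(self.blocks[d] for d in self.files[name])

    def stored_bytes(self):
        return sum(len(b) for b in self.blocks.values())

store = InlineDedupeStore(block_size=4)
store.write("a.txt", b"AAAABBBBCCCC")
store.write("b.txt", b"AAAABBBBDDDD")
print(store.stored_bytes())  # 16 bytes kept for 24 bytes written
```

Because every write waits for hashing and an index lookup, the store never holds duplicates, but ingest is slower than writing first and deduplicating later.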

Page 6: Evolution Of Dedupe


Source Based Data Deduplication


• Agent-based deduplication

• Duplicates identified at source and removed from backup

• Saves bandwidth in addition to storage
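The bandwidth saving can be sketched as a two-step exchange: the agent sends block fingerprints first, and uploads only the blocks the server does not already hold. The names `BackupServer`, `missing`, `upload` and `agent_backup` are hypothetical, and the protocol is a simplification rather than a description of any product.

```python
import hashlib

def split_blocks(data, block_size=4):
    return [data[i:i + block_size] for i in range(0, len(data), block_size)]

class BackupServer:
    """Hypothetical server: holds unique blocks keyed by digest."""
    def __init__(self):
        self.blocks = {}

    def missing(self, digests):
        return {d for d in digests if d not in self.blocks}

    def upload(self, digest, block):
        self.blocks[digest] = block

def agent_backup(server, data, block_size=4):
    """Back up `data`; return the number of payload bytes sent over the wire."""
    blocks = split_blocks(data, block_size)
    digests = [hashlib.sha256(b).hexdigest() for b in blocks]
    needed = server.missing(digests)      # only digests cross the network here
    sent = 0
    for digest, block in zip(digests, blocks):
        if digest in needed:
            server.upload(digest, block)
            needed.discard(digest)        # do not resend a repeat in this backup
            sent += len(block)
    return sent

server = BackupServer()
first = agent_backup(server, b"AAAABBBBCCCC")
second = agent_backup(server, b"AAAABBBBDDDD")
print(first, second)  # 12 4
```

The second backup sends only the one new block, which is where the bandwidth saving on top of the storage saving comes from.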

Page 7: Evolution Of Dedupe


Global Source Based Data Deduplication


• Duplicates compared against all sources

• Big leap in bandwidth and storage savings
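To illustrate why comparing against all sources helps, the sketch below counts how many blocks must be sent and stored when each source keeps its own index versus when one index is shared. The function `unique_blocks`, the source names and the sample data are made up for the example.

```python
import hashlib

def digests_of(data, block_size=4):
    return [hashlib.sha256(data[i:i + block_size]).hexdigest()
            for i in range(0, len(data), block_size)]

def unique_blocks(backups, global_index):
    """Count blocks that must be sent and stored for all sources together."""
    shared = set()
    total = 0
    for data in backups.values():
        # Per-source mode starts from an empty index for every source.
        index = shared if global_index else set()
        for digest in digests_of(data):
            if digest not in index:
                index.add(digest)
                total += 1
    return total

backups = {
    "laptop-1": b"AAAABBBBCCCC",
    "laptop-2": b"AAAABBBBDDDD",
    "laptop-3": b"AAAABBBBCCCC",
}
per_source = unique_blocks(backups, global_index=False)
global_count = unique_blocks(backups, global_index=True)
print(per_source, global_count)  # 9 4
```

With a shared index, a block already uploaded by one machine never has to be sent by another.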

Page 8: Evolution Of Dedupe


Granularity: Block Based Deduplication


• Granular block-based comparison = better deduplication

• Works well across simple applications and similar data streams

• Does not give very accurate results for complex apps, e.g. Outlook or Exchange
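One commonly cited weakness of fixed-size blocks is that an insertion shifts every later block boundary. The sketch below is an illustration of that effect, not a claim about how Outlook or Exchange lay out their data; `shared_fraction` and the 8-byte block size are assumptions.

```python
import hashlib

def fixed_block_digests(data, block_size=8):
    return {hashlib.sha256(data[i:i + block_size]).hexdigest()
            for i in range(0, len(data), block_size)}

def shared_fraction(old, new, block_size=8):
    """Fraction of the new version's blocks already present in the old one."""
    old_d = fixed_block_digests(old, block_size)
    new_d = fixed_block_digests(new, block_size)
    return len(new_d & old_d) / len(new_d)

original = bytes(range(64))             # 8 distinct blocks of 8 bytes
overwritten = b"\xff" + original[1:]    # one byte changed in place
inserted = b"\xff" + original           # one byte inserted at the front

print(shared_fraction(original, overwritten))  # 0.875
print(shared_fraction(original, inserted))     # 0.0
```

An in-place change costs one block, while a single inserted byte makes every block look new, even though almost all of the content is unchanged.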

Page 9: Evolution Of Dedupe


Application Aware Deduplication


• Deduplicate logical information within data sets

• Works across applications

• 35-50% better accuracy and storage/bandwidth savings
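As a rough illustration of deduplicating logical information rather than raw bytes, the sketch below parses two e-mail messages and fingerprints the decoded attachment instead of the encoded file. It uses standard MIME messages from Python's `email` package as a stand-in; it does not parse Outlook or Exchange formats, and the helper names `make_mail` and `attachment_digests` are hypothetical.

```python
import hashlib
from email import message_from_bytes
from email.message import EmailMessage

def make_mail(subject, attachment):
    """Build a sample message carrying one binary attachment."""
    msg = EmailMessage()
    msg["Subject"] = subject
    msg.set_content("See attached.")
    msg.add_attachment(attachment, maintype="application",
                       subtype="octet-stream", filename="report.bin")
    return msg.as_bytes()

def attachment_digests(raw_mail):
    """Parse the message and hash each decoded attachment payload."""
    msg = message_from_bytes(raw_mail)
    return [hashlib.sha256(part.get_payload(decode=True)).hexdigest()
            for part in msg.walk()
            if part.get_content_disposition() == "attachment"]

payload = bytes(range(256)) * 4
mail_a = make_mail("Q1 numbers", payload)
mail_b = make_mail("Fwd: Q1 numbers, please review", payload)

print(mail_a == mail_b)                                           # False
print(attachment_digests(mail_a) == attachment_digests(mail_b))   # True
```

The two raw messages differ in headers and framing, yet the attachment is recognised as the same object once the format is understood.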

Page 10: Evolution Of Dedupe


Granularity: Extent of deduplication

Fixed Blocks

• Good for single application environments

1:3X

Variable Length Blocks

• Block size determined by heuristics/rolling-checksum

• Good for multiple data streams

1:8X

App-Aware Block Length

• Block size determined by application data-structure

• Excellent for complex applications like Outlook, Exchange

• Delivers dedupe across applications

1:15X
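Variable-length blocks are usually produced by cutting the stream wherever a rolling checksum over the last few bytes matches a pattern, so boundaries follow the content instead of the byte offset. The sketch below uses a plain rolling sum for simplicity; the window size, mask and minimum chunk size are arbitrary choices, and real systems typically use stronger rolling hashes.

```python
import hashlib
import random

def variable_chunks(data, window=16, mask=0x3F, min_size=16):
    """Cut where the rolling sum of the last `window` bytes matches `mask`."""
    chunks, start, rolling = [], 0, 0
    for i, byte in enumerate(data):
        rolling += byte
        if i >= window:
            rolling -= data[i - window]      # slide the window forward
        if i + 1 - start >= min_size and (rolling & mask) == mask:
            chunks.append(data[start:i + 1])
            start = i + 1
    if start < len(data):
        chunks.append(data[start:])          # trailing partial chunk
    return chunks

def shared_fraction(old, new):
    """Fraction of the new version's chunks already present in the old one."""
    old_d = {hashlib.sha256(c).digest() for c in variable_chunks(old)}
    new_d = [hashlib.sha256(c).digest() for c in variable_chunks(new)]
    return sum(d in old_d for d in new_d) / len(new_d)

rng = random.Random(0)
original = bytes(rng.getrandbits(8) for _ in range(20000))
inserted = b"\xff" + original                # one byte inserted at the front

print(f"chunks found again after the insert: {shared_fraction(original, inserted):.0%}")
```

Because the cut points depend on nearby bytes, the chunking is expected to fall back into step shortly after the inserted byte, so most chunks are found again.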

Page 11: Evolution Of Dedupe


Druva: App-Aware Deduplication

• Source Based

• Inline

• Global


• App-Aware block sizes

• Supported applications –

▫ MS Outlook 2003/2007/2010

▫ MS Office 2003/2007/2010

▫ PDFs

▫ JPEGs

Page 12: Evolution Of Dedupe


Druva: Blackbird Storage Engine

• Near CDP

▫ All incremental backups

▫ Instant search based restores

• High Performance

▫ Distributed Caching

▫ Hyper-Cache (coming soon)

▫ SSD Support

• Scalable

▫ Based on embedded Oracle DB

▫ 16TB, 200 parallel backups

• Simple

▫ Software only solution

▫ Simple 20 Mins deployment

▫ Zero Maintenance

[Diagram: Blackbird storage engine. Components shown: CDP + Dedupe File-system, (in memory) Hyper Cache, Oracle DB for meta-data, Disk, SSD]

Page 13: Evolution Of Dedupe


How Does Druva Compare to Others?

[Diagram: vendors grouped by deduplication capability. Capability labels: Source Based Dedupe, Global Deduplication, Inline, Sub-File, App-Aware. Vendor groups shown:]

• EMC, Acronis, IronMountain, Druva, Veritas

• Druva, Veritas, EMC, CA, CommVault, IronMountain, Acronis, Atempo

• EMC, IronMountain, Druva, Veritas

• EMC Avamar, Druva inSync

• Druva inSync

Page 14: Evolution Of Dedupe


Why Druva

• A Fresh Approach Towards Backup

▫ Unique Cutting-Edge Technology

▫ Simplified Management

▫ Enterprise Grade Support

▫ Affordable Solutions

• 600 Customers across 26 Countries

“Everything is there. I checked already… I've been in this game a long time and I can honestly say that I have never seen something so simple.”

This, frankly, is brilliant. :-) Cheers!

Christian R., Bechtel Corporation