hydrastor: a scalable secondary storage · 2010-09-08 · hydrastor: a scalable secondary storage....

78
March 2017R15 1.877.630.SEON www.seon.com Document Part Number: 700-0137 Rev. 15 vMax View Install and User Guide Software version 4.5

Upload: others

Post on 20-Mar-2020

4 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: HYDRAstor: a Scalable Secondary Storage · 2010-09-08 · HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 3 Backup storage Tapes are most common, despite: Sensitive environment

HYDRAstor:a Scalable Secondary Storage

7th TF-Storage Meeting

September 9th 2010

Łukasz Heldt

Page 2: HYDRAstor: a Scalable Secondary Storage · 2010-09-08 · HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 3 Backup storage Tapes are most common, despite: Sensitive environment

Largest Japanese IT company$43 Billion in annual revenue143,000 staffwww.nec.com

Polish R&D company50 engineers and scientistswww.9livesdata.com

Scalable disk based storage for backup with global deduplicationStarted in 2003 in NEC Labs by Cezary Dubnicki2007 Product of the year award by SearchStorage.com2008 Product innovation award by Network Products Guide2009/2010 FAST conference publication in San JoseSold in US and Japan since 2007Will be sold in Poland in 2011 by 9LivesData in coop. with NEC

R&D of criticalbackend component

Owns & sells

Page 3: HYDRAstor: a Scalable Secondary Storage · 2010-09-08 · HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 3 Backup storage Tapes are most common, despite: Sensitive environment

HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 3

Backup storage

● Tapes are most common, despite:● Sensitive environment requirements● Unreliable restore● Low performance● Manual labor or expensive robots● Problematic replication

Page 4: HYDRAstor: a Scalable Secondary Storage · 2010-09-08 · HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 3 Backup storage Tapes are most common, despite: Sensitive environment

HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 4

Backup storage size

● Usual backup policy● 4-12+ full backups● 7-30+ incremental● Majority of data does

not change● Data compression 2:1

● Secondary storage size:● 5x-20x more than

primary storage● Includes many copies

of the same data● Each data chunk

stored 5-10+ times

Page 5: HYDRAstor: a Scalable Secondary Storage · 2010-09-08 · HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 3 Backup storage Tapes are most common, despite: Sensitive environment

HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 5

Backup storage size

● Usual backup policy● 4-12+ full backups● 7-30+ incremental● Majority of data does

not change● Data compression 2:1

● Secondary storage size:● 5x-20x more than

primary storage● Includes many copies

of the same data● Each data chunk

stored 5-10+ times

High potential for the deduplication technology.

Page 6: HYDRAstor: a Scalable Secondary Storage · 2010-09-08 · HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 3 Backup storage Tapes are most common, despite: Sensitive environment

HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 6

Deduplication

● Save disk space by eliminating duplicates● Sample reduction ratio 10:1 (depends on backup policy)

● Lowers price of gigabyte

B CA

D EA

File A

File B

B CAFile A

Sub-file level deduplication

B C DA E

Stored blocksOnly unique blocks Only unique blocks are storedare stored

Page 7: HYDRAstor: a Scalable Secondary Storage · 2010-09-08 · HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 3 Backup storage Tapes are most common, despite: Sensitive environment

HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 7

Global deduplication

● Prevent silos of deduped data● One system to manage

Global vs. siloed dedup

Page 8: HYDRAstor: a Scalable Secondary Storage · 2010-09-08 · HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 3 Backup storage Tapes are most common, despite: Sensitive environment

HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 8

HYDRAstor product

● Provides● global deduplication using DataRedux™● performance, storage scalability

and data resiliency using Distributed Resilient Data™

Page 9: HYDRAstor: a Scalable Secondary Storage · 2010-09-08 · HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 3 Backup storage Tapes are most common, despite: Sensitive environment

9

HYDRAstor deployment

● Interface: CIFS, NFS, Symantec OST

● Marker filtering for: Tivoli, Netbackup, Networker, CommVault

Page 10: HYDRAstor: a Scalable Secondary Storage · 2010-09-08 · HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 3 Backup storage Tapes are most common, despite: Sensitive environment

HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 10

HYDRAstor architecture

● Accelerator Nodes realize performance

● Storage Nodes realize capacity

InternalNetwork

Accelerator Nodes

Storage Nodes

NFS / CIFS / OSTover Ethernet

Page 11: HYDRAstor: a Scalable Secondary Storage · 2010-09-08 · HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 3 Backup storage Tapes are most common, despite: Sensitive environment

HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 11

HYDRAstor architecture

● Accelerator Nodes realize performance

● Storage Nodes realize capacity

InternalNetwork

Accelerator Nodes

Storage Nodes

NFS / CIFS / OSTover Ethernet

Non-disruptivegrid expansion

Page 12: HYDRAstor: a Scalable Secondary Storage · 2010-09-08 · HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 3 Backup storage Tapes are most common, despite: Sensitive environment

HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 12

HYDRAstor scalability

● MiniHYDRA – single server● Storage: 12 TB – 240 TB*● Performance: 1.3 TB / hour

● 2AN 4SN● Storage: 48 TB – 960 TB*● Performance: 3.6 TB / hour

● 20AN 40SN (4 racks)● Storage: 480 TB – 9600 TB*● Performance: 36 TB / hour

* - assuming 20x data reduction through DataRedux™

Page 13: HYDRAstor: a Scalable Secondary Storage · 2010-09-08 · HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 3 Backup storage Tapes are most common, despite: Sensitive environment

HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 13

HYDRAstor scalability

● Slide from Curtis Preston presentationCurtis Preston is a famous storage analyst owning independent consulting company

Page 14: HYDRAstor: a Scalable Secondary Storage · 2010-09-08 · HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 3 Backup storage Tapes are most common, despite: Sensitive environment

HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 14

HYDRAstor other features

● Fully automatic/non-disruptive mgmt● Recovery of lost data resiliency● Periodic data scrubbing● Machine and disk failure recovery

● Configurable redundancy level● erasure coding – better than RAID6

● Optimized replication● Smart resource management

Page 15: HYDRAstor: a Scalable Secondary Storage · 2010-09-08 · HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 3 Backup storage Tapes are most common, despite: Sensitive environment

HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 15

HYDRAstor backenddesign

Details of the design:http://www.usenix.org/events/fast09/tech/full_papers/dubnicki/dubnicki.pdf

Page 16: HYDRAstor: a Scalable Secondary Storage · 2010-09-08 · HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 3 Backup storage Tapes are most common, despite: Sensitive environment

HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 16

Programming Model

● Repository of blocks● Content-addressed● Immutable● Variable-sized

hash=011..0

Page 17: HYDRAstor: a Scalable Secondary Storage · 2010-09-08 · HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 3 Backup storage Tapes are most common, despite: Sensitive environment

HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 17

Programming Model

● Repository of blocks● Content-addressed● Immutable● Variable-sized

● Exposed pointers to other blocks

E

hash=011..0

011.

.0

Page 18: HYDRAstor: a Scalable Secondary Storage · 2010-09-08 · HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 3 Backup storage Tapes are most common, despite: Sensitive environment

HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 18

Programming Model

● Repository of blocks● Content-addressed● Immutable● Variable-sized

● Exposed pointers to other blocks

● Trees of blocks E

EE

ERoot1

E

hash=010..1

hash=011..0

011.

.0

Page 19: HYDRAstor: a Scalable Secondary Storage · 2010-09-08 · HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 3 Backup storage Tapes are most common, despite: Sensitive environment

HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 19

Programming Model

● Repository of blocks● Content-addressed● Immutable● Variable-sized

● Exposed pointers to other blocks

● Trees of blocks● DAGs due to deduplication● No cycles possible

E

EE

011.

.0

ERoot1

E

ERoot2hash=010..1

hash=110..0

hash=011..0

011.

.0

Page 20: HYDRAstor: a Scalable Secondary Storage · 2010-09-08 · HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 3 Backup storage Tapes are most common, despite: Sensitive environment

HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 20

Programming Model

● Repository of blocks● Content-addressed● Immutable● Variable-sized

● Exposed pointers to other blocks

● Trees of blocks● DAGs due to deduplication● No cycles possible

● Deletion of whole trees

E

EE

011.

.0

ERoot1

E

ERoot2hash=010..1

hash=110..0

hash=011..0

011.

.0

Page 21: HYDRAstor: a Scalable Secondary Storage · 2010-09-08 · HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 3 Backup storage Tapes are most common, despite: Sensitive environment

HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 21

Programming Model

● Repository of blocks● Content-addressed● Immutable● Variable-sized

● Exposed pointers to other blocks

● Trees of blocks● DAGs due to deduplication● No cycles possible

● Deletion of whole trees

E

EE

011.

.0

ERoot1

E

ERoot2hash=010..1

hash=110..0

hash=011..0

011.

.0

Page 22: HYDRAstor: a Scalable Secondary Storage · 2010-09-08 · HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 3 Backup storage Tapes are most common, despite: Sensitive environment

HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 22

Programming Model

● Repository of blocks● Content-addressed● Immutable● Variable-sized

● Exposed pointers to other blocks

● Trees of blocks● DAGs due to deduplication● No cycles possible

● Deletion of whole trees

E

EE

011.

.0

ERoot1

E

ERoot2hash=010..1

hash=110..0

hash=011..0

011.

.0

Page 23: HYDRAstor: a Scalable Secondary Storage · 2010-09-08 · HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 3 Backup storage Tapes are most common, despite: Sensitive environment

HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 23

Programming Model

● Repository of blocks● Content-addressed● Immutable● Variable-sized

● Exposed pointers to other blocks

● Trees of blocks● DAGs due to deduplication● No cycles possible

● Deletion of whole trees

E 011.

.0

ERoot2

hash=110..0

hash=011..0

011.

.0

Page 24: HYDRAstor: a Scalable Secondary Storage · 2010-09-08 · HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 3 Backup storage Tapes are most common, despite: Sensitive environment

HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 24

Failure tolerance: erasure coding

De

cod e

Any 3 fragments can be lost

Example: N=8, m=5

Enc

ode

Original block

Ori

gina

l F

ragm

e nts

R

edun

d ant

F

ragm

e nts

Page 25: HYDRAstor: a Scalable Secondary Storage · 2010-09-08 · HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 3 Backup storage Tapes are most common, despite: Sensitive environment

HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 25

Failure tolerance: erasure coding

De

cod e

Any 3 fragments can be lost

Example: N=8, m=5

Enc

ode

Original block

Ori

gina

l F

ragm

e nts

R

edun

d ant

F

ragm

e nts

Mirror 3-copy RAID6 Erasure coding

Resiliency 1 2 2 2 3

Overhead 100% 200% 20% 20% 33%

Assuming 12 disks array

Page 26: HYDRAstor: a Scalable Secondary Storage · 2010-09-08 · HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 3 Backup storage Tapes are most common, despite: Sensitive environment

HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 26

Scalability with DHT: data placement

● Block location: DHT with prefix routing

0 1

01 10 11

empty prefix

00

0

01

Page 27: HYDRAstor: a Scalable Secondary Storage · 2010-09-08 · HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 3 Backup storage Tapes are most common, despite: Sensitive environment

HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 27

Scalability with DHT: data placement

● Block location: DHT with prefix routing● Block mapped to hash prefix hash=011..0

0 1

01 10 11

empty prefix

00

Block

0

01

Page 28: HYDRAstor: a Scalable Secondary Storage · 2010-09-08 · HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 3 Backup storage Tapes are most common, despite: Sensitive environment

HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 28

Scalability with DHT: data placement

● Block location: DHT with prefix routing● Block mapped to hash prefix● Prefix components

● Hosted on SNs● N components

per prefix

hash=011..0

Block

Node 1Node 6

Node 1Node 5

Node 1Node 4

Node 1Node 3

Node 1Node 2

Node 1Node 1

0 1

1

3

2

0 0

1

2

3

0

1

2

3

2

3

1

0

01 10 11

empty prefix

00

N=4

Page 29: HYDRAstor: a Scalable Secondary Storage · 2010-09-08 · HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 3 Backup storage Tapes are most common, despite: Sensitive environment

HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 29

Scalability with DHT: data placement

hash=011..0

Block

Node 1Node 6

Node 1Node 5

Node 1Node 4

Node 1Node 3

Node 1Node 2

Node 1Node 1

0 1

1

3

2

0 0

1

2

3

0

1

2

3

2

3

1

0

01 10 11

empty prefix

00

N=4

● Block location: DHT with prefix routing● Block mapped to hash prefix● Prefix components

● Hosted on SNs● N components

per prefix● Store fragments

Page 30: HYDRAstor: a Scalable Secondary Storage · 2010-09-08 · HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 3 Backup storage Tapes are most common, despite: Sensitive environment

HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 30

Scalability with DHT: data placement

hash=011..0

Block

Node 1Node 6

Node 1Node 5

Node 1Node 4

Node 1Node 3

Node 1Node 2

Node 1Node 1

0 1

1

3

2

0 0

1

2

3

0

1

2

3

2

3

1

0

01 10 11

empty prefix

00

N=4

● Block location: DHT with prefix routing● Block mapped to hash prefix● Prefix components

● Hosted on SNs● N components

per prefix● Store fragments

● Distributedconsensus

Page 31: HYDRAstor: a Scalable Secondary Storage · 2010-09-08 · HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 3 Backup storage Tapes are most common, despite: Sensitive environment

HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 31

Scalability with DHT: data placement

hash=011..0

Block

Node 1Node 6

Node 1Node 5

Node 1Node 4

Node 1Node 3

Node 1Node 2

Node 1Node 1

0 1

1

3

2

0 0

1

2

3

0

1

2

3

2

3

1

0

01 10 11

empty prefix

00

N=4

● Block location: DHT with prefix routing● Block mapped to hash prefix● Prefix components

● Hosted on SNs● N components

per prefix● Store fragments

● Distributedconsensus

Page 32: HYDRAstor: a Scalable Secondary Storage · 2010-09-08 · HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 3 Backup storage Tapes are most common, despite: Sensitive environment

HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 32

Scalability with DHT: data placement

hash=011..0

Block

Node 1Node 6

Node 1Node 5

Node 1Node 4

Node 1Node 3

Node 1Node 2

Node 1Node 1

0 1

1

3

2

0 0

1

2

3

0

1

2

3

2

3

1

0

01 10 11

empty prefix

00

N=4

● Block location: DHT with prefix routing● Block mapped to hash prefix● Prefix components

● Hosted on SNs● N components

per prefix● Store fragments

● Distributedconsensus

Page 33: HYDRAstor: a Scalable Secondary Storage · 2010-09-08 · HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 3 Backup storage Tapes are most common, despite: Sensitive environment

HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 33

Scalability with DHT: data placement

hash=011..0

Block

Node 1Node 6

Node 1Node 5

Node 1Node 4

Node 1

Node 1

Node 3

Node 2

Node 1Node 1

0 1

1

3

2

0 0

1

2

3

0

1

2

3

2

3

1

0

01 10 11

empty prefix

00

N=4

● Block location: DHT with prefix routing● Block mapped to hash prefix● Prefix components

● Hosted on SNs● N components

per prefix● Store fragments

● Distributedconsensus

Page 34: HYDRAstor: a Scalable Secondary Storage · 2010-09-08 · HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 3 Backup storage Tapes are most common, despite: Sensitive environment

HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 34

Scalability with DHT: data placement

hash=011..0

Block

Node 1Node 6

Node 1Node 5

Node 1Node 4

Node 1Node 3

Node 1Node 2

Node 1Node 1

0 1

1

3

2

0 0

1

2

3

0

1

2

3

2

3

1

0

01 10 11

empty prefix

00

N=4

● Block location: DHT with prefix routing● Block mapped to hash prefix● Prefix components

● Hosted on SNs● N components

per prefix● Store fragments

● Distributedconsensus

● Load balancing

Page 35: HYDRAstor: a Scalable Secondary Storage · 2010-09-08 · HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 3 Backup storage Tapes are most common, despite: Sensitive environment

HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 35

Data organization: synchrun chains

A B EC D F G● Data stream split to blocks

Page 36: HYDRAstor: a Scalable Secondary Storage · 2010-09-08 · HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 3 Backup storage Tapes are most common, despite: Sensitive environment

HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 36

Data organization: synchrun chains

A B EC D F G

Hash 010…

Hash 101…

Hash 110…

Hash 011…

Hash 000…

Hash 011…

Hash 100…

● Data stream split to blocks

● Hashes of blocks computed

Page 37: HYDRAstor: a Scalable Secondary Storage · 2010-09-08 · HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 3 Backup storage Tapes are most common, despite: Sensitive environment

HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 37

Data organization: synchrun chains

A B EC D F G

Hash 010…

Hash 101…

Hash 110…

Hash 011…

Hash 000…

Hash 011…

Hash 100…

● Data stream split to blocks

● Hashes of blocks computed

● Routing through DHT

Page 38: HYDRAstor: a Scalable Secondary Storage · 2010-09-08 · HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 3 Backup storage Tapes are most common, despite: Sensitive environment

HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 38

Data organization: synchrun chains

A B EC D F G

Hash 010…

Hash 101…

Hash 110…

Hash 011…

Hash 000…

Hash 011…

Hash 100…

● Data stream split to blocks

● Hashes of blocks computed

● Routing through DHTPrefix 01

Page 39: HYDRAstor: a Scalable Secondary Storage · 2010-09-08 · HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 3 Backup storage Tapes are most common, despite: Sensitive environment

HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 39

Data organization: synchrun chains

A B EC D F G

Hash 010…

Hash 101…

Hash 110…

Hash 011…

Hash 000…

Hash 011…

Hash 100…

Erasure Coding

Compression

● Data stream split to blocks

● Hashes of blocks computed

● Routing through DHTPrefix 01

Page 40: HYDRAstor: a Scalable Secondary Storage · 2010-09-08 · HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 3 Backup storage Tapes are most common, despite: Sensitive environment

HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 40

Data organization: synchrun chains

A B EC D F G

Hash 010…

Hash 101…

Hash 110…

Hash 011…

Hash 000…

Hash 011…

Hash 100…

Prefix 01

Erasure Coding

Compression

● Data stream split to blocks

● Hashes of blocks computed

● Routing through DHT

Component0

Component1

Component2

Component3

● Erasure-coded fragments stored by components

Page 41: HYDRAstor: a Scalable Secondary Storage · 2010-09-08 · HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 3 Backup storage Tapes are most common, despite: Sensitive environment

HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 41

Data organization: synchrun chains

A B EC D F G

Hash 010…

Hash 101…

Hash 110…

Hash 011…

Hash 000…

Hash 011…

Hash 100…

Erasure Coding

Compression

● Data stream split to blocks

● Hashes of blocks computed

● Routing through DHT

A D F

A D F

A D F

A D F

Component0

Component1

Component2

Component3

Prefix 01

● Erasure-coded fragments stored by components

Page 42: HYDRAstor: a Scalable Secondary Storage · 2010-09-08 · HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 3 Backup storage Tapes are most common, despite: Sensitive environment

HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 42

Data organization: synchrun chains

A B EC D F G

Hash 010…

Hash 101…

Hash 110…

Hash 011…

Hash 000…

Hash 011…

Hash 100…

Synchrun 1 Synchrun 2 Synchrun 3

Prefix 01

Erasure Coding

Compression

Synchrun

● Data stream split to blocks

● Hashes of blocks computed

● Routing through DHT

Component0

Component1

Component2

Component3

● Erasure-coded fragments stored by components

● Grouped into synchruns

A D F

A D F

A D F

A D F

Page 43: HYDRAstor: a Scalable Secondary Storage · 2010-09-08 · HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 3 Backup storage Tapes are most common, despite: Sensitive environment

HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 43

Data organization: synchrun chains

A B EC D F G

Hash 010…

Hash 101…

Hash 110…

Hash 011…

Hash 000…

Hash 011…

Hash 100…

Synchrun 1 Synchrun 2 Synchrun 3

Prefix 01

Erasure Coding

Compression

● Data stream split to blocks

● Hashes of blocks computed

● Routing through DHT

Component0

Component1

Component2

Component3

Container

● Erasure-coded fragments stored by components

● Grouped into synchruns

● Containers stored on disks

● Fragment metadata separately from data

Synchrun

A D F

A D F

A D F

A D F

Page 44: HYDRAstor: a Scalable Secondary Storage · 2010-09-08 · HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 3 Backup storage Tapes are most common, despite: Sensitive environment

HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 44

Data organization: synchrun chains

A B EC D F G

Hash 010…

Hash 101…

Hash 110…

Hash 011…

Hash 000…

Hash 011…

Hash 100…

Synchrun 1 Synchrun 2 Synchrun 3

Erasure Coding

Compression

● Data stream split to blocks

● Hashes of blocks computed

● Routing through DHT

A D F

A D F

A D F

A D F

Component0

Component1

Component2

Component3

Prefix 01

● Erasure-coded fragments stored by components

● Grouped into synchruns

● Containers stored on disks

● Fragment metadata separately from data

● Ordered synchrun chains

● Preserve order & locality

● ManageableContainer Synchrun

Page 45: HYDRAstor: a Scalable Secondary Storage · 2010-09-08 · HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 3 Backup storage Tapes are most common, despite: Sensitive environment

HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 45

Component01:0

Component01:1

Component01:2

Component01:3

Data Services:Identification of data resiliency level

Missing fragments

Page 46: HYDRAstor: a Scalable Secondary Storage · 2010-09-08 · HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 3 Backup storage Tapes are most common, despite: Sensitive environment

HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 46

Data Services:Identification of data resiliency level

Component01:0

Component01:1

Component01:2

Component01:3

Chain scanning

Page 47: HYDRAstor: a Scalable Secondary Storage · 2010-09-08 · HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 3 Backup storage Tapes are most common, despite: Sensitive environment

HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 47

Data Services:Identification of data resiliency level

Component01:0

Component01:1

Component01:2

Component01:3

Chain scanning

Page 48: HYDRAstor: a Scalable Secondary Storage · 2010-09-08 · HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 3 Backup storage Tapes are most common, despite: Sensitive environment

HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 48

Data Services:Identification of data resiliency level

Component01:0

Component01:1

Component01:2

Component01:3

Chain scanning

Page 49: HYDRAstor: a Scalable Secondary Storage · 2010-09-08 · HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 3 Backup storage Tapes are most common, despite: Sensitive environment

HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 49

Data Services:Identification of data resiliency level

Component01:0

Component01:1

Component01:2

Component01:3

Chain scanning

Page 50: HYDRAstor: a Scalable Secondary Storage · 2010-09-08 · HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 3 Backup storage Tapes are most common, despite: Sensitive environment

HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 50

Data services: reconstruction

Component01:0

Component01:1

Component01:2

Component01:3

● Sequential read/write of entire Containers● Erasure decoding and re-encoding

Page 51: HYDRAstor: a Scalable Secondary Storage · 2010-09-08 · HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 3 Backup storage Tapes are most common, despite: Sensitive environment

HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 51

Data services: reconstruction

Component01:0

Component01:1

Component01:2

Component01:3

● Sequential read/write of entire Containers● Erasure decoding and re-encoding

Page 52: HYDRAstor: a Scalable Secondary Storage · 2010-09-08 · HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 3 Backup storage Tapes are most common, despite: Sensitive environment

HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 52

Data services: reconstruction

Component01:0

Component01:1

Component01:2

Component01:3

● Sequential read/write of entire Containers● Erasure decoding and re-encoding

Page 53: HYDRAstor: a Scalable Secondary Storage · 2010-09-08 · HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 3 Backup storage Tapes are most common, despite: Sensitive environment

HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 53

Data services: fast data transfer

Component01:0

Component01:1

Component01:2

Component01:3

Old component

01:3

Location of newnode (DHT)

Page 54: HYDRAstor: a Scalable Secondary Storage · 2010-09-08 · HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 3 Backup storage Tapes are most common, despite: Sensitive environment

HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 54

Data services: fast data transfer

Component01:0

Component01:1

Component01:2

Component01:3

Old component

01:3

Data transfer

Page 55: HYDRAstor: a Scalable Secondary Storage · 2010-09-08 · HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 3 Backup storage Tapes are most common, despite: Sensitive environment

HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 55

Data services: fast data transfer

Component01:0

Component01:1

Component01:2

Component01:3

Old component

01:3

Data transfer

Page 56: HYDRAstor: a Scalable Secondary Storage · 2010-09-08 · HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 3 Backup storage Tapes are most common, despite: Sensitive environment

HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 56

Data services: fast data transfer

Component01:0

Component01:1

Component01:2

Component01:3

Old component

01:3

Data transfer

Page 57: HYDRAstor: a Scalable Secondary Storage · 2010-09-08 · HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 3 Backup storage Tapes are most common, despite: Sensitive environment

HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 57

Data services: fast data transfer

Component01:0

Component01:1

Component01:2

Component01:3

Old component

01:3

Page 58: HYDRAstor: a Scalable Secondary Storage · 2010-09-08 · HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 3 Backup storage Tapes are most common, despite: Sensitive environment

HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 58

Data services for deduplication

Component

01:0

Component

01:1

Component

01:2

Component

01:3

hash=011..

Block

Choose completechain

Completeness: “definitely not a duplicate”Deletion interaction: wasn't the block scheduled for deletion?

Page 59: HYDRAstor: a Scalable Secondary Storage · 2010-09-08 · HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 3 Backup storage Tapes are most common, despite: Sensitive environment

HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 59

Data services for deduplicationhash=011..

Block

Component

01:0

Component

01:1

Component

01:2

Component

01:3

Query

Page 60: HYDRAstor: a Scalable Secondary Storage · 2010-09-08 · HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 3 Backup storage Tapes are most common, despite: Sensitive environment

HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 60

Data services for deduplicationhash=011..

Block

Local candidate found

Component

01:0

Component

01:1

Component

01:2

Component

01:3

Page 61: HYDRAstor: a Scalable Secondary Storage · 2010-09-08 · HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 3 Backup storage Tapes are most common, despite: Sensitive environment

HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 61

Data services for deduplicationhash=011..

Block

Candidate verification

Successfuldedup

Component

01:0

Component

01:1

Component

01:2

Component

01:3

Page 62: HYDRAstor: a Scalable Secondary Storage · 2010-09-08 · HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 3 Backup storage Tapes are most common, despite: Sensitive environment

HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 62

On-demand data deletion

● Distributed garbage collection● Per-block reference counter stored per-

fragment● Failure-tolerant

● Block reference counter calculated independently on peer Container chains

● Interference with duplicate elimination:● duplicates resurrection after garbage collection● space reclamation in background

Page 63: HYDRAstor: a Scalable Secondary Storage · 2010-09-08 · HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 3 Backup storage Tapes are most common, despite: Sensitive environment

HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 63

Resource management

● Configurable load balancing between:● backup/restore● background tasks (reconstruction, transfer, etc.)● garbage collection

● Shares depend on system state● Assigns priority of tasks automatically

● e.g. reconstruction before transfer or space reclamation

● Maximizes resources utilization

Page 64: HYDRAstor: a Scalable Secondary Storage · 2010-09-08 · HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 3 Backup storage Tapes are most common, despite: Sensitive environment

HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 64

Topics for further discussion

● Features and technical details of HYDRAstor● Sales of HYDRAstor in Poland● Cooperation with 9LivesData on other projects

Page 65: HYDRAstor: a Scalable Secondary Storage · 2010-09-08 · HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 3 Backup storage Tapes are most common, despite: Sensitive environment

HYDRAstor: a Scalable Secondary Storage. 9LivesData, LLC 65

Questions?

Contact:

[email protected]