Storage Developer Conference 2009 © 2009 Insert Copyright information here. All rights reserved.

Cloud Seeding

Some Practical Methods for Moving Data between the Enterprise and the Cloud

Chris Dunder and Gradimir Starovic

Symantec, SAMG

Outline

Why we are looking at clouds

Amazon AWS storage services

Symantec products we used

Cloud storage as a tier

Replication and failover between on-premise and cloud

Things we learned while using cloud storage

Things to try next

Why cloud storage

There is obviously a wave of interest in clouds among users and product/service providers

Some of the potential benefits

on demand capacity, massive scale

pay as you go

reduced need for IT staff and premises

Why cloud storage

And some concerns, such as

security

availability

performance

integration with the legacy applications and data

Assumptions

Cloud-based computing and storage will co-exist with other, traditional ways of provisioning

Placement options for applications and their data:

both on-premise or both in cloud

one on-premise and the other in cloud

in two different clouds

This talk

About ongoing prototyping work

Exploring feasibility and finding the limits

applications on-premise and using cloud storage

failover between on-premise and cloud

using existing products (Veritas Storage Foundation and Cluster Server) to migrate or replicate data, monitor/failover applications

using AWS storage

Using Amazon AWS storage

Chosen for pragmatic reasons: AWS seems more mature and more widely used at the moment than other public clouds

Looking at both Elastic Block Store (EBS) and Simple Storage Service (S3)

Would like to do similar experiments with other clouds

Symantec products used

Storage Foundation (SF), enterprise-class management for online data

Veritas Volume Manager (VxVM), host-based disk virtualization - Windows & UNIX/Linux

Veritas Volume Replicator (VVR), network data replication for SF - Windows & UNIX/Linux

Veritas File System (VxFS), advanced-feature file system - UNIX/Linux

Veritas Dynamic Storage Tiering (DST) - UNIX/Linux

Symantec products used

Veritas Cluster Server (VCS)

Shared storage clusters supporting application failover within a data center and campus-wide

Global Cluster Option (GCO), extending failover across distances spanned by WAN

Windows & UNIX/Linux

Storage Foundation (SF) in the cloud

Storage Foundation in the cloud

Single machine in cloud

Includes VxVM and VxFS; basic functionality OK

Xen kernel

EBS devices respond to only some SCSI commands

Single path to all EBS devices

No Dynamic Multipathing (DMP)

DMP is irrelevant to performance w/ EBS (160-400 Mbps w/ EBS, and higher on the local scratch drive)

Cloud storage as a tier

Cloud storage as a tier

A tier is storage whose availability, performance and cost characteristics differ from those of other available storage (other tiers) by enough to justify moving data between them

In cloud

storage classes (EBS, S3)

cost is space and usage based

performance varies between the storage classes, between regions and availability zones

Veritas SF dynamic storage tiering

A mechanism for placing data on the suitable tier

Location transparency

Policy controlled initial file placement (on creation), migration and removal

Based on: file name and size, directory, time of last access, access frequency, ownership

Multi-volume file systems, for performance, availability and migration
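
The placement criteria above can be illustrated with a small sketch. This is not the DST policy engine or its policy syntax; it is a hypothetical Python function that picks a tier from a file name pattern and the time since last access. The names `FileInfo` and `choose_tier`, the 30-day threshold and the example patterns are all assumptions made for illustration; the tier labels TIER1 (on-premise) and TIER2 (cloud) follow the prototype diagram later in the talk.

```python
from dataclasses import dataclass
from fnmatch import fnmatch


@dataclass
class FileInfo:
    """Minimal file attributes a placement rule might inspect (hypothetical)."""
    name: str
    size_bytes: int
    days_since_access: float


def choose_tier(f: FileInfo,
                cold_after_days: float = 30.0,
                cold_patterns=("*.bak", "*.log")) -> str:
    """Return "TIER2" (cloud) for cold files, otherwise "TIER1" (on-premise).

    The threshold and patterns are illustrative assumptions, not DST defaults.
    """
    # Rule 1: name pattern. Files matching a "cold" pattern go to the cloud tier.
    if any(fnmatch(f.name, pattern) for pattern in cold_patterns):
        return "TIER2"
    # Rule 2: time of last access. Files untouched for long enough are relocated.
    if f.days_since_access >= cold_after_days:
        return "TIER2"
    # Everything else stays on the faster local tier.
    return "TIER1"
```

In a real deployment the file system evaluates such rules itself, so applications keep using the same paths regardless of which tier holds the data (location transparency).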

Example: dynamic storage tiering

EBS as a tier - process

In cloud

using a public Linux AMI provided by Amazon

export iSCSI target

storage provided by EBS volumes

On-premise server

create a multi-volume filesystem, some volumes are iSCSI targets in EC2

create DST policy

attach the policy to the file system

Cloud storage as a tier & issues

Requires 24x7 instance

Access through firewall

[Diagram: an on-premise server (iSCSI initiator, placement policy) uses on-premise storage (TIER1) and connects over a VPN, via a VPN server, to an AMI instance in the Amazon cloud that exports an iSCSI target (TIER2).]

Cloud storage as a tier – status

Currently running with tiers both inside and outside the cloud, using a VPN to avoid firewall issues

Placement rules: time of last access, access frequency, name pattern

Investigated use of S3 w/ FUSE and EBS w/ iSCSI

S3 as a tier – details

Provides file-level access only

PUT: 32-64 Mbps; GET: 20-96 Mbps (cloudstatus.org averages for week of Sep-6-09)

Snapshots: Not provided

Replication: Automatic (3 copies in 3 locations)
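
To put the quoted S3 rates in perspective, the sketch below converts a sustained rate into a transfer time. It assumes that Mbps means megabits per second, that 1 GB is 10^9 bytes, and that the rate is sustained with no protocol overhead; the function name `transfer_seconds` and the 10 GB example size are illustrative and do not come from the talk.

```python
def transfer_seconds(size_gb: float, rate_mbps: float) -> float:
    """Seconds needed to move size_gb gigabytes at rate_mbps megabits per second.

    Assumes decimal units (1 GB = 10**9 bytes, 1 Mbps = 10**6 bits/s)
    and a perfectly sustained rate.
    """
    bits = size_gb * 8 * 1000**3
    return bits / (rate_mbps * 1_000_000)


# Uploading a 10 GB file at the quoted PUT range of 32-64 Mbps:
slow_put = transfer_seconds(10, 32)   # 2500 s, roughly 42 minutes
fast_put = transfer_seconds(10, 64)   # 1250 s, roughly 21 minutes
```

Under these assumptions, relocating even a modest amount of data to an S3-backed tier takes tens of minutes, so placement policies would need to account for how often data moves.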

S3 as a tier – issues

FUSE-based products provide only a file system view

Elastic Drive provides a block level device, but it still doesn't respond to any SCSI commands

VxVM won't work without at least some working SCSI commands

Result: Poor performance when treated as a pseudo-block-level device

S3 is optimized for file-level access

EBS as a tier – details

Provides block-level access to devices

EBS devices: 160-400 Mbps

AWS internal network: 250-1000 Mbps (S3 to EC2 large file transfers)

Snapshots: lower performance than VxVM or VxFS snapshots, as expected since EBS snapshots are stored in S3

Replication: Automatic, but transparent

EBS as a tier – details

Costly

Requires EC2 instance (24x7)

$75-$600/mo not including data transfer costs

EBS volumes act as pseudo-SCSI devices which limits some functionality (e.g., no DMP)

Limit on the size and number of EBS volumes (can be increased by request)

Replication and failover using cloud

Background: Veritas Cluster Server

2-32 application servers

Shared storage devices

[Diagram: application servers serve clients, are linked to each other by a private network, and reach shared storage through a SAN (e.g., Fibre Channel, iSCSI); applications fail over and fail back between the servers.]

VCS global cluster option (GCO)

Two geographically separated VCS clusters

Unreliable “heartbeat” network

Long-distance replication of critical data

Can be asynchronous

[Diagram: two clusters joined by a WAN. Each cluster has application servers serving clients, a private network, shared storage on a SAN, and local application failover. Across the WAN: assisted application failover & failback, and replication of critical data.]

VCS GCO variation

Single-node “clusters” at each location

Failover now only over WAN

[Diagram: two single-node sites joined by a WAN, each serving clients with its own storage on a SAN. Across the WAN: assisted application failover & failback, and replication of critical data.]

VVR replication - prototype

[Diagram: an on-premise server running SF VVR connects over a VPN, via a VPN server, to an AMI instance with VVR in the Amazon cloud; the link carries control & monitor traffic.]

Cloud as a DR site

Veritas Volume Replicator (VVR) for replication to and from the cloud

VCS/GCO for failing over an application between on-premise machine and AMI instance in the cloud

Single-AMI cluster is failover target in the cloud

Can serve as the DR solution for multi-node on-premise cluster

Cloud as a DR site

Main data center on user premises

Based on a public Linux AMI provided by Amazon

Storage provided by EBS volumes

[Diagram: the on-premise site (server serving clients, storage on a SAN) is joined by a WAN to an Amazon Machine Image (AMI) instance with an Elastic Block Store (EBS) volume in the Amazon cloud. Across the WAN: assisted application failover & failback, and replication of critical data.]

Steps for VCS/GCO failover

GCO heartbeat (usually ICMP ping) times out

Cluster in the cloud declares on-premise cluster down

Replication target volume in the cloud becomes primary data

VCS mounts file system on former replication target volume

VCS restarts application on AMI instance in the cloud

VCS updates DNS

Manual steps: clear DNS cache or application-specific state
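
The sequence above can be read as: detect a heartbeat timeout, then run an ordered list of recovery steps. The sketch below models only that control flow. It is not VCS or GCO code and does not reflect their internal logic; `peer_is_down`, `run_failover` and the three-missed-heartbeats threshold are hypothetical, and each step is represented by a description string rather than a real action.

```python
from typing import Iterable, List

# Ordered automated steps, paraphrased from the list above.
FAILOVER_STEPS = [
    "declare on-premise cluster down",
    "promote replication target volume to primary",
    "mount file system on former replication target volume",
    "restart application on AMI instance",
    "update DNS",
]


def peer_is_down(heartbeats: Iterable[bool], max_missed: int = 3) -> bool:
    """True once max_missed consecutive heartbeats have failed.

    The threshold is an illustrative assumption, not a GCO default.
    """
    missed = 0
    for ok in heartbeats:
        # A successful heartbeat resets the counter; a failure increments it.
        missed = 0 if ok else missed + 1
        if missed >= max_missed:
            return True
    return False


def run_failover(heartbeats: Iterable[bool], max_missed: int = 3) -> List[str]:
    """Return the steps to execute, or an empty list if the peer looks alive."""
    if not peer_is_down(heartbeats, max_missed):
        return []
    return list(FAILOVER_STEPS)
```

Requiring several consecutive misses is one common way to avoid failing over on a single dropped ping, which matters when the heartbeat network is unreliable. The manual steps (clearing DNS caches or application-specific state) are left out of the sketch because they happen outside the cluster software.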

Cloud as a DR site – current status

Replication

from on-premise machine to AMI instance in the cloud

from AMI instance in the cloud to on-premise machine

GCO failover with VVR-replicated live data

from on-premise 1-node cluster to 1-node cluster in the cloud

“Repair” on-premise machine and fail back from cloud AMI instance to on-premise machine

On Linux and Windows Server platforms

VCS/GCO failover prototype - issues

Corporate firewall: requires open port or use of VPN

Requires 24x7 AMI instance for heart-beating

EC2 instances support one NIC

Clearing stale DNS cache entries or other application-specific non-persistent state

[Diagram: an on-premise server with SF VCS/GCO and VVR connects over a VPN, via a VPN server, to an AMI instance with SF VCS/GCO and VVR in the Amazon cloud; the link carries control & monitor traffic.]

Cloud as a DR site – performance

Difficult to be exact

In our case the limit is the local (on-premise) connection; results are better inside the cloud

Sync VVR: ~150 MB/min

Amazon: 250+ Mbps, but no guarantees
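
The two figures above use different units, so a quick conversion helps when comparing them. The sketch assumes MB means 10^6 bytes and Mbps means 10^6 bits per second; the function name `mb_per_min_to_mbps` is made up for the example.

```python
def mb_per_min_to_mbps(mb_per_min: float) -> float:
    """Convert megabytes per minute to megabits per second (decimal units)."""
    return mb_per_min * 8 / 60


vvr_mbps = mb_per_min_to_mbps(150)   # 20.0 Mbps for synchronous VVR
headroom = 250 / vvr_mbps            # 12.5x below the 250 Mbps Amazon figure
```

Under those assumptions, synchronous VVR at about 150 MB/min corresponds to about 20 Mbps, well below the 250+ Mbps figure quoted for Amazon, which is consistent with the local connection being the bottleneck.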

In future we would like to do

More tests using tiering, replication and failover

between on-premise and cloud

in the cloud

EBS volumes shared by multiple instances

I/O fencing for shared volumes

Using other clouds

Prototyping team

Matthew Cheung

Manav Deshmukh

Chris Lin

Chitrak Ojha

Paul Massiglia

Niranjan Pendharkar

Ajay Salpekar

Thank You!

Questions?
