cloud 101 - federation of earth science information …commons.esipfed.org/sites/default/files/intro...

43
Cloud 101 Mike Gangl, Caltech/JPL, [email protected] © 2015 California Institute of Technology. Government sponsorship acknowledged

Upload: others

Post on 10-Jun-2020

0 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Cloud 101 - Federation of Earth Science Information …commons.esipfed.org/sites/default/files/Intro to Cloud...–Great for large user bases or burst usage •Gmail, flickr, Dropbox

Cloud 101

Mike Gangl, Caltech/JPL, [email protected]

© 2015 California Institute of Technology. Government sponsorship acknowledged

Page 2: Cloud 101 - Federation of Earth Science Information …commons.esipfed.org/sites/default/files/Intro to Cloud...–Great for large user bases or burst usage •Gmail, flickr, Dropbox

Outline

• What is cloud computing?

• Cloud service models

• Deployment models

• Storage

• Use cases

• Economics

• Caveats and issues

6/27/15 Cloud 101 2

Page 3: Cloud 101 - Federation of Earth Science Information …commons.esipfed.org/sites/default/files/Intro to Cloud...–Great for large user bases or burst usage •Gmail, flickr, Dropbox

What is cloud computing?

• Cloud computing is a model for enabling

ubiquitous, convenient, on-demand

network access to a shared pool of

configurable computing resources (e.g.,

networks, servers, storage, applications

and services) that can be rapidly

provisioned and released with minimal

management effort or service provider

interaction. –NIST

6/27/15 Cloud 101 3

Page 4: Cloud 101 - Federation of Earth Science Information …commons.esipfed.org/sites/default/files/Intro to Cloud...–Great for large user bases or burst usage •Gmail, flickr, Dropbox

What is cloud computing?

• Key Features: – On-demand self service

– Broad network access

– Resource pooling

– Rapid elasticity

– Measured service

• It is not: – Just like mainframes

– Just server virtualization

– Just a really, really big cluster

– A panacea

6/27/15 Cloud 101 4

Page 5: Cloud 101 - Federation of Earth Science Information …commons.esipfed.org/sites/default/files/Intro to Cloud...–Great for large user bases or burst usage •Gmail, flickr, Dropbox

What is cloud computing?

6/27/15 Cloud 101 5

Page 6: Cloud 101 - Federation of Earth Science Information …commons.esipfed.org/sites/default/files/Intro to Cloud...–Great for large user bases or burst usage •Gmail, flickr, Dropbox

Service Models

6/27/15 Cloud 101 6

Page 7: Cloud 101 - Federation of Earth Science Information …commons.esipfed.org/sites/default/files/Intro to Cloud...–Great for large user bases or burst usage •Gmail, flickr, Dropbox

Cloud Services Models

• Clouds offer three models of service:

– Software-as-a-Service (SaaS)

– Platform-as-a-Service (PaaS)

– Infrastructure-as-a-Service (IaaS)

6/27/15 Cloud 101 7

©Rackspace

Page 8: Cloud 101 - Federation of Earth Science Information …commons.esipfed.org/sites/default/files/Intro to Cloud...–Great for large user bases or burst usage •Gmail, flickr, Dropbox

Cloud Service Models

• Software-as-a-Service

– Application deployed over the internet

– Most visible to end users

– Great for large user bases or burst usage

• Gmail, flickr, Dropbox

• Burst usage examples:

– TurboTax: Busiest from January to April

– CBS: NCAA March Madness Streaming

– JPL: MSL Landing

6/27/15 Cloud 101 8

Page 9: Cloud 101 - Federation of Earth Science Information …commons.esipfed.org/sites/default/files/Intro to Cloud...–Great for large user bases or burst usage •Gmail, flickr, Dropbox

Cloud Service Models

• Platform-as-a-Service

– Suite of services to create web applications

quickly

– Infrastructure and software is provided

• Databases, user information/authentication,

messaging, load balancing, scalability

– Targets software development

– Examples

• Google App Engine

• AWS Elastic Beanstalk

6/27/15 Cloud 101 9

Page 10: Cloud 101 - Federation of Earth Science Information …commons.esipfed.org/sites/default/files/Intro to Cloud...–Great for large user bases or burst usage •Gmail, flickr, Dropbox

Cloud Service Models

• Infrastructure-as-a-Service

– Users allocate resources, often virtualized, such as servers, network, IP address and disk space

– Variable cost based on size/capacity of resources

– Allocation can be on-demand, reserved, or ‘spot’ instance pricing.

– Replaces the need of buying hardware. Easily “upgradeable”.

– Examples • Amazon Web Services

• Rackspace

6/27/15 Cloud 101 10

Page 11: Cloud 101 - Federation of Earth Science Information …commons.esipfed.org/sites/default/files/Intro to Cloud...–Great for large user bases or burst usage •Gmail, flickr, Dropbox

Deployment Models

6/27/15 Cloud 101 11

Page 12: Cloud 101 - Federation of Earth Science Information …commons.esipfed.org/sites/default/files/Intro to Cloud...–Great for large user bases or burst usage •Gmail, flickr, Dropbox

Deployment models

• Deployment depends on:

– Sensitivity of data/software

– Use case being addressed

– Costs

• Cloud deployments come in 4 models

– Private

– Public

– Hybrid

– Community

6/27/15 Cloud 101 12

Page 13: Cloud 101 - Federation of Earth Science Information …commons.esipfed.org/sites/default/files/Intro to Cloud...–Great for large user bases or burst usage •Gmail, flickr, Dropbox

Cloud Deployment Models

• Private Cloud

6/27/15 Cloud 101 13

Page 14: Cloud 101 - Federation of Earth Science Information …commons.esipfed.org/sites/default/files/Intro to Cloud...–Great for large user bases or burst usage •Gmail, flickr, Dropbox

Cloud Deployment Models

• Private Cloud – Cloud infrastructure usually managed locally (intranet vs internet)

– Resources are not shared outside of the cloud owner

– Useful when security is the key issue:

• Access within local firewalls or through secure VPN

• SOX, HIPPA, trade secrets

– Or based on other needs

• Public cloud SLAs aren’t guaranteeing enough

– Beware:

• Cloud technology companies are here today, gone tomorrow

• Upgrading cloud stacks (Openstack, Eucalyptus) is not trivial

• Users don’t like downtime

6/27/15 Cloud 101 14

Page 15: Cloud 101 - Federation of Earth Science Information …commons.esipfed.org/sites/default/files/Intro to Cloud...–Great for large user bases or burst usage •Gmail, flickr, Dropbox

Cloud Deployment Models

• Public Cloud

6/27/15 Cloud 101 15

Page 16: Cloud 101 - Federation of Earth Science Information …commons.esipfed.org/sites/default/files/Intro to Cloud...–Great for large user bases or burst usage •Gmail, flickr, Dropbox

Cloud Deployment Models

• Public Cloud – Cloud infrastructure provided by third party;

Quintessential cloud (internet vs intranet)

– Provider hosts multiple clients

– Useful for many applications: • Provide SAAS applications

• Handle load spikes

• Test out server/hardware architectures without large capital investments

– Examples: • Amazon

• Rackspace

• Google

6/27/15 Cloud 101 16

Page 17: Cloud 101 - Federation of Earth Science Information …commons.esipfed.org/sites/default/files/Intro to Cloud...–Great for large user bases or burst usage •Gmail, flickr, Dropbox

Cloud Deployment Models

• Hybrid Cloud

6/27/15 Cloud 101 17

Page 18: Cloud 101 - Federation of Earth Science Information …commons.esipfed.org/sites/default/files/Intro to Cloud...–Great for large user bases or burst usage •Gmail, flickr, Dropbox

Cloud Deployment Models

• Hybrid Cloud

– “Best of both worlds” model

– On premise cloud for normal load and functionality to reduce nominal cost

– “cloud bursting” to public cloud to handle large demands spikes

– Needs manual or automatic orchestration

– Examples • Microsoft Azure

• (Sales)Force.com PAAS

6/27/15 Cloud 101 18

Page 19: Cloud 101 - Federation of Earth Science Information …commons.esipfed.org/sites/default/files/Intro to Cloud...–Great for large user bases or burst usage •Gmail, flickr, Dropbox

Cloud Deployment Models

• Community Cloud

– “private cloud”++

– Jointly owned private cloud by groups with

similar compliance and policy considerations

– Examples

• Local first response agencies might share a cloud

that handles route, hospital and population

information instead of creating their own

• ESDIS cloud for all DAACs to use

6/27/15 Cloud 101 19

Page 20: Cloud 101 - Federation of Earth Science Information …commons.esipfed.org/sites/default/files/Intro to Cloud...–Great for large user bases or burst usage •Gmail, flickr, Dropbox

Storage

6/27/15 Cloud 101 20

Page 21: Cloud 101 - Federation of Earth Science Information …commons.esipfed.org/sites/default/files/Intro to Cloud...–Great for large user bases or burst usage •Gmail, flickr, Dropbox

Storage

• Storage in the cloud has different

properties

– Instance Storage

– Volume Storage

– Object Storage

• Public clouds generally make you pay for

storage and download costs, not input cost

• Storage has differing speeds of access

6/27/15 Cloud 101 21

Page 22: Cloud 101 - Federation of Earth Science Information …commons.esipfed.org/sites/default/files/Intro to Cloud...–Great for large user bases or burst usage •Gmail, flickr, Dropbox

Storage

• Instance Storage

– Disk space that comes with procured instances

– Exists only for the life of the instance!!!

– Free with instance pricing

• Volume (Block) Storage

– Most like a traditional hard drive

– Amazon EBS, Rackspace CBS, Open stack ‘Block Storage’, Eucalyptus volumes)

– Can be attached to different instances, but only one at a time

– Data persists on the device even when not in use

6/27/15 Cloud 101 22

Page 23: Cloud 101 - Federation of Earth Science Information …commons.esipfed.org/sites/default/files/Intro to Cloud...–Great for large user bases or burst usage •Gmail, flickr, Dropbox

Storage

• Object Storage

– Network accessible persistent storage

(usually http/ReST)

– Amazon S3, Rackspace Cloud Files,

Eucalyptus Buckets, OpenStack Object store

– Scalability and redundancy built in

– Private and/or public access

– Economy of scale cost savings

6/27/15 Cloud 101 23

Page 24: Cloud 101 - Federation of Earth Science Information …commons.esipfed.org/sites/default/files/Intro to Cloud...–Great for large user bases or burst usage •Gmail, flickr, Dropbox

Storage

• Object Storage Continued

– Some providers have multiple tiers (AWS and

Google most notably)

– Normal Object Storage ( $.03/GB example

only)

– Reduced Redundancy ( $.024/GB)

• Make sure you know what is meant by redundancy

– Archive Storage ($.01/GB)

• Long access times (up to 4 hours!)

6/27/15 Cloud 101 24

Page 25: Cloud 101 - Federation of Earth Science Information …commons.esipfed.org/sites/default/files/Intro to Cloud...–Great for large user bases or burst usage •Gmail, flickr, Dropbox

Storage

6/27/15 Cloud 101 25

Instance

Storage

Volume/Block

Storage

Object Storage

Speed High Medium/High Medium/Low

Redundancy

No No Yes

Chance of data

loss

High* low low

Cost Free** High Low

Scalable No No Yes

* All data in instance store is lost when the instance crashes, dies, or is terminated

** usually this comes free with each instance, but amount of space varies

Page 26: Cloud 101 - Federation of Earth Science Information …commons.esipfed.org/sites/default/files/Intro to Cloud...–Great for large user bases or burst usage •Gmail, flickr, Dropbox

Use Cases

6/27/15 Cloud 101 26

Page 27: Cloud 101 - Federation of Earth Science Information …commons.esipfed.org/sites/default/files/Intro to Cloud...–Great for large user bases or burst usage •Gmail, flickr, Dropbox

Use cases

• What are common use cases for the

cloud?

– Mitigating load spikes

– Scalability

– Batch processing

– Data Locality

6/27/15 Cloud 101 27

Page 28: Cloud 101 - Federation of Earth Science Information …commons.esipfed.org/sites/default/files/Intro to Cloud...–Great for large user bases or burst usage •Gmail, flickr, Dropbox

Use cases

• Load Spikes & Scalability

– Websites and Content distribution • Load balancers in front of expandable bank of ‘servers’

• Many “cloud” technologies (e.g. NoSQL databases) come easy-to-scale as data volume increases

• TurboTax

– Storage Capacity • “infinite” storage, pay

as usage increases

• Allow “unlimited” download bandwidth

6/27/15 Cloud 101 28

Page 29: Cloud 101 - Federation of Earth Science Information …commons.esipfed.org/sites/default/files/Intro to Cloud...–Great for large user bases or burst usage •Gmail, flickr, Dropbox

Use cases

• Batch Processing

– Provision hardware for one-off processing

– Save results back on institutional hardware

– Examples

• Backfilling Metadata or running analysis on millions

of files might take months. Trade $ for time and

have it done in days.

• CPU Intensive tasks; e.g. data mining

• Large Map/Reduce frameworks jobs

• eHarmony & your perfect match

6/27/15 Cloud 101 29

Page 30: Cloud 101 - Federation of Earth Science Information …commons.esipfed.org/sites/default/files/Intro to Cloud...–Great for large user bases or burst usage •Gmail, flickr, Dropbox

Use cases

• Data Locality

– Data volume is expanding, users can no

longer have the entire dataset on their own

machine or even a single server

– Bring the processing to the data!

• Store all data on cloud storage

• Have users pay for instances (local to the data) to

do their work

• Provide tools in the environment to assist users

(search, visualize, datamine)

6/27/15 Cloud 101 30

Page 31: Cloud 101 - Federation of Earth Science Information …commons.esipfed.org/sites/default/files/Intro to Cloud...–Great for large user bases or burst usage •Gmail, flickr, Dropbox

Use Cases

• Dealing with heavy loads (Reprocessing)

6/27/15 Cloud 101 31

Page 32: Cloud 101 - Federation of Earth Science Information …commons.esipfed.org/sites/default/files/Intro to Cloud...–Great for large user bases or burst usage •Gmail, flickr, Dropbox

Use Cases

6/27/15 Cloud 101 32

• Data-as-a-Service

Page 33: Cloud 101 - Federation of Earth Science Information …commons.esipfed.org/sites/default/files/Intro to Cloud...–Great for large user bases or burst usage •Gmail, flickr, Dropbox

Cloud Economics

6/27/15 Cloud 101 33

Page 34: Cloud 101 - Federation of Earth Science Information …commons.esipfed.org/sites/default/files/Intro to Cloud...–Great for large user bases or burst usage •Gmail, flickr, Dropbox

Economics

• Where “We can do that?” meets “It’s going to cost how much?”

• Take advantage of economies of scale

• Cloud economics inherently tricky:

– Can’t look only at capital costs • Admins

• Facility charges (power, cooling)?

• Redundancy

• Failure scenarios

• We will focus mainly on public clouds

6/27/15 Cloud 101 34

Page 35: Cloud 101 - Federation of Earth Science Information …commons.esipfed.org/sites/default/files/Intro to Cloud...–Great for large user bases or burst usage •Gmail, flickr, Dropbox

Economics

• Instances and servers – On demand pricing (Pay as you go)

• Pay more for better hardware

• $1.20/hour/instance

– Reserve pricing • Flat reservation fee + lower hourly

• Pay for a collection of resources up front, use and deploy however you’d like

– Spot pricing • Bid on pricing instances

• If “rate” falls below your bid, your instances start. They stop when the rates exceed bid

• Good for non-timely processing

• Storage/Object Stores – Pay per GB sometimes in, always out

– Bulk storage “discounts”

– Different Storage Tiers

6/27/15 Cloud 101 35

Page 36: Cloud 101 - Federation of Earth Science Information …commons.esipfed.org/sites/default/files/Intro to Cloud...–Great for large user bases or burst usage •Gmail, flickr, Dropbox

Economics

• Other costing

– Some providers aren’t as flexible as Amazon

or Rackspace

• Monthly pricing for a set allocation of resources

• Use it or lose it

– Bare metal clouds

• More expensive, but more predictable performance

– Who needs SAs?

• I do.

6/27/15 Cloud 101 36

Page 37: Cloud 101 - Federation of Earth Science Information …commons.esipfed.org/sites/default/files/Intro to Cloud...–Great for large user bases or burst usage •Gmail, flickr, Dropbox

Economics

• S3 costs

6/27/15 Cloud 101 37

Page 38: Cloud 101 - Federation of Earth Science Information …commons.esipfed.org/sites/default/files/Intro to Cloud...–Great for large user bases or burst usage •Gmail, flickr, Dropbox

Issues and Caveats

6/27/15 Cloud 101 38

Page 39: Cloud 101 - Federation of Earth Science Information …commons.esipfed.org/sites/default/files/Intro to Cloud...–Great for large user bases or burst usage •Gmail, flickr, Dropbox

Caveats

• All those cool services being offered…

– Quickly allow you to build scalable apps

– Lock you in to their custom tools and APIs

• Unchecked, costs can become a huge

issue

– Your datasets become extremely popular and

users are downloading terabytes each day…

you’re paying for that!

– Not using the XL1.xlarge instance to it’s

fullest? You’re paying for those unused cycles! 6/27/15 Cloud 101 39

Page 40: Cloud 101 - Federation of Earth Science Information …commons.esipfed.org/sites/default/files/Intro to Cloud...–Great for large user bases or burst usage •Gmail, flickr, Dropbox

Caveats

• Developer paradigm shifts

– Must refactor programs and applications to scale

– Plan for failures

– NIH*. Does. Not. Work.

• The Promised Land

– Cloud orchestration and dynamic scalability aren’t always easy

– One touch solutions will vary from provider to provider

6/27/15 Cloud 101 40

*Not Invented Here

Page 41: Cloud 101 - Federation of Earth Science Information …commons.esipfed.org/sites/default/files/Intro to Cloud...–Great for large user bases or burst usage •Gmail, flickr, Dropbox

Questions?

6/27/15 Cloud 101 41

Page 42: Cloud 101 - Federation of Earth Science Information …commons.esipfed.org/sites/default/files/Intro to Cloud...–Great for large user bases or burst usage •Gmail, flickr, Dropbox

jp l .nasa.gov

Page 43: Cloud 101 - Federation of Earth Science Information …commons.esipfed.org/sites/default/files/Intro to Cloud...–Great for large user bases or burst usage •Gmail, flickr, Dropbox

Backup Slides

• Useful links – Understanding the Cloud Computing stak (IaaS, PaaS, SaaS) by

Rackspace

– http://aws.amazon.com/architecture/

– http://www.ibm.com/cloud-computing/in/en/what-is-cloud-computing.html

– http://csrc.nist.gov/publications/nistpubs/800-145/SP800-145.pdf [NIST

Definition]

– http://aws.amazon.com/s3/pricing/

– http://aws.amazon.com/ec2/pricing/

– http://www.rackspace.com/en-us/cloud/public-pricing

6/27/15 Cloud 101 43

Reference herein to any specific commercial product, process, or service by trade name, trademark,

manufacturer, or otherwise, does not constitute or imply its endorsement by the United States

Government or the Jet Propulsion Laboratory, California Institute of Technology.