amazon reshift as your data warehouse solution

31
AWS Redshift Your Data Warehouse Solution 1 Cloud IT Better

Upload: blazeclan-technologies-private-limited

Post on 11-May-2015

1.811 views

Category:

Technology


0 download

DESCRIPTION

An Introduction to the Speakers & What BlazeClan as an AWS Advanced Consulting Partners does and how it has Evolved. Varoon, Our Solution Architect, Specializing on Amazon Redshift, Talks about the Key differentiators of Amazon Redshift. Learn why & how Exactly Redshift can optimize your Time and Efforts & reduce costs by 1/10th the cost of a traditional warehouse solution. A Demo of Amazon Redshift in action, processing 2billion records in a matter of seconds! A casestudy of one of our products, Cloudlytics, and how it extensively user Amazon Redshift. We had conducted a webinar on Amazon Redshift, you can also view the Video of the Webinar along with the Q & A at the end of the Slideshare.

TRANSCRIPT

Page 1: Amazon Reshift as your Data Warehouse Solution

AWS Redshift Your Data Warehouse Solution

1 Cloud IT Better

Page 2: Amazon Reshift as your Data Warehouse Solution

Blazeclan

Agenda

Cloud IT Better2

Introduction to Amazon Redshift

Economics for Amazon Redshift

Redshift Demo

Cloudlytics.com Case Study

How BlazeClan can help your organization with Redshift?

Page 3: Amazon Reshift as your Data Warehouse Solution

Blazeclan 3 Cloud IT Better

Image courtesy: datacenterknowledge.com

Introduction to Amazon Redshift

Page 4: Amazon Reshift as your Data Warehouse Solution

Blazeclan

Image courtesy: datacenterdynamics.com

Amazon Redshift

• Fully managed, Petabyte scale data warehouse

• Provision in minutes

• Pay as you go, no upfront costs

• Extremely fast with low prices

• Supports SQL

• Allows JDBC & ODBC Connections

Cloud IT Better4

Page 5: Amazon Reshift as your Data Warehouse Solution

Blazeclan

Amazon Redshift – Key Differentiators

Cloud IT Better5

Built-in Security

Redshift Drastically Reduces I/O

Massively Parallel Processing (MPP) Architecture

Columnar StorageData Compression

Redshift parallelizes everything

EncryptionAmazon VPCAutomated backups

Page 6: Amazon Reshift as your Data Warehouse Solution

We’re off to a good start !

Some Happy feedbacks !

6

Page 7: Amazon Reshift as your Data Warehouse Solution

Blazeclan

Amazon Redshift Reduces I/O drastically

Cloud IT Better7

Column Storage

Direct-attached

Storage

Large data block

sizes

Data

Compression

Zone Maps

7

Page 8: Amazon Reshift as your Data Warehouse Solution

Blazeclan

Amazon Redshift Reduces I/O drastically

• Column Storage

• Data Compression

• Zone Maps

• Direct-attached Storage

• Large data block sizes

Cloud IT Better8

Typical Row Storage

Columnar Storage in Redshift

Page 9: Amazon Reshift as your Data Warehouse Solution

Blazeclan

Amazon Redshift Reduces I/O drastically

• Column Storage

• Data Compression

• Zone Maps

• Direct-attached Storage

• Large data block sizes

Cloud IT Better9

• Data compression reduces storage

• Increases I/O, improves query performance

• Less memory utilization, allowing more memory for query processing

Page 10: Amazon Reshift as your Data Warehouse Solution

Blazeclan

Amazon Redshift Reduces I/O drastically

• Column Storage

• Data Compression

• Zone Maps

• Direct-attached Storage

• Large data block sizes

Cloud IT Better10

• Keep track of minimum & maximum value of each block

• Skip over blocks that don’t contain the data needed for a query

• Minimize unnecessary I/O

Page 11: Amazon Reshift as your Data Warehouse Solution

Blazeclan

Amazon Redshift Reduces I/O drastically

• Column Storage

• Data Compression

• Zone Maps

• Direct-attached Storage

• Large data block sizes

Cloud IT Better11

• Use direct-attached storage to maximize throughput

• Hardware optimized for high performance data processing

• Large block sizes to make the most of each read

• Amazon Redshift manages durability for you

Page 12: Amazon Reshift as your Data Warehouse Solution

Blazeclan

Amazon Redshift Architecture

Cloud IT Better12

• Leader Node

• Manages communication with client

nodes and compute nodes

• Creates execution plans

• Compiles code based on execution

plan

• Distributes loads based on the

execution plan to multiple compute

nodes

• Compute Node

• Executes compiled code received

from the leader node

• Each node has dedicated compute

and storage capacity and memory

• Clusters can be scaled based on

the processing requirements

Page 13: Amazon Reshift as your Data Warehouse Solution

Redshift is Secure

• Amazon Redshift has security built-in

• SSL to secure data in transit

• Encryption to secure

data at rest• AES-256

• All blocks on disk and Amazon

S3 are encrypted

• No direct access to compute

nodes

• Amazon VPC Support

13

Page 14: Amazon Reshift as your Data Warehouse Solution

Blazeclan

Continuous Backup and Recovery

Cloud IT Better14

• Replication within the cluster and backup to

Amazon S3 to maintain multiple copies of data

all the times

• Backups to Amazon S3 are continuous,

automatic and incremental

• S3 is designed for eleven nines of durability

• Continuous monitoring and automated recovery

from failures of drives and nodes

• Able to restore snapshots to any Availability Zone

within a region

Page 15: Amazon Reshift as your Data Warehouse Solution

Blazeclan

Redshift Distributes & Parallelizes everything

Cloud IT Better15

Query

Restore

Resize

Load

Backup

Page 16: Amazon Reshift as your Data Warehouse Solution

Blazeclan

Redshift Distributes & Parallelizes everything

• Query

• Load

• Backup

• Restore

• Resize

Cloud IT Better16

Page 17: Amazon Reshift as your Data Warehouse Solution

Blazeclan

Redshift Distributes & Parallelizes everything

• Query

• Load

• Backup

• Restore

• Resize

Cloud IT Better17

• Load in Parallel from Amazon S3 & Amazon DynamoDB

• Data automatically distributed & sorted

• Scales linearly with number of nodes

Page 18: Amazon Reshift as your Data Warehouse Solution

Blazeclan

Redshift Distributes & Parallelizes everything

• Query

• Load

• Backup

• Restore

• Resize

Cloud IT Better18

• Backups up data automatically to Amazon S3

• Backups are continuous and incremental

• Configurable system snapshot retention period

• Take user snap shots on demand

• Streaming restores enable you to resume querying faster

Page 19: Amazon Reshift as your Data Warehouse Solution

Blazeclan

Redshift Distributes & Parallelizes everything

• Query

• Load

• Backup

• Restore

• Resize

Cloud IT Better19

• Scale up without any downtime

• Provision a new cluster in the background

• Copy data in parallel from node to node

• Only charged for source cluster

• Automatic SQL endpoint switchover via DNS

• Decommission Source Cluster

Page 20: Amazon Reshift as your Data Warehouse Solution

Blazeclan

Economics of Amazon Redshift

20 Cloud IT Better

Image courtesy: dataversity.net

Page 21: Amazon Reshift as your Data Warehouse Solution

Blazeclan

Traditional Data Warehouses

• Expensive Hardware &

Software Licensing

• Upfront investments

• Large team of skilled, highly

paid DBAs to manage

• Tuning & Administration is expensive

Cloud IT Better21

Image courtesy: clker.com

Page 22: Amazon Reshift as your Data Warehouse Solution

Blazeclan

Traditional Data Warehouses

• Large Enterprises

• YoY data growth is more than 50%

• Data warehousing is not growing at the

same rate

• Most of the data generated is not put in to

data warehouses

• Losing competitive edge as not all data is

analyzed

• Small Enterprises

• Cannot afford the current solutions

• Limited access to the expensive talent

pool to implement

Cloud IT Better22

Page 23: Amazon Reshift as your Data Warehouse Solution

Blazeclan

Amazon Redshift Pricing

• No upfront charges

• Pay-as-you-go

• Priced to analyze all your data

• Less than $1 per hour for on demand prices

• On Demand Annual Cost per TB = $3723

• 3 Year Reserved Annual Cost per TB = $999

Cloud IT Better23

Page 24: Amazon Reshift as your Data Warehouse Solution

Blazeclan

Amazon Redshift Configurations

• HS1.XL:• 2 Cores

• 6 GiB Memory

• 3 disk drives with 2 TB local

compressed storage

• HS1.8XL:• 16 Cores

• 128 GiB Memory

• 24 disk drives with 16 TB local storage

• 2 GB/second scan rate

• You can start with a Single Node instance

Cloud IT Better24

Page 25: Amazon Reshift as your Data Warehouse Solution

Amazon Redshift works with your existing Analysis tools

http://www.slideshare.net/AmazonWebServices/building-fault-tolerant-applications-in-the-cloud-aws-summit-2012-nyc

Content referenced from:

25

Page 27: Amazon Reshift as your Data Warehouse Solution

Blazeclan 27 Cloud IT Better

CLOUDLYTICS Case Study

Page 28: Amazon Reshift as your Data Warehouse Solution

Blazeclan

Cloudlytics.com

Cloud IT Better28

Pay as

you Go

Dynamic Graphs

to get a 360

degree

perspective

Detailed analysis of your

S3 & CloudFront access

patterns

Scalable &

Reliable service

built using

Amazon EMR &

RedShift

Cloudlytics -

Analyze your

Amazon S3 &

CloudFront

Logs.

Page 29: Amazon Reshift as your Data Warehouse Solution

Blazeclan

How BlazeClan can help you with Redshift?

29 Cloud IT Better

Page 30: Amazon Reshift as your Data Warehouse Solution

Blazeclan

End to End Data Warehouse Consulting

Cloud IT Better30

Requirement Analysis

Initial Data Migration

BI Integration

Design & Build ETL process

Data modeling

Capacity Planning & Redshift Setup

Training & Knowledge Transfer

Managed Services