Amazon Redshift as Your Data Warehouse Solution
DESCRIPTION
An introduction to the speakers, what BlazeClan does as an AWS Advanced Consulting Partner, and how it has evolved. Varoon, our Solution Architect specializing in Amazon Redshift, talks about the key differentiators of Amazon Redshift. Learn why and how Redshift can save you time and effort and reduce costs to one-tenth of the cost of a traditional data warehouse solution. A demo of Amazon Redshift in action, processing 2 billion records in a matter of seconds! A case study of one of our products, Cloudlytics, and how it makes extensive use of Amazon Redshift. We conducted a webinar on Amazon Redshift; you can view the video of the webinar, along with the Q&A, at the end of the SlideShare.
TRANSCRIPT
AWS Redshift Your Data Warehouse Solution
1 Cloud IT Better
Blazeclan
Agenda
Introduction to Amazon Redshift
Economics for Amazon Redshift
Redshift Demo
Cloudlytics.com Case Study
How BlazeClan can help your organization with Redshift?
Image courtesy: datacenterknowledge.com
Introduction to Amazon Redshift
Image courtesy: datacenterdynamics.com
Amazon Redshift
• Fully managed, Petabyte scale data warehouse
• Provision in minutes
• Pay as you go, no upfront costs
• Extremely fast with low prices
• Supports SQL
• Allows JDBC & ODBC Connections
Amazon Redshift – Key Differentiators
• Redshift Drastically Reduces I/O: Columnar Storage, Data Compression
• Massively Parallel Processing (MPP) Architecture: Redshift parallelizes everything
• Built-in Security: Encryption, Amazon VPC, Automated backups
We’re off to a good start!
Some happy feedback!
Amazon Redshift Reduces I/O drastically
• Column Storage
• Data Compression
• Zone Maps
• Direct-attached Storage
• Large data block sizes
Column Storage (figure: typical row storage vs. columnar storage in Redshift)
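To make the row versus column comparison concrete, here is a minimal Python sketch. The table, column names, and values are invented for illustration, and plain lists stand in for disk blocks; it is not a description of Redshift's internal implementation.

```python
# Hypothetical three-column table (id, name, amount); values are illustrative.
rows = [
    (1, "alice", 120.0),
    (2, "bob", 75.5),
    (3, "carol", 42.0),
]

# Row storage: each record is stored whole, so a query that needs only
# `amount` still reads the id and name of every record.
row_store = list(rows)

# Columnar storage: values of the same column are stored together.
column_store = {
    "id": [r[0] for r in rows],
    "name": [r[1] for r in rows],
    "amount": [r[2] for r in rows],
}


def total_amount_row_store(store):
    """Sum `amount`, counting every value that had to be read."""
    values_read = 0
    total = 0.0
    for record in store:
        values_read += len(record)  # the whole record is read
        total += record[2]
    return total, values_read


def total_amount_column_store(store):
    """Sum `amount` by reading only that one column."""
    column = store["amount"]
    return sum(column), len(column)
```

Both functions return the same total, but the columnar version touches a third of the values in this toy table, which is the I/O saving the slide refers to.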
Data Compression
• Data compression reduces storage
• Reduces I/O, which improves query performance
• Less memory utilization, allowing more memory for query processing
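Columns often hold long runs of repeated values, which is one reason columnar data tends to compress well. The sketch below uses run-length encoding as one simple example scheme; the column contents are invented, and the choice of encoding is an assumption made for illustration rather than a statement of what a given Redshift table uses.

```python
from itertools import groupby


def rle_encode(values):
    """Collapse consecutive repeats into (value, run_length) pairs."""
    return [(value, len(list(run))) for value, run in groupby(values)]


def rle_decode(pairs):
    """Expand (value, run_length) pairs back into the original list."""
    return [value for value, count in pairs for _ in range(count)]


# Hypothetical sorted "country" column with many repeated values.
country_column = ["IN"] * 5 + ["UK"] * 3 + ["US"] * 4
encoded = rle_encode(country_column)
```

Here twelve stored values shrink to three pairs, so fewer bytes need to be read from disk and held in memory for the same column.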
Zone Maps
• Keep track of minimum & maximum value of each block
• Skip over blocks that don’t contain the data needed for a query
• Minimize unnecessary I/O
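The block-skipping idea can be sketched in a few lines of Python. The block size, the data, and the function names are hypothetical; the point is only to show how per-block minimum and maximum values let a scan avoid reading blocks that cannot match.

```python
def build_zone_map(blocks):
    """Record the (min, max) value of each block."""
    return [(min(block), max(block)) for block in blocks]


def scan_equal(blocks, zone_map, target):
    """Return matching values and the number of blocks actually read."""
    matches, blocks_read = [], 0
    for block, (low, high) in zip(blocks, zone_map):
        if target < low or target > high:
            continue  # target cannot be in this block, so skip the read
        blocks_read += 1
        matches.extend(v for v in block if v == target)
    return matches, blocks_read


# Hypothetical sorted column split into blocks of four values.
blocks = [[1, 2, 3, 4], [5, 6, 7, 8], [9, 10, 11, 12]]
zone_map = build_zone_map(blocks)
```

Looking up the value 6 reads one block out of three in this example. The benefit depends on the data being sorted or clustered so that block ranges overlap as little as possible.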
Direct-attached Storage & Large data block sizes
• Use direct-attached storage to maximize throughput
• Hardware optimized for high performance data processing
• Large block sizes to make the most of each read
• Amazon Redshift manages durability for you
Amazon Redshift Architecture
• Leader Node
  • Manages communication with client programs and compute nodes
  • Creates execution plans
  • Compiles code based on the execution plan
  • Distributes load to multiple compute nodes based on the execution plan
• Compute Node
  • Executes compiled code received from the leader node
  • Each node has dedicated compute, memory and storage capacity
  • Clusters can be scaled based on the processing requirements
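The division of work between the leader node and the compute nodes can be sketched roughly as follows. This is a toy model under stated assumptions: threads stand in for compute nodes, rows are spread round-robin, and the query is a simple sum. Planning and code compilation, which the leader node also performs, are left out.

```python
from concurrent.futures import ThreadPoolExecutor


def distribute(rows, node_count):
    """Leader step: assign rows to compute nodes round-robin."""
    slices = [[] for _ in range(node_count)]
    for i, row in enumerate(rows):
        slices[i % node_count].append(row)
    return slices


def compute_node_sum(node_rows):
    """Compute-node step: aggregate only the locally stored rows."""
    return sum(node_rows)


def leader_sum(rows, node_count=3):
    """Leader step: fan the work out, then combine the partial results."""
    slices = distribute(rows, node_count)
    with ThreadPoolExecutor(max_workers=node_count) as pool:
        partials = list(pool.map(compute_node_sum, slices))
    return sum(partials)
```

For example, `leader_sum(list(range(1, 101)))` splits one hundred values across three workers and combines three partial sums into the final answer.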
Redshift is Secure
• Amazon Redshift has security built-in
• SSL to secure data in transit
• Encryption to secure data at rest
  • AES-256
  • All blocks on disk and Amazon S3 are encrypted
• No direct access to compute nodes
• Amazon VPC Support
Continuous Backup and Recovery
• Replication within the cluster and backup to Amazon S3 to maintain multiple copies of data at all times
• Backups to Amazon S3 are continuous, automatic and incremental
• S3 is designed for eleven nines of durability
• Continuous monitoring and automated recovery from failures of drives and nodes
• Able to restore snapshots to any Availability Zone within a region
Redshift Distributes & Parallelizes everything
• Query
• Load
• Backup
• Restore
• Resize
Load
• Load in Parallel from Amazon S3 & Amazon DynamoDB
• Data automatically distributed & sorted
• Scales linearly with number of nodes
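As a rough illustration of a parallel load from Amazon S3, the Python sketch below splits data into several parts and builds a Redshift `COPY` statement that points at a shared key prefix, so that all files under the prefix can be loaded together. The table name, bucket, prefix, and IAM role ARN are hypothetical placeholders, and the delimiter and GZIP options are assumptions about the file format.

```python
def split_into_parts(rows, part_count):
    """Split rows into roughly equal parts, one output file per part."""
    return [rows[i::part_count] for i in range(part_count)]


def build_copy_statement(table, s3_prefix, iam_role_arn):
    """Build a COPY statement that loads every file under the S3 prefix.

    Inputs are assumed to be trusted; this helper does no escaping.
    """
    return (
        f"COPY {table} "
        f"FROM '{s3_prefix}' "
        f"IAM_ROLE '{iam_role_arn}' "
        "DELIMITER ',' GZIP;"
    )


# Hypothetical placeholder values, not taken from the presentation.
statement = build_copy_statement(
    "access_logs",
    "s3://example-bucket/logs/part-",
    "arn:aws:iam::123456789012:role/ExampleRedshiftRole",
)
```

Splitting the input into multiple files is what gives the compute nodes something to load in parallel; a single large file limits how much of the cluster can take part in the load.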
Backup & Restore
• Backs up data automatically to Amazon S3
• Backups are continuous and incremental
• Configurable system snapshot retention period
• Take user snapshots on demand
• Streaming restores enable you to resume querying faster
Resize
• Scale up without any downtime
• Provision a new cluster in the background
• Copy data in parallel from node to node
• Only charged for source cluster
• Automatic SQL endpoint switchover via DNS
• Decommission Source Cluster
Economics of Amazon Redshift
Image courtesy: dataversity.net
Traditional Data Warehouses
• Expensive Hardware & Software Licensing
• Upfront investments
• Large team of skilled, highly paid DBAs to manage
• Tuning & Administration is expensive
Image courtesy: clker.com
Traditional Data Warehouses
• Large Enterprises
  • YoY data growth is more than 50%
  • Data warehousing is not growing at the same rate
  • Most of the data generated is not put into data warehouses
  • Losing competitive edge as not all data is analyzed
• Small Enterprises
  • Cannot afford the current solutions
  • Limited access to the expensive talent pool to implement
Amazon Redshift Pricing
• No upfront charges
• Pay-as-you-go
• Priced to analyze all your data
• Less than $1 per hour at on-demand prices
• On Demand Annual Cost per TB = $3723
• 3 Year Reserved Annual Cost per TB = $999
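The per-TB figures above can be related to the "less than $1 per hour" claim with a little arithmetic. The sketch below assumes the per-TB cost is quoted for a node with 2 TB of storage, as for the HS1.XL node described in this deck, and a 365-day year; both are assumptions used to show how the numbers fit together, not official pricing.

```python
HOURS_PER_YEAR = 24 * 365  # 8760 hours, assuming a 365-day year

on_demand_per_tb_year = 3723.0     # from the slide
reserved_3yr_per_tb_year = 999.0   # from the slide
node_storage_tb = 2.0              # assumption: one 2 TB node (HS1.XL)

# Implied hourly on-demand price for one 2 TB node.
implied_hourly = on_demand_per_tb_year * node_storage_tb / HOURS_PER_YEAR

# Fraction saved by the 3-year reserved price relative to on-demand.
reserved_saving = 1 - reserved_3yr_per_tb_year / on_demand_per_tb_year

print(f"Implied on-demand price: ${implied_hourly:.2f} per hour")
print(f"3-year reserved saving: {reserved_saving:.0%}")
```

Under these assumptions the on-demand figure works out to $0.85 per node-hour, and the 3-year reserved price is roughly 73% lower per TB per year.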
Amazon Redshift Configurations
• HS1.XL:
  • 2 Cores
  • 6 GiB Memory
  • 3 disk drives with 2 TB local compressed storage
• HS1.8XL:
  • 16 Cores
  • 128 GiB Memory
  • 24 disk drives with 16 TB local storage
  • 2 GB/second scan rate
• You can start with a Single Node instance
Amazon Redshift works with your existing Analysis tools
Content referenced from: http://www.slideshare.net/AmazonWebServices/building-fault-tolerant-applications-in-the-cloud-aws-summit-2012-nyc
Case Study
CLOUDLYTICS Case Study
Cloudlytics.com
• Cloudlytics - Analyze your Amazon S3 & CloudFront Logs
• Pay as you Go
• Dynamic Graphs to get a 360 degree perspective
• Detailed analysis of your S3 & CloudFront access patterns
• Scalable & Reliable service built using Amazon EMR & Redshift
How BlazeClan can help you with Redshift?
End to End Data Warehouse Consulting
Requirement Analysis
Initial Data Migration
BI Integration
Design & Build ETL process
Data modeling
Capacity Planning & Redshift Setup
Training & Knowledge Transfer
Managed Services
Thank you
www.blazeclan.com
Our Blog : http://blog.blazeclan.com/
Contact us : [email protected]