sas modernization architectures - big data analytics

26
Copyright © 2013, SAS Institute Inc. All rights reserved. IT STRATEGY FOR SCALABLE ANALYTICS, MODERN DATA ARCHITECTURES

Upload: deepak-ramanathan

Post on 25-Jun-2015

526 views

Category:

Data & Analytics


4 download

DESCRIPTION

Big Data Analytics

TRANSCRIPT

Page 1: SAS Modernization architectures - Big Data Analytics

Copy r ight © 2013, SAS Ins t i tu te Inc . A l l r ights reserved.

IT STRATEGY FOR SCALABLE ANALYTICS, MODERN DATA ARCHITECTURES

Page 2: SAS Modernization architectures - Big Data Analytics

Copy r ight © 2013, SAS Ins t i tu te Inc . A l l r ights reserved.

MODERN ARCHITECTURES

Page 3: SAS Modernization architectures - Big Data Analytics

Copy r ight © 2013, SAS Ins t i tu te Inc . A l l r ights reserved.

STUNNING FACT

Making the Modern World: Materials and Dematerialization - Vaclav Smil

Page 4: SAS Modernization architectures - Big Data Analytics

Copy r ight © 2013, SAS Ins t i tu te Inc . A l l r ights reserved.

Page 5: SAS Modernization architectures - Big Data Analytics

Copy r ight © 2013, SAS Ins t i tu te Inc . A l l r ights reserved.

Scarcity

• Technology constrained

• Process-centric

• Focus on cost control

Everything is forbidden unless it is permitted

Abundance

• Focus on value

• Discovery-centric

• Technology empowered

Everything is permitted unless it is forbidden

Shift in Mindset

Page 6: SAS Modernization architectures - Big Data Analytics

Copy r ight © 2013, SAS Ins t i tu te Inc . A l l r ights reserved.

Trends Big Data, Storage, Hadoop & In-memory Technology

Vertica

Teradata

Greenplum

Oracle

Microsoft PDW

Hadoop

$- $20,000 $40,000 $60,000 $80,000 $100,000

Today 2009

Cost of Storage, Memory, Computing • In 2000 a GB of Disk $17 today < $0.07• In 2000 a GB of Ram $1800 today < $1• In 2009 a TB of RDBMS was $70K today < $ 20K

Cost per Terabyte

THE PERFECT STORM: STORAGE TECHNOLOGY COSTS AND CPU SPEED

Page 7: SAS Modernization architectures - Big Data Analytics

Copy r ight © 2013, SAS Ins t i tu te Inc . A l l r ights reserved.

MODERN REALITY

• Commoditization• Architectures• ScaleInfrastructure

• New Complex Streams• Perishable Considerations• Cost Data

• New Category of Business Problems• Analytical Algorithms• OperationalizationAnalytics

Page 8: SAS Modernization architectures - Big Data Analytics

8Copyright © 2011, SAS Institute Inc. All rights reserved.

Finding treasures in unstructured datalike social media or survey tools

that could uncover insightsabout consumer sentiment

Mine transaction databases for data of spending patterns that indicate a stolen card..

Leveraging historical data to drive better insight into decision-makingfor the future

Analyze massiveamounts of data inorder to accurately

identify areas likely toproduce the mostprofitable results

FORECASTING

DATA MINING

TEXT ANALYTICS

OPTIMIZATION

STATISTICS

ADVANCED ANALYTICS

INFORMATIONMANAGEMENT

Copyright © 2011, SAS Institute Inc. All rights reserved.

Page 9: SAS Modernization architectures - Big Data Analytics

Copy r ight © 2013, SAS Ins t i tu te Inc . A l l r ights reserved.

CURRENT TRENDS IN ANALYTICS

Complex Business Problems Are Driving Analytics Innovation

Speed Will Be Of Essence

Leverage Analytics To Unlock The Information Contained In Unstructured Data

Operationalizing Analytics

Page 10: SAS Modernization architectures - Big Data Analytics

Copy r ight © 2013, SAS Ins t i tu te Inc . A l l r ights reserved.

CURRENT AND FUTURE ARCHITECTURES

Page 11: SAS Modernization architectures - Big Data Analytics

Copy r ight © 2013, SAS Ins t i tu te Inc . A l l r ights reserved.

WHERE WE ARE TODAY?

SETTING THE SCENE

Operational Data Sources

EDW

Data Mart

Data Mart

Analytic Mart

Analytic Mart

BI and Analytics

Unstructured, Semi-structured and Streaming data (i.e. sensor data) handled often outside the Warehouse flow

Page 12: SAS Modernization architectures - Big Data Analytics

Copy r ight © 2013, SAS Ins t i tu te Inc . A l l r ights reserved.

WHERE DOES HADOOP FIT?

HADOOP AS A “NEW DATA” STORE

Operational Data Sources

EDW

Data Mart

Data Mart

Analytic Mart

Analytic Mart

BI and Analytics

Page 13: SAS Modernization architectures - Big Data Analytics

Copy r ight © 2013, SAS Ins t i tu te Inc . A l l r ights reserved.

WHERE DOES HADOOP FIT?

HADOOP AS AN ADDITIONAL INPUT TO THE EDW

Operational Data Sources

EDW

Data Mart

Data Mart

Analytic Mart

Analytic Mart

Analytic Mart

Data Mart

BI and Analytics

Page 14: SAS Modernization architectures - Big Data Analytics

Copy r ight © 2013, SAS Ins t i tu te Inc . A l l r ights reserved.

WHERE DOES HADOOP FIT?

HADOOP DATA PLATFORM AS A “STAGING LAYER” AS PART OF A “DATA LAKE” – Downstream stores could be Hadoop, data appliances or an RDBMS

Data Mart

Operational Data Sources EDW

Data Mart

Analytic Mart

Analytic Mart

BI and Analytics

Page 15: SAS Modernization architectures - Big Data Analytics

Copy r ight © 2013, SAS Ins t i tu te Inc . A l l r ights reserved.

15

SAS BIG DATA STRATEGY – SAS AREAS

Page 16: SAS Modernization architectures - Big Data Analytics

Copy r ight © 2013, SAS Ins t i tu te Inc . A l l r ights reserved.

Impala

SAS & HADOOP SAS® WITHIN THE HADOOP ECOSYSTEM

Next-GenSAS® User

User Interface

Metadata

Data Access

DataProcessing

FileSystem

SAS® User

MPI Based

SAS® LASR™ AnalyticServer

SAS® High-Performance

Analytic Procedures

HDFS

Base SAS & SAS/ACCESS® to Hadoop™

SAS Metadata

Pig

Map Reduce

In-MemoryData Access

SAS® Visual Analytics

SAS®

Enterprise Miner™

SAS® Data Integration

SAS®

EnterpriseGuide®

Hive

SAS Embedded Process

Accelerators

SAS® In-Memory Statistics for

Haodop

Page 17: SAS Modernization architectures - Big Data Analytics

Copy r ight © 2013, SAS Ins t i tu te Inc . A l l r ights reserved.

IDENTIFY /FORMULATE

PROBLEM

DATAPREPARATION

DATAEXPLORATION

TRANSFORM& SELECT

BUILDMODEL

VALIDATEMODEL

DEPLOYMODEL

EVALUATE /MONITORRESULTS

IN SUMMARY SAS ENABLES THE ENTIRE LIFECYCLE AROUND HADOOP

SAS Visual AnalyticsSAS Visual StatisticsSAS In-Memory Statistics for Hadoop

Done using either the Data Preparation, Data Exploration or Build Model Tools

SAS High Performance Analytics Offerings supported by relevant clients like SAS Enterprise Miner, SAS/STAT etc.

Decision Manager

SAS Scoring Accelerator for HadoopSAS Code Accelerator for Hadoop

SAS Visual AnalyticsDecision Manager

Done using either the Data Preparation, Data Exploration or Build Model Tools

Page 18: SAS Modernization architectures - Big Data Analytics

Copy r ight © 2013, SAS Ins t i tu te Inc . A l l r ights reserved.

SAS® VISUAL ANALYTICSA SINGLE SOLUTION FOR DATA DISCOVERY,

VISUALIZATION, ANALYTICS AND REPORTING

Page 19: SAS Modernization architectures - Big Data Analytics

Copy r ight © 2013, SAS Ins t i tu te Inc . A l l r ights reserved.

SAS® VISUAL ANALYTICS

EXAMPLE: TEXT ANALYSIS GIVES YOU INSIGHT TO CUSTOMER EXPERIENCE AND OPINION

VISUALIZATION POWERED BY SAS ANALYTICS Analytics applied

to text provides real MEANING

Page 20: SAS Modernization architectures - Big Data Analytics

Copy r ight © 2013, SAS Ins t i tu te Inc . A l l r ights reserved.

VISUALIZATION EXAMPLES

Page 21: SAS Modernization architectures - Big Data Analytics

Copy r ight © 2013, SAS Ins t i tu te Inc . A l l r ights reserved.

SAS® VISUAL STATISTICS

Page 22: SAS Modernization architectures - Big Data Analytics

Copy r ight © 2013, SAS Ins t i tu te Inc . A l l r ights reserved.

DATA TO DECISION LIFECYCLE

SAS® Visual StatisticsTEXT

COMPETITIVEADVANTAGE

MANAGE DATA

EX

PL

OR

ED

ATA

DEVELOP MODELS

DE

PL

OY

&

MO

NIT

OR

Page 23: SAS Modernization architectures - Big Data Analytics

Copy r ight © 2013, SAS Ins t i tu te Inc . A l l r ights reserved.

APPLICATION AREAS

Segmentation

Classification

Prediction

Ad-hoc Discovery

Data Preparation

Page 24: SAS Modernization architectures - Big Data Analytics

Copy r ight © 2013, SAS Ins t i tu te Inc . A l l r ights reserved.

SAS IN-MEMORY STATISTICS FOR HADOOP

Page 25: SAS Modernization architectures - Big Data Analytics

Copy r ight © 2013, SAS Ins t i tu te Inc . A l l r ights reserved.

SAS® IN-MEMORY STATISTICS FOR

HADOOP

WHY IT IS IMPORTANT?

SPEED

Multi-user interactive analytics environment for increased productivity

Proven state-of-the-art statistical algorithms and machine learning techniques

Highly scalable, in-memory environment grows easily as needed

Memory and data efficient for a significant reduction of data latency to rapidly analyze large and complex data in Hadoop

PRECISION

INTERACTIVE

SCALABLE

Page 26: SAS Modernization architectures - Big Data Analytics

Copy r ight © 2013, SAS Ins t i tu te Inc . A l l r ights reserved.sas.com