sas forum kl - monetise your data with hadoop · pdf...
TRANSCRIPT
1 © Cloudera, Inc. All rights reserved.
Mone9se your Data with Cloudera Hadoop Calvin Hoon Director of Strategic Alliances & Channels Sales Asia Pacific/Japan
2 © Cloudera, Inc. All rights reserved.
Our mission:
Cloudera helps organiza9ons profit from all their data
3 © Cloudera, Inc. All rights reserved.
Cloudera company snapshot
Founded 2008, by former employees of Funding $670M cumula9ve investment Employees Today 900+ worldwide World Class Support 24x7 global staff
Pro-‐ac9ve & predic9ve support programs using our EDH Mission Cri9cal Produc9on deployments in run-‐the-‐business applica9ons
worldwide – Financial Services, Retail, Telecom, Media, Health Care, Energy, Government
The Largest Ecosystem More than 1,500 Partners Cloudera University Over 40,000 trained Open Source Leaders Cloudera employees are leading developers & contributors to
the complete Apache Hadoop ecosystem of projects
4 © Cloudera, Inc. All rights reserved.
Customer growth and reten9on Cloudera leads on all fronts
Categories of Hadoop adop/on
Big Data Maturity
Training
Services & Support
Subscrip/on
Free/Developer
Business Need
Training 60% of Fortune 100 acended Cloudera training, over 30,000 trained since 2009
Service & Support 9/10 for support sa9sfac9on, ability to solve technical issues #1 recommenda/on
Subscrip9on Over 2x revenue of nearest compe9tor, 90% renewal rate
Free/Developer Over 2.5 million downloads
5 © Cloudera, Inc. All rights reserved.
Customer success across industries Financial Services
Telecom
Healthcare & Life Sciences
Media & Technology
Retail & CP
Public Sector
6 © Cloudera, Inc. All rights reserved.
The future of Data Management
7 © Cloudera, Inc. All rights reserved.
Data Sources
Data Systems
Data Access
Business Analy9cs
Custom Applica9ons
Exis9ng Data
Databases
Opera9onal Applica9ons
New Data
Limited Data Not efficient to keep exis9ng data, let alone handle new data sources. Time consuming to transform data for analysis in exis9ng systems.
Limited Insights Power users struggle with data. Many users have no data.
Compliance and Privacy More data, more users, and more tools create complexity. Need to balance business agility with security and governance.
Tradi9onal Architectures Under Pressure
8 © Cloudera, Inc. All rights reserved.
Data Sources
Data Systems
Data Access
Business Analy9cs
Custom Applica9ons
Exis9ng Data
Databases
Opera9onal Applica9ons
New Data
EDH, more value, more data, more users, in less 9me
Enterprise Data Hub
Security and Administra9on
Unlimited Storage
Process Discover Model Serve Manage Compliance From risk due to regula0ons and customer privacy concerns,
to trust in a secure and compliant pla8orm
Keep Unlimited Data From disparate and limited views,
to unlimited informa0on access
Unlock Value from Data From analy0cs for some, to
insights for all
9 © Cloudera, Inc. All rights reserved.
Cloudera Enterprise powered by Apache Hadoop
A new kind of data plajorm. • One place for unlimited data • Unified, mul9-‐framework data access Only with Cloudera: • Enterprise Security • Data Governance • Complete Management • Open source, open standards
Security and Administra9on
Unlimited Storage
Process Discover Model Serve
Deployment Flexibility
On-‐Premises Appliances Engineered Systems
Public Cloud Private Cloud Hybrid Cloud
10 © Cloudera, Inc. All rights reserved.
One Plajorm, Many Workloads
Batch, Interac9ve, and Real-‐Time. Leading performance and usability in one plajorm.
• End-‐to-‐end analy9c workflows • Access more data • Work with data in new ways • Enable new users
Security and Administra9on
Process Ingest
Sqoop, Flume
Transform MapReduce,
Hive, Pig, Spark
Discover Analy9c Database
Impala
Search Solr
Model Machine Learning SAS, R, Spark,
Mahout
Serve NoSQL Database
HBase
Streaming Spark Streaming
Unlimited Storage HDFS, HBase
YARN, Cloudera Manager, Cloudera Navigator
11 © Cloudera, Inc. All rights reserved.
Cloudera Enterprise Data Hub
CDH (Cloudera Distribu9on for Apache Hadoop)
Kaqa
Sqoop
Flume
Sentry
Impala
Hive
MapReduce
YARN
Spark
Pig
Avro
Llama
Solr
Hue
Parquet
HDFS
HBase
Crunch
Oozie
HCatalog
…
Kite
Mahout
Zookeeper
Manager
Deployment
Configura9on
Repor9ng
Backup & DR
Management
Monitoring
Diagnos9cs
API & SNMP
Partners
Services Training
Enterprise
Director
Provision
Automate
Elas9c
API
Navigator
Security
Policy
Lineage
API
12 © Cloudera, Inc. All rights reserved.
Balance Security and Privacy with Business Agility
Cloudera is the leader in Hadoop security. Unique Capabili9es: • Comprehensive and Unified
• Secure at the core
• No Performance Impact • Jointly engineered with Intel
• Compliance-‐Ready • Only distribu9on to pass PCI audit
1. Perimeter Standards-‐based Authen9ca9on
Security and Administra9on
Unlimited Storage
Process Discover Model Serve
2. Access Unified Role-‐based Authoriza9on
4. Data Encryp9on & Key Management
3. Visibility Audi9ng & Governance
13 © Cloudera, Inc. All rights reserved.
The Cloudera approach Cloudera Enterprise
Enterprise Data Hub
Security and Administra9on
Unlimited Storage
Process Discover Model Serve
Manager
Navigator
Director
CDH
Cloudera Services
Inges9on and ETL Pilot
Descrip9ve Analy9cs Pilot
Cluster Cer9fica9on & Opera9ons
Pilot and or Proof of Concept
Cloudera Training
Administrator
Cer9fica9on
Developer
Analyst
Cloudera Partners
14 © Cloudera, Inc. All rights reserved.
Core Benefits of the Enterprise Data Hub
©2014 Cloudera, Inc. All rights reserved.
• Full-‐Fidelity Ac/ve Archive • Accelerate Time to Insight (Scale) • Unlock Agility and Explora/on • Consolidate Silos for 360o View • Enable Pervasive Analy/cs
15 © Cloudera, Inc. All rights reserved.
Case Studies and Success Stories
16 © Cloudera, Inc. All rights reserved.
Big Data for Opera9onal Efficiency Use Cases
Offload resource intensive ETL workloads from systems
Migrate old data and ELT workloads off of EDW
Store old data online so analyst can access historic data
ETL Offload EDW Op9miza9on Ac9ve Archive
17 © Cloudera, Inc. All rights reserved.
Store and process months of transac9ons, wai9ng days to weeks for new lines of enquiry?
Mone9se consumer spending, detect fraud from a PCI compliant repository spanning decades
18 © Cloudera, Inc. All rights reserved.
Joint Customer Spotlight: MasterCard
Fraud costs credit card issuers ~$10B
per year and is detected at a 40% rate.
Most detec9on models are limited by
the amount of data that is available for
analysis at one 9me, which is
constrained by extremely high cost.
Move ETL and storage to Hadoop
EDH and Impala extends queries to
data sets spanning mul9ple years, not just the tradi9onal weeks and months.
SAS® Visual Analy9cs and SAS Visual Sta9s9cs. SAS/ACCESS
Solu9on
Significantly cuts costs and /me to data
More data is held in ac/ve archive, both in original and digested formats, so it is available for future analysis. Test new models using historic data on an ad hoc basis using full and live data sets.
Challenge Benefit
Test new models using historic data on an ad hoc basis using full, live data sets at zero marginal cost
19 © Cloudera, Inc. All rights reserved. 19
How do we proac/vely address issues for our High Value Customers?
Who are my most valuable set of customers and how do I target them?
Pro-‐ac've Dashboard -‐ High LTV Customers with data usage issues are iden9fied real-‐9me and proac9vely approached to address the issue!
19 © 2014 Cloudera, Inc. All rights reserved.
20 © Cloudera, Inc. All rights reserved.
Driving Customer & Network Insights @ Telkomsel
BUSINESS CHALLENGE
Manage Data Growth & Drive Insights into Data With over 100% data volumes growth annually, Telkomsel needed an effec9ve way to offload the data from EDW and drive new analy9cal insights on its customers and network usage
Implemented Compelling Use Cases for Marke;ng & Proac;ve Care • Implemented diverse use cases for personalized marke'ng and proac've care – including Proac9ve Dashboard, Churn Analy9cs, Customer Life9me Value and Social Analy9cs
• Offload ETL opera9ons from the EDW for more cost-‐effec9ve data processing
SOLUTION DEPLOYED
Derive Business Insights from Massive amounts of Data faster Telkomsel deployed Cloudera’s Enterprise Data Hub on premise to derive valuable customer and network insights from data streaming from mobile devices. One of the first use cases was storing CDR data for longer data reten9on, followed by a full pipeline of use cases focused on enhancing consumer experience.
KEY BENEFITS REALIZED
21 © Cloudera, Inc. All rights reserved.
Why Cloudera?
Enterprise Security Meet compliance requirements and reduce risk exposure from storing sensi9ve data.
Data Governance Enable compliance and maximize analyst produc9vity.
Complete Management Deliver op9mum system u9liza9on and meet SLA commitments, on-‐premises or in the cloud, with minimum effort.
We deliver long-‐term produc9on success with enterprise Hadoop.
þ Open Source Innova/on No one knows Hadoop be_er than Cloudera. Cloudera leads development of enterprise Hadoop and offers the best support, training, and services.
þ Powerful Enterprise Tools Cloudera extends open source Hadoop with capabili9es required by the largest enterprises.
þ Ecosystem Cloudera partners with industry leaders to ensure Hadoop works with the plajorms, tools, and integrators our customers rely on.
22 © Cloudera, Inc. All rights reserved.
Explore the Possibili9es of SAS and Cloudera Execu/ve sponsored partnership which spans R&D, Product Management, Sales, Marke/ng, Consul/ng & Educa/on Services. SAS product integra/on with Cloudera is the most extensive of all the commercial Hadoop distribu/ons • SAS internal development teams have a Cloudera first policy and all internal work is performed on Cloudera clusters.
• Dedicated Cloudera resources at Cloudera HQ and SAS HQ working with SAS R&D • SAS has dedicated R&D resources to op9mize SAS solu9ons for the Cloudera plajorm
• Porjolio includes integra9on with Access to Hadoop, Access to Cloudera, Visual Analy9cs, In-‐Memory Sta9s9cs, High Performance Analy9cs, Scoring Accelerator for Cloudera Hadoop & Visual Sta9s9cs among others…
Nobody knows Hadoop like Cloudera. Nobody Knows Analy9cs like SAS. Together we deliver the BEST Big Data Analy9cs solu9ons!
Visit the Cloudera booth for more informa/on!
24 © Cloudera, Inc. All rights reserved.
25 © Cloudera, Inc. All rights reserved.
Thank you! [email protected]