
Page 1: SAS Forum KL - Monetise your Data with Hadoop

1  ©  Cloudera,  Inc.  All  rights  reserved.  

Monetise your Data with Cloudera Hadoop
Calvin Hoon, Director of Strategic Alliances & Channels Sales, Asia Pacific/Japan

Page 2:

Our mission:

Cloudera helps organizations profit from all their data

Page 3:

Cloudera company snapshot

• Founded: 2008, by former employees of (company logos not preserved in the transcript)
• Funding: $670M cumulative investment
• Employees Today: 900+ worldwide
• World Class Support: 24x7 global staff; proactive & predictive support programs using our EDH
• Mission Critical: Production deployments in run-the-business applications worldwide: Financial Services, Retail, Telecom, Media, Health Care, Energy, Government
• The Largest Ecosystem: More than 1,500 partners
• Cloudera University: Over 40,000 trained
• Open Source Leaders: Cloudera employees are leading developers & contributors to the complete Apache Hadoop ecosystem of projects

Page 4:

Customer growth and retention: Cloudera leads on all fronts

Categories of Hadoop adoption (chart axes: Business Need, Big Data Maturity)

• Training: 60% of Fortune 100 attended Cloudera training, over 30,000 trained since 2009
• Services & Support: 9/10 for support satisfaction, ability to solve technical issues, #1 recommendation
• Subscription: Over 2x revenue of nearest competitor, 90% renewal rate
• Free/Developer: Over 2.5 million downloads

Page 5:

Customer success across industries

• Financial Services
• Telecom
• Healthcare & Life Sciences
• Media & Technology
• Retail & CP
• Public Sector

Page 6:

The future of Data Management

Page 7:

Traditional Architectures Under Pressure

Diagram labels: Data Sources, Data Systems, Data Access, Business Analytics, Custom Applications, Existing Data, Databases, Operational Applications, New Data

• Limited Data: Not efficient to keep existing data, let alone handle new data sources. Time consuming to transform data for analysis in existing systems.
• Limited Insights: Power users struggle with data. Many users have no data.
• Compliance and Privacy: More data, more users, and more tools create complexity. Need to balance business agility with security and governance.

Page 8:

EDH: more value, more data, more users, in less time

Diagram labels: Data Sources, Data Systems, Data Access, Business Analytics, Custom Applications, Existing Data, Databases, Operational Applications, New Data

Enterprise Data Hub: Security and Administration; Unlimited Storage; Process, Discover, Model, Serve

• Manage Compliance: From risk due to regulations and customer privacy concerns, to trust in a secure and compliant platform
• Keep Unlimited Data: From disparate and limited views, to unlimited information access
• Unlock Value from Data: From analytics for some, to insights for all

Page 9:

Cloudera Enterprise powered by Apache Hadoop

A new kind of data platform.
• One place for unlimited data
• Unified, multi-framework data access

Only with Cloudera:
• Enterprise Security
• Data Governance
• Complete Management
• Open source, open standards

Diagram labels: Security and Administration; Unlimited Storage; Process, Discover, Model, Serve

Deployment Flexibility: On-Premises, Appliances, Engineered Systems, Public Cloud, Private Cloud, Hybrid Cloud

Page 10:

One Platform, Many Workloads

Batch, Interactive, and Real-Time. Leading performance and usability in one platform.

• End-to-end analytic workflows
• Access more data
• Work with data in new ways
• Enable new users

Platform layers and components:
• Process: Ingest (Sqoop, Flume); Transform (MapReduce, Hive, Pig, Spark)
• Discover: Analytic Database (Impala); Search (Solr)
• Model: Machine Learning (SAS, R, Spark, Mahout)
• Serve: NoSQL Database (HBase); Streaming (Spark Streaming)
• Unlimited Storage: HDFS, HBase
• Security and Administration: YARN, Cloudera Manager, Cloudera Navigator

Page 11:

Cloudera Enterprise Data Hub

• CDH (Cloudera Distribution for Apache Hadoop): Kafka, Sqoop, Flume, Sentry, Impala, Hive, MapReduce, YARN, Spark, Pig, Avro, Llama, Solr, Hue, Parquet, HDFS, HBase, Crunch, Oozie, HCatalog, …, Kite, Mahout, ZooKeeper
• Manager: Deployment, Configuration, Reporting, Backup & DR, Management, Monitoring, Diagnostics, API & SNMP
• Director: Provision, Automate, Elastic, API
• Navigator: Security, Policy, Lineage, API

Other diagram labels: Partners, Services, Training, Enterprise

Page 12:

Balance Security and Privacy with Business Agility

Cloudera is the leader in Hadoop security.

Unique Capabilities:
• Comprehensive and Unified
• Secure at the core
• No Performance Impact
• Jointly engineered with Intel
• Compliance-Ready
• Only distribution to pass PCI audit

1. Perimeter: Standards-based Authentication
2. Access: Unified Role-based Authorization
3. Visibility: Auditing & Governance
4. Data: Encryption & Key Management

Diagram labels: Security and Administration; Unlimited Storage; Process, Discover, Model, Serve

Page 13:

The Cloudera approach

• Cloudera Enterprise: Enterprise Data Hub (Security and Administration; Unlimited Storage; Process, Discover, Model, Serve), Manager, Navigator, Director, CDH
• Cloudera Services: Ingestion and ETL Pilot; Descriptive Analytics Pilot; Cluster Certification & Operations; Pilot and/or Proof of Concept
• Cloudera Training: Administrator, Certification, Developer, Analyst
• Cloudera Partners

Page 14:

Core Benefits of the Enterprise Data Hub

© 2014 Cloudera, Inc. All rights reserved.

• Full-Fidelity Active Archive
• Accelerate Time to Insight (Scale)
• Unlock Agility and Exploration
• Consolidate Silos for 360° View
• Enable Pervasive Analytics

Page 15:

Case Studies and Success Stories

Page 16:

Big Data for Operational Efficiency Use Cases

• ETL Offload: Offload resource-intensive ETL workloads from systems
• EDW Optimization: Migrate old data and ELT workloads off of EDW
• Active Archive: Store old data online so analysts can access historic data
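The three use cases above share one mechanical step: deciding which data leaves the warehouse. The following is a minimal Python sketch of that decision, assuming a simple age-based policy. The `Partition` record, the `plan_offload` helper, the 13-month retention window, and the sample sizes are all hypothetical and do not come from the slides.

```python
from dataclasses import dataclass
from datetime import date


@dataclass
class Partition:
    table: str
    month: date      # first day of the month the partition covers
    size_gb: float   # hypothetical size, for illustration only


def months_between(earlier: date, later: date) -> int:
    """Whole calendar months from `earlier` to `later`."""
    return (later.year - earlier.year) * 12 + (later.month - earlier.month)


def plan_offload(partitions, today, keep_months=13):
    """Split partitions into those kept in the EDW and those moved to the archive.

    Assumption: anything at least `keep_months` old is a candidate for the
    active archive; a real policy would also weigh query frequency and cost.
    """
    keep, archive = [], []
    for p in partitions:
        if months_between(p.month, today) >= keep_months:
            archive.append(p)
        else:
            keep.append(p)
    return keep, archive


parts = [
    Partition("transactions", date(2013, 1, 1), 800.0),
    Partition("transactions", date(2014, 6, 1), 950.0),
    Partition("transactions", date(2015, 2, 1), 1100.0),
]
keep, archive = plan_offload(parts, today=date(2015, 3, 1))
print("stay in EDW:", [p.month.isoformat() for p in keep])
print("move to archive:", [p.month.isoformat() for p in archive])
```

In this sketch archived partitions are only listed, not moved; the actual transfer tooling is outside what the slides describe.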

Page 17:

Store and process months of transactions, waiting days to weeks for new lines of enquiry?

Monetise consumer spending, detect fraud from a PCI-compliant repository spanning decades

Page 18:

Joint Customer Spotlight: MasterCard

Challenge

Fraud costs credit card issuers ~$10B per year and is detected at a 40% rate. Most detection models are limited by the amount of data that is available for analysis at one time, which is constrained by extremely high cost.

Solution

Move ETL and storage to Hadoop. EDH and Impala extend queries to data sets spanning multiple years, not just the traditional weeks and months. SAS® Visual Analytics and SAS Visual Statistics. SAS/ACCESS.

Benefit

Significantly cuts costs and time to data. More data is held in active archive, both in original and digested formats, so it is available for future analysis. Test new models using historic data on an ad hoc basis using full, live data sets at zero marginal cost.
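To illustrate why a longer history matters when testing a model, here is a small Python sketch that scores one candidate rule against a short recent window and against the full archive. It is an assumption-laden toy: the `detection_rate` and `amount_rule` helpers, the threshold, and the six sample transactions are invented for illustration and say nothing about the models or data MasterCard uses.

```python
def detection_rate(transactions, model):
    """Share of known-fraud transactions that `model` flags (i.e. recall)."""
    fraud = [t for t in transactions if t["is_fraud"]]
    if not fraud:
        return None  # no labelled fraud in this window, rate is undefined
    caught = sum(1 for t in fraud if model(t))
    return caught / len(fraud)


def amount_rule(threshold):
    """Hypothetical candidate model: flag any transaction above `threshold`."""
    return lambda t: t["amount"] > threshold


# Invented sample data standing in for an archive spanning several years.
history = [
    {"year": 2012, "amount": 120.0, "is_fraud": False},
    {"year": 2012, "amount": 980.0, "is_fraud": True},
    {"year": 2013, "amount": 700.0, "is_fraud": True},
    {"year": 2013, "amount": 300.0, "is_fraud": False},
    {"year": 2014, "amount": 1500.0, "is_fraud": True},
    {"year": 2014, "amount": 60.0, "is_fraud": True},
]

recent = [t for t in history if t["year"] == 2014]
rule = amount_rule(500.0)

print("recent window:", detection_rate(recent, rule))
print("full history:", detection_rate(history, rule))
```

The same rule scores differently on the two windows, which is the practical argument for keeping the full history queryable rather than only the latest weeks or months.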

Page 19:

How do we proactively address issues for our High Value Customers?

Who are my most valuable set of customers and how do I target them?

Proactive Dashboard: High LTV customers with data usage issues are identified in real time and proactively approached to address the issue!
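The selection logic behind such a dashboard can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the `Customer` fields, the `customers_to_contact` helper, and both thresholds are hypothetical, and the slides do not say how LTV or "data usage issues" are actually measured.

```python
from dataclasses import dataclass


@dataclass
class Customer:
    customer_id: str
    lifetime_value: float      # hypothetical LTV score
    failed_data_sessions: int  # hypothetical count of recent failed sessions


def customers_to_contact(customers, ltv_threshold=1000.0, max_failures=3):
    """Return high-LTV customers with data usage issues, most valuable first."""
    flagged = [
        c for c in customers
        if c.lifetime_value >= ltv_threshold
        and c.failed_data_sessions > max_failures
    ]
    return sorted(flagged, key=lambda c: c.lifetime_value, reverse=True)


sample = [
    Customer("A", 2500.0, 5),
    Customer("B", 300.0, 9),   # has issues, but below the LTV threshold
    Customer("C", 4000.0, 4),
    Customer("D", 1800.0, 0),  # high value, but no issues
]
result = customers_to_contact(sample)
print([c.customer_id for c in result])
```

A real-time version would apply the same filter to a stream of usage events instead of a static list.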


Page 20:

Driving Customer & Network Insights @ Telkomsel

BUSINESS CHALLENGE

Manage Data Growth & Drive Insights into Data
With data volumes growing by over 100% annually, Telkomsel needed an effective way to offload data from the EDW and drive new analytical insights on its customers and network usage.
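Growth of over 100% per year means the data volume at least doubles annually, which compounds quickly. A short Python sketch of the arithmetic, using exactly 100% as the lower bound; the 100 TB starting volume and the `projected_volume` helper are hypothetical:

```python
def projected_volume(start_tb, annual_growth, years):
    """Compound growth: volume after `years` at `annual_growth` (1.0 = 100%)."""
    return start_tb * (1 + annual_growth) ** years


# Assumed starting point of 100 TB; the slides give no absolute volumes.
for year in range(4):
    print(f"year {year}: {projected_volume(100.0, 1.0, year):.0f} TB")
```

At this rate the volume is eight times the starting point after three years, which is why offloading from the EDW becomes a recurring need rather than a one-off migration.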

SOLUTION DEPLOYED

Derive Business Insights from Massive Amounts of Data Faster
Telkomsel deployed Cloudera's Enterprise Data Hub on premise to derive valuable customer and network insights from data streaming from mobile devices. One of the first use cases was storing CDR data for longer data retention, followed by a full pipeline of use cases focused on enhancing consumer experience.

KEY BENEFITS REALIZED

Implemented Compelling Use Cases for Marketing & Proactive Care
• Implemented diverse use cases for personalized marketing and proactive care, including Proactive Dashboard, Churn Analytics, Customer Lifetime Value and Social Analytics
• Offload ETL operations from the EDW for more cost-effective data processing

Page 21:

Why Cloudera?

• Enterprise Security: Meet compliance requirements and reduce risk exposure from storing sensitive data.
• Data Governance: Enable compliance and maximize analyst productivity.
• Complete Management: Deliver optimum system utilization and meet SLA commitments, on-premises or in the cloud, with minimum effort.

We deliver long-term production success with enterprise Hadoop.

✓ Open Source Innovation: No one knows Hadoop better than Cloudera. Cloudera leads development of enterprise Hadoop and offers the best support, training, and services.
✓ Powerful Enterprise Tools: Cloudera extends open source Hadoop with capabilities required by the largest enterprises.
✓ Ecosystem: Cloudera partners with industry leaders to ensure Hadoop works with the platforms, tools, and integrators our customers rely on.

Page 22:

Explore the Possibilities of SAS and Cloudera

Executive-sponsored partnership which spans R&D, Product Management, Sales, Marketing, Consulting & Education Services.

SAS product integration with Cloudera is the most extensive of all the commercial Hadoop distributions.

• SAS internal development teams have a Cloudera-first policy and all internal work is performed on Cloudera clusters.
• Dedicated Cloudera resources at Cloudera HQ and SAS HQ working with SAS R&D
• SAS has dedicated R&D resources to optimize SAS solutions for the Cloudera platform
• Portfolio includes integration with Access to Hadoop, Access to Cloudera, Visual Analytics, In-Memory Statistics, High Performance Analytics, Scoring Accelerator for Cloudera Hadoop & Visual Statistics, among others…

Page 23:

Nobody knows Hadoop like Cloudera. Nobody knows Analytics like SAS. Together we deliver the BEST Big Data Analytics solutions!

Visit the Cloudera booth for more information!


Page 25:

Thank you! [email protected]