big data closing the analytics gap - sas institute€¦ · copyright © 2011, sas institute inc....

Post on 10-May-2020

2 Views

Category:

Documents

0 Downloads

Preview:

Click to see full reader

TRANSCRIPT

Copyright © 2011, SAS Institute Inc. All rights reserved.

make connections • share ideas • be inspired

Big Data – Closing the analytics gap André Münger Greenplum andre.muenger@emc.com

Copyright © 2011, SAS Institute Inc. All rights reserved.

Ben Mac

Mouth

Money Bang

B

a

n

d

Brother

Lebowski

F

i

v

e

Fish

Business

Was ist all diesen Begriffen gemeinsam?

Copyright © 2011, SAS Institute Inc. All rights reserved.

DATA MINING & BIG DATA Mythos

+ =

+

1. Überhaupt nichts, ist am Preiswertesten

2. Entschuldigungsbrief schreiben (minimaler Aufwand, minimale Kosten)

3. Gutschein über 300 Franken für den nächsten Flug

4. Eine Jahresmitgliedschaft in der VIP-Lounge

5. 25.000 geschenkte Frequent-Flyer-Meilen

Copyright © 2011, SAS Institute Inc. All rights reserved.

7 THESEN ZU BIG DATA

Copyright © 2011, SAS Institute Inc. All rights reserved.

1. Man kann vor Big

Data wegrennen,

sich aber nicht

verstecken.

Copyright © 2011, SAS Institute Inc. All rights reserved.

THRIVING IN THE BIG DATA ERA

Volume

Variety

Velocity

Value

The

Analytics

Gap

Copyright © 2011, SAS Institute Inc. All rights reserved.

2. Big Data wird

DAS

entscheidende

Differenzierungs-

kriterium.

Copyright © 2011, SAS Institute Inc. All rights reserved.

SAS Press Release im April 2011

“In early April 2011, SAS demonstrated the power of high

performance analytics at its Global Forum meeting. In the

first case, two racks (16 nodes) of Greenplum's Data

Computing Appliance (DCA) were used to run a logistic

regression of bank loan defaults across a database with a

billion records, applying just a few variables. The regression

was able to complete in less than 80 seconds (as compared

to 20 hours for an unspecified serial implementation).”

0.11% of time needed

unspecified serial implementation takes 900 times longer

SAS Code: proc logistic changed to proc HPlogistic

Copyright © 2011, SAS Institute Inc. All rights reserved.

3. Big Data braucht

neue

Fähigkeiten.

Copyright © 2011, SAS Institute Inc. All rights reserved.

Statistik / Mathematik

Business Verständnis

Hacking

Entdecker-geist

Team – und Kommuni-kations-fäjhigkeit

Offenheit

Neugierde

Engineering

Data Scientist

Copyright © 2011, SAS Institute Inc. All rights reserved.

4. Big Data meint,

man soll Social

Media nutzen.

Copyright © 2011, SAS Institute Inc. All rights reserved.

Im Web wird Realität erzeugt

„Nichts passiert, was nicht genügend gedacht,

gesagt oder geschrieben worden ist.“

Facebook alleine ist das

3. grösste Land auf der Welt.

Copyright © 2011, SAS Institute Inc. All rights reserved.

5. Big Data

benötigt neue

Werkzeuge und

Technologie.

Copyright © 2011, SAS Institute Inc. All rights reserved.

TDWI Study Big Data Analytics

Copyright © 2011, SAS Institute Inc. All rights reserved.

20 Jahre Evolution

Copyright © 2011, SAS Institute Inc. All rights reserved.

SAS High Performance

Analytics (Product) Lösungs-Architektur – MPP Datenbank Appliance

Server N Server 2 Server 1

Teradata- oder

EMC² Greenplum-DB

Speicher für SAS

In-Memory Analytics

SAS High Performance

Deployment

SAS Embedded

Process

SAS High Performance

Analytics Plattform

MPI MPI

proc hplogistic data=MPPLib.MyTable; class A B C D ; model y = a b c

b*d x1-x100; output

out=MPPlib.logout pred=p; run;

Multiple

Threads

Multiple

Threads

Multiple

Threads

Database Storage

Database Server

Embedded

Process

Database Storage

Database Server

Embedded

Process

Database Storage

Database Server

Embedded

Process

SAS STAT Client

SAS Enterprise Miner

www.sas.com/hpa

Copyright © 2011, SAS Institute Inc. All rights reserved.

Traditionelle Architekturen – Scale Up

Ihre Aufgabe:

Bringen Sie 300 Passagiere in ein Flugzeug.

1 Eingang.

Wie lange dauert es?

Copyright © 2011, SAS Institute Inc. All rights reserved.

MPP – Massiv Parallele Verarbeitung

Ihre Aufgabe:

Bringen Sie 300 Passagiere in einen Zug.

20 Wagen, 2 Eingänge pro Wagen.

Wie lange dauert es?

Copyright © 2011, SAS Institute Inc. All rights reserved.

SAS AND EMC GREENPLUM UAP INTEGRATED ARCHITECTURE

Data

Scientist

Data

Engineer

Data

Analyst

Bl

Analyst LOB

User

Data

Platform

Admin

DA

TA

SC

IEN

CE

TE

AM

Greenplum Chorus - Analytic Productivity Layer

SAS Analytics

Private/Hybrid Cloud Infrastructure or Appliance

SAS Business Intelligence

SAS Information Management

Greenplum Database Greenplum Hadoop

Data Access & Query Layer (SAS ACCESS, SQL, MapReduce)

Copyright © 2011, SAS Institute Inc. All rights reserved.

2011 EMC Data Hero Award Winner

Silver Spring Networks – Energy Category

6. Big Data

transformiert

ganze Industrien.

Copyright © 2011, SAS Institute Inc. All rights reserved.

Alte Big Data Weisheit

„Wenn starker Wind aufkommt,

dann bauen die Einen Mauern,

die Anderen errichten Windmühlen.“

Copyright © 2011, SAS Institute Inc. All rights reserved.

All Data

Faster Answers

Elastic & Scalable

1 The Journey To Big Data

2 Data Science

Collaboration

Self-Service

Agile Analytics People & Productivity Focus

3 Real-Time Decisions

New Applications

Data Monetization

Analytic Productivity Platform

Agile Process & Tools

Predictive Enterprise Application Focus

Big Data Enabled Apps

Big Data Analytics Technology Focus

Analytics Engines

Cloud Infrastructure

Analytic Engines

Copyright © 2011, SAS Institute Inc. All rights reserved.

7. Big Data

verändert Leben.

2011 EMC Data Hero Award Winner Global Virus Forecasting – Public Service Category

Copyright © 2011, SAS Institute Inc. All rights reserved.

www.datascienceseries.com

Die Data Science Series

sind ein Forum für Menschen,

die die Welt verändern möchten.

Copyright © 2011, SAS Institute Inc. All rights reserved.

Copyright © 2011, SAS Institute Inc. All rights reserved.

Copyright © 2011, SAS Institute Inc. All rights reserved.

The Human Face of BIG DATA

Finding Islands of Understanding in a Sea of Information.

www.thehumanfaceofbigdata.com

Copyright © 2011, SAS Institute Inc. All rights reserved.

Zusammenfassung

Big Data ist heute

Big Data differenziert

Big Data macht Spass

Menschen kommen vor Technologie

Copyright © 2011, SAS Institute Inc. All rights reserved.

Copyright © 2011, SAS Institute Inc. All rights reserved.

Data Is Digital Air

“Data is increasingly digital air:

the oxygen we breathe and the

carbon dioxide that we exhale.

It can be a source of both,

sustenance and pollution.”

Danah Boyd, Microsoft Researcher

Copyright © 2010, SAS Institute Inc. All rights reserved.

make connections • share ideas • be inspired

André Münger

+41 79 708 85 99

andre.muenger@emc.com

EMC Computer Systems AG

Hardturmstrasse 181

8005 Zürich / Schweiz

top related