
Post on 11-Aug-2020


GLOBAL SPONSORS

Dell EMC Data Analytics Strategy

ANDREAS WALZEL

MANAGER SYSTEMS ENGINEERING, DELL EMC

ANDREAS.WALZEL@DELL.COM

© Copyright 2017 Dell Inc. 2

Key Indicators

Key investments in data analytics technologies

Boosting Investments, Hadoop Momentum, Lack of Maturity

• 11.7% of enterprises will increase spending on data analytics every year through 2020, with the market worth $203B¹.

• 63.4% are investing in Hadoop-based products².

• Benefits are recognized but difficulties persist. Only a small percentage of organizations have Hadoop effectively deployed.

• IDC warns of a shortage of available data scientists, with negative impacts on business-IT relationships.

¹ IDC ² Allied Market Research


Data-driven organizations are more effective

• But getting there is a challenge

1.8% of organizations in a recent study said that data analytics was unimportant to them¹

And 69.9% said it was mission critical¹…

BECOME DATA DRIVEN

Your journey begins with a single step

Success of data analytics projects and gaining competitive edge are key factors to growth of new initiatives

¹ NewVantage Partners Big Data Executive Survey 2016.

Enterprise Challenges in Digital Transformation

Drive for deeper insights is accelerating need for architectures to enable unstructured analytics

DATA: 11ZB (2016), 22ZB (2018), 44ZB (2020)

IoT

Manage Data Growth

Perform Advanced Analytics

Handle Unstructured Data Sources

Drive Real-Time Results

Organizations need to deliver analytics on more than just their traditional structured data
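
The data-growth figures on this slide (11ZB in 2016, 22ZB in 2018, 44ZB in 2020) double every two years. As a rough illustration only, the sketch below derives the compound annual growth rate those three points imply. The function name and the dictionary are hypothetical helpers, not something taken from the presentation.

```python
# Data volumes as quoted on the slide, in zettabytes (ZB).
volumes_zb = {2016: 11, 2018: 22, 2020: 44}


def annual_growth_rate(start_year, end_year, volumes):
    """Compound annual growth rate implied by two data points."""
    years = end_year - start_year
    ratio = volumes[end_year] / volumes[start_year]
    return ratio ** (1 / years) - 1


rate = annual_growth_rate(2016, 2020, volumes_zb)
print(f"Implied annual growth 2016-2020: {rate:.1%}")
```

Doubling every two years works out to roughly 41% growth per year, which is the scale of growth the "Manage Data Growth" challenge refers to.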

Evolving spectrum of data analytics

Requires infrastructure that enables multiple applications and varied use cases

Predictive Analytics

Business Intelligence

Analytics of Things

Cyber security Analytics

Real-time Analytics

Machine Learning

Enables analytics for ALL of your data

Dell EMC Unstructured Analytics Portfolio

PowerEdge

Performance Centric

Storage Centric

Archive Centric

Predictive Analytics

Business Intelligence

Analytics of Things

Cyber security Analytics

Real-time Analytics

Machine Learning


Dell EMC Unstructured Analytics Portfolio

PowerEdge

Solution accelerators

Splunk Ready System

Hadoop Ready Bundle

QuickStart for Hadoop

EDW Optimization Solutions

Hadoop Backup Solutions

SAS-Grid Solution with Isilon

Streaming Analytics Solutions

Proven solutions for unstructured analytics


HADOOP DECISIONS

DAS

ECS


3 TRADITIONAL DISCOVERY QUESTIONS

1 What do you hope to achieve with Hadoop?

2 Why is this impactful to your business?

3 Which Hadoop Distribution will you choose?


NEXT LEVEL QUESTIONS

Access

Implementation

Compliance

Scalability

Tools & Apps

Business Units

Consolidate


EMC ISILON HDFS INTERFACE

• Native HDFS support

• Underlying file system is OneFS

• As simple as pointing the HDFS clients to the DNS name of the Isilon cluster!
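
As a hedged illustration of what "pointing the HDFS clients to the DNS name" could look like, the sketch below generates a minimal Hadoop `core-site.xml` whose `fs.defaultFS` property targets a cluster DNS name. The hostname `isilon.example.com`, the port 8020, and the helper name `build_core_site` are assumptions made for this example and do not come from the presentation; check the actual values against your own environment.

```python
import xml.etree.ElementTree as ET


def build_core_site(cluster_dns_name, port=8020):
    """Return core-site.xml text with fs.defaultFS set to the given DNS name.

    The port default is an assumption for this sketch, not a value
    stated on the slide.
    """
    configuration = ET.Element("configuration")
    prop = ET.SubElement(configuration, "property")
    ET.SubElement(prop, "name").text = "fs.defaultFS"
    ET.SubElement(prop, "value").text = f"hdfs://{cluster_dns_name}:{port}"
    return ET.tostring(configuration, encoding="unicode")


# Hypothetical DNS name standing in for the Isilon cluster.
print(build_core_site("isilon.example.com"))
```

With a setting of this kind, Hadoop clients address the storage cluster by name instead of a dedicated NameNode host.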


Traditional “Share-Nothing” Hadoop

[Diagram: Existing Virtualized Data Center with Existing Primary Storage holding Unstructured Data (copy 1); SHARE-NOTHING Hadoop Infrastructure holding copies 2, 3, 4 on each node]

• Hadoop on a Stick (R=3) means 5 data copies ($$$$)

• Data has to be copied to the Hadoop cluster before analysis can begin (Time to Results)

How will you maintain data consistency when a file changes on your primary storage?


Isilon “Share-Everything” Hadoop

[Diagram: Existing Virtualized Data Center with Existing Primary Storage holding Unstructured Data (copy 1); New Hadoop Compute Nodes; Use Native HDFS Protocol; Analysis Can Begin with the 1st VM]

Start using Hadoop NOW with unused processing and RAM available in your VMware environment

No replication required (Use your existing data)

Access to same data via NAS and HDFS protocols

Time to results extremely fast using already existing data with NO COPIES or wasted $$$$


TIME-TO-RESULTS

Data Copy Analysis: Existing Primary Storage, Data Center Network, Hadoop on a Stick

Have you ever copied 100TB from Primary Storage to a Hadoop system?

How long does it take to copy 100TB from one place to another over a 10Gb link?

>24 Hours

In-Place Analysis: Existing Primary Storage, Data Center Network, Hadoop Compute Nodes

Reading relevant data to analysis
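
The ">24 Hours" answer to the 100TB question can be checked with back-of-the-envelope arithmetic. The sketch below is only an estimate under stated assumptions: decimal units (1 TB = 10^12 bytes, 10Gb = 10^10 bits per second) and a hypothetical `efficiency` factor for protocol overhead and contention. The 0.8 value is illustrative, not a figure from the presentation.

```python
def copy_time_hours(data_tb, link_gbit_per_s, efficiency=1.0):
    """Hours needed to move data_tb terabytes over a link.

    efficiency is the assumed fraction of line rate actually achieved.
    """
    data_bits = data_tb * 1e12 * 8          # decimal terabytes to bits
    effective_bits_per_s = link_gbit_per_s * 1e9 * efficiency
    return data_bits / effective_bits_per_s / 3600


ideal = copy_time_hours(100, 10)                       # full line rate
realistic = copy_time_hours(100, 10, efficiency=0.8)   # assumed 80% of line rate

print(f"At line rate:      {ideal:.1f} h")
print(f"At 80% efficiency: {realistic:.1f} h")
```

Even at full line rate the copy takes about 22 hours, so any realistic overhead pushes it past a day before analysis can start.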


HADOOP WITH EMC DATA LAKE

1 Multi Protocol Scale-Out Storage Platform

• NFS, SMB, FTP, HTTP, HDFS, SWIFT

2 Enterprise Data Protection & Governance

• SnapshotIQ, SyncIQ, SmartLock, ACLs..

3 Industry-Leading Storage Efficiency

• >80% Storage Utilization

4 Independent Scalability with Optimized QoS

• Optimally Scale Storage & Compute

5 Consolidate Data Silos

• Industry Standard Protocols

• Bring Applications to Shared Data

6 Hadoop as a Service

• Eliminate Shadow IT

• Offer variations of Hadoop to all your BUs

7 Regulatory Compliance out-of-the-box

• PCI DSS, HIPAA, GINA, SOX, SEC

• ….

8 No Migrations and Minimized Management

• In-place Technology Swaps without Disruption

• Manage TB to PB within One Filesystem
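
To put the ">80% Storage Utilization" item next to the replication factor quoted earlier for share-nothing Hadoop (R=3), the sketch below compares the raw capacity each approach would need for the same amount of usable data. It models only the replication factor and the utilization figure named in the slides. The function names and the 100 TB workload are hypothetical, and the additional primary-storage copies counted on the share-nothing slide are left out.

```python
def raw_tb_with_replication(usable_tb, replication_factor=3):
    """Raw capacity when every block is stored replication_factor times."""
    return usable_tb * replication_factor


def raw_tb_with_utilization(usable_tb, utilization=0.80):
    """Raw capacity when a given fraction of raw space holds usable data."""
    return usable_tb / utilization


usable_tb = 100  # illustrative data set size, not from the slides
replicated = raw_tb_with_replication(usable_tb)
efficient = raw_tb_with_utilization(usable_tb)

print(f"R=3 replication: {replicated:.0f} TB raw")
print(f"80% utilization: {efficient:.0f} TB raw")
```

Under these assumptions the replicated layout needs more than twice the raw capacity for the same usable data.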


HADOOP OUTLOOK

80% Utilization

Capacity

Independent Scalability

Implementation

Enterprise features

Data Access

Applications & Tools

Hadoop-as-a-service

Compliance & Regulation

Unstructured Analytics Use Cases

Enabling enterprises to improve operational efficiencies and monetize new revenue streams

Fraud Detection & Risk Analytics

Trading / Tick Data Analytics

IoT

Data Driven Business Transformation

Customer 360 Analytics

Right Solution Configuration for the use case

CUSTOMER REQUIREMENTS drive CONFIGURATION

Customer requirements:

• High Performance w/ cost as main driver

• 100% Compliance to Hadoop Operational features

• Ability to scale down at cost

(One or more)

• Storage scaling faster than compute

• Enterprise Grade File Mgmt.

• Consolidation of IT Workloads

• Aggregate capacity > 100 TB

(One or more)

• Geo-distributed single namespace

• Analytics and Hadoop

Configurations: Performance-centric, Storage-centric, Archive-centric

[Diagram labels: PowerEdge (three times); Compute + Data; Compute, Data (twice); Direct Attached Storage; Shared Storage]


Data analytics offerings


Visit: dellemc.com/bigdata
