Dell EMC Data Analytics Strategy
ANDREAS WALZEL
MANAGER SYSTEMS ENGINEERING, DELL EMC
© Copyright 2017 Dell Inc. 2
Key investments in data analytics technologies
Key Indicators
• Boosting Investments: 11.7% of enterprises will increase spending on data analytics every year through 2020, with the market worth $203B¹.
• Hadoop Momentum: 63.4% are investing in Hadoop-based products².
• Lack of Maturity: IDC warns of a shortage of available data scientists, with negative impacts on business-IT relationships.
Benefits recognized but difficulties persist. Only a small percentage of organizations have Hadoop effectively deployed.
¹ IDC ² Allied Market Research
Data-driven organizations are more effective
• But getting there is a challenge
1.8% of organizations in a recent study said that data analytics was unimportant to them¹
And 69.9% said it was mission critical¹…
¹ NewVantage Partners Big Data Executive Survey 2016.
BECOME DATA DRIVEN
Your journey begins with a single step
Success of data analytics projects and gaining competitive edge are key factors to growth of new initiatives
Enterprise Challenges in Digital Transformation
Drive for deeper insights is accelerating need for architectures to enable unstructured analytics
[Chart: data growth, with IoT callout: 11ZB (2016), 22ZB (2018), 44ZB (2020)]
• Manage Data Growth
• Perform Advanced Analytics
• Handle Unstructured Data Sources
• Drive Real-Time Results
Organizations need to deliver analytics on more than just their traditional structured data
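The data-growth figures on this slide (11ZB in 2016, 22ZB in 2018, 44ZB in 2020) amount to a doubling every two years. A minimal sketch of the implied compound annual growth rate, assuming smooth exponential growth between the quoted points; the function name is illustrative and not part of the deck:

```python
def implied_annual_growth(start_zb: float, end_zb: float, years: int) -> float:
    """Compound annual growth rate implied by two data-volume points."""
    return (end_zb / start_zb) ** (1 / years) - 1


# Figures quoted on the slide: 11ZB in 2016 and 44ZB in 2020.
rate = implied_annual_growth(11, 44, 2020 - 2016)
print(f"Implied annual growth: {rate:.1%}")
```

Under that assumption the yearly growth factor is the square root of 2, which is simply another way of stating "doubles every two years".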
Evolving spectrum of data analytics
Requires infrastructure that enables multiple applications and varied use cases
• Predictive Analytics
• Business Intelligence
• Analytics of Things
• Cyber security Analytics
• Real-time Analytics
• Machine Learning
Dell EMC Unstructured Analytics Portfolio
Enables analytics for ALL of your data
PowerEdge
• Performance Centric
• Storage Centric
• Archive Centric
Predictive Analytics, Business Intelligence, Analytics of Things, Cyber security Analytics, Real-time Analytics, Machine Learning
Dell EMC Unstructured Analytics Portfolio
Proven solutions for unstructured analytics
PowerEdge
Solution accelerators
• Splunk Ready System
• Hadoop Ready Bundle
• QuickStart for Hadoop
• EDW Optimization Solutions
• Hadoop Backup Solutions
• SAS-Grid Solution with Isilon
• Streaming Analytics Solutions
HADOOP DECISIONS
DAS
ECS
3 TRADITIONAL DISCOVERY QUESTIONS
1 What do you hope to achieve with Hadoop?
2 Why is this impactful to your business?
3 Which Hadoop Distribution will you choose?
NEXT LEVEL QUESTIONS
• Access
• Implementation
• Compliance
• Scalability
• Tools & Apps
• Business Units
• Consolidate
EMC ISILON HDFS INTERFACE
• Native HDFS support
• Underlying file system is OneFS
• As simple as pointing the HDFS clients to the DNS name of the Isilon cluster!
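To illustrate what "pointing the HDFS clients" can look like in practice, here is a hedged sketch that generates a Hadoop `core-site.xml` fragment whose `fs.defaultFS` property targets the cluster's DNS name. The host name `isilon.example.com` and port 8020 are assumptions for illustration only; use the name and HDFS port configured for your own cluster.

```python
import xml.etree.ElementTree as ET

# Hypothetical DNS name of the Isilon cluster; replace with your own.
ISILON_DNS_NAME = "isilon.example.com"
# Assumed HDFS RPC port; confirm the port your cluster actually listens on.
HDFS_PORT = 8020


def build_core_site(dns_name: str, port: int) -> str:
    """Return a core-site.xml fragment that sets fs.defaultFS to the given host."""
    configuration = ET.Element("configuration")
    prop = ET.SubElement(configuration, "property")
    ET.SubElement(prop, "name").text = "fs.defaultFS"
    ET.SubElement(prop, "value").text = f"hdfs://{dns_name}:{port}"
    return ET.tostring(configuration, encoding="unicode")


core_site_xml = build_core_site(ISILON_DNS_NAME, HDFS_PORT)
print(core_site_xml)
```

With a setting like this, the Hadoop compute nodes resolve their default file system through the cluster's DNS name rather than through a dedicated NameNode host.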
Traditional “Share-Nothing” Hadoop
[Diagram: Existing Virtualized Data Center with Existing Primary Storage holding the Unstructured Data (labelled 1), next to a SHARE-NOTHING Hadoop Infrastructure whose nodes are each labelled 2, 3, 4]
• Hadoop on a Stick (R=3) means 5 data copies ($$$$)
• Data has to be copied to the Hadoop cluster before analysis can begin (Time to Results)
How will you maintain data consistency when a file changes on your primary storage?
Isilon “Share-Everything” Hadoop
[Diagram: Existing Virtualized Data Center in which New Hadoop Compute Nodes use the Native HDFS Protocol to read the Unstructured Data (labelled 1) on Existing Primary Storage]
Start using Hadoop NOW with unused processing and RAM available in your VMware environment
No replication required (Use your existing data)
Access to same data via NAS and HDFS protocols
Time to results extremely fast using already existing data with NO COPIES or wasted $$$$
Analysis Can Begin with the 1st VM
TIME-TO-RESULTS
Data Copy Analysis
[Diagram: Existing Primary Storage copying data over the Data Center Network to Hadoop on a Stick]
Have you ever copied 100TB from Primary Storage to a Hadoop system?
How long does it take to copy 100TB from one place to another over a 10Gb link?
>24 Hours
In-Place Analysis
[Diagram: Hadoop Compute Nodes reading relevant data for analysis from Existing Primary Storage over the Data Center Network]
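The ">24 Hours" figure can be sanity-checked with simple arithmetic. The sketch below assumes decimal units (1TB = 10^12 bytes, 10Gb/s = 10^10 bits per second) and a hypothetical link efficiency factor to account for protocol overhead and contention; the 80% value is an illustrative assumption, not a number from the deck.

```python
def copy_time_hours(data_tb: float, link_gbps: float, efficiency: float = 1.0) -> float:
    """Hours needed to move data_tb terabytes over a link of link_gbps gigabits per second."""
    bits = data_tb * 1e12 * 8                        # terabytes -> bits
    seconds = bits / (link_gbps * 1e9 * efficiency)  # effective throughput in bits/s
    return seconds / 3600


ideal = copy_time_hours(100, 10)                      # link fully saturated
realistic = copy_time_hours(100, 10, efficiency=0.8)  # assumed 80% effective throughput
print(f"Ideal: {ideal:.1f} h, at 80% efficiency: {realistic:.1f} h")
```

Even at full line rate the copy takes roughly 22 hours, so any realistic overhead pushes it past the 24-hour mark quoted on the slide.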
HADOOP WITH EMC DATA LAKE
1 Multi Protocol Scale-Out Storage Platform
• NFS, SMB, FTP, HTTP, HDFS, SWIFT
2 Enterprise Data Protection & Governance
• SnapshotIQ, SyncIQ, SmartLock, ACLs..
3 Industry-Leading Storage Efficiency
• >80% Storage Utilization
4 Independent Scalability with Optimized QoS
• Optimally Scale Storage & Compute
5 Consolidate Data Silos
• Industry Standard Protocols
• Bring Applications to Shared Data
6 Hadoop as a Service
• Eliminate Shadow IT
• Offer variations of Hadoop to all your BUs
7 Regulatory Compliance out-of-the-box
• PCI DSS, HIPAA, GINA, SOX, SEC
• ….
8 No Migrations and Minimized Management
• In-place Technology Swaps without Disruption
• Manage TB to PB within One Filesystem
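The ">80% Storage Utilization" point can be set against the R=3 replication mentioned for share-nothing Hadoop. A rough sketch, assuming utilization means usable capacity divided by raw capacity and taking exactly 80% as the lower bound; the 100TB data set and the function names are illustrative only:

```python
def raw_tb_with_replication(usable_tb: float, replicas: int) -> float:
    """Raw capacity needed when every block is stored `replicas` times."""
    return usable_tb * replicas


def raw_tb_with_utilization(usable_tb: float, utilization: float) -> float:
    """Raw capacity needed when `utilization` of the raw capacity is usable."""
    return usable_tb / utilization


usable = 100.0  # hypothetical data set size in TB
replicated = raw_tb_with_replication(usable, replicas=3)
shared = raw_tb_with_utilization(usable, utilization=0.80)
print(f"R=3 replication: {replicated:.0f} TB raw")
print(f"80% utilization: {shared:.0f} TB raw")
```

Under these assumptions, threefold replication corresponds to about 33% utilization, so the same data set needs well over twice the raw capacity.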
HADOOP OUTLOOK
• 80% Utilization
• Capacity
• Independent Scalability
• Implementation
• Enterprise features
• Data Access
• Applications & Tools
• Hadoop-as-a-service
• Compliance & Regulation
Unstructured Analytics Use Cases
Enabling enterprises to improve operational efficiencies and monetize new revenue streams
• Fraud Detection & Risk Analytics
• Trading / Tick Data Analytics
• IoT
• Data Driven Business Transformation
• Customer 360 Analytics
Right Solution Configuration for the use case
CUSTOMER REQUIREMENTS drive CONFIGURATION
Performance-centric (one or more):
• High Performance w/ cost as main driver
• 100% Compliance to Hadoop Operational features
• Ability to scale down at cost
Storage-centric (one or more):
• Storage scaling faster than compute
• Enterprise Grade File Mgmt.
• Consolidation of IT Workloads
• Aggregate capacity > 100 TB
Archive-centric:
• Geo-distributed single namespace
[Diagram: Analytics and Hadoop on PowerEdge, shown either as Compute + Data with Direct Attached Storage or as separate Compute and Data with Shared Storage]
Data analytics offerings
Visit: dellemc.com/bigdata