ibm cds overview
TRANSCRIPT
The Future of Analytics in ActionIBM Cloud Data Services
©2015 IBM Corporation
Why the Journey to Cloud-based Analytics?
MISSIONTo provide the best experience for developers and
enterprises with a comprehensive set of rich,
integrated cloud data services covering content, data
and analytics.
Fully managed 24x7 so you
can focus on new
development
Pay as you go with no
big up-front capital
investments
Instant provisioning saves
weeks of data center setup
FASTER
INNOVATION
BETTER IT
ECONOMICS
LOWER RISK
OF FAILURE
©2015 IBM Corporation
IBM Cloud Data Services
Cloudant dashDBBigInsights on
Cloud
Spark as a
ServiceDB2 on Cloud
NoSQL DBaaSAnalytic Data
WarehouseHadoop in the Cloud
Fully-managed
Spark Service
Hosted Database in
the Cloud
• Global data
distribution
• Massively scalable
• Eventually
consistent data
model
• Built for mobile,
Systems of
Engagement
• SQL interface
• Massively parallel
• ACID compliance
• Columnar, in-
memory
performance
• BLU augmented
with NZ in-DB
analytics
• Built for Systems of
Insight
• Bare metal
performance
• Build on reference
architecture
• BigInsights
enterprise features
• Optimized for
extremely fast and
large scale data
processing
• Spark SQL,
Streaming, MLlib,
GraphX
• Build and run apps
benefiting from
operational,
maintenance and
hardware
excellence
• Power of DB2
• Fast Provisioning
• Flexible pricing
• No loss of DBA
control
• Built for Systems of
Record
©2015 IBM Corporation
Watson AnalyticsAnalytics & Visualization
Services
DataWorksData Refinery
Services
BigInsights on Cloud• Spark for in-memory Hadoop
• Built on IBM Open Platform
• Bare metal performance
• BigInsights enterprise features
Cloudant• Database as a Service (DBaaS)
• Massively scalable for global data distribution
• Eventually consistent data model
• Built for mobile, Systems of Engagement
dashDB• SQL interface
• ACID compliance
• Columnar, in-memory performance
• BLU augmented with Netezza in-DB analytics
• Built for Systems of Insight
• Native integration with Watson Analytics
DB2 on Cloud• DB2 RDBMS provisioned
on Bluemix
• SQL interface
• ACID compliance
• Fast provisioning
• Built for Systems of Record
ANALYTICAL TRANSACTIONAL
UNSTRUCTURED
STRUCTURED
Mixed workloads and data types are knit together with DataWorks for true hybrid services
IBM Cloud Data Services
©2015 IBM Corporation
The IBM Cloud
- “Bare-metal” outperforms virtualized
- Dedicated hardware
- 40 data centers worldwide
©2015 IBM Corporation
What is Bluemix?
Bluemix is an open-standard, cloud-based platform for building,
managing, and running applications of all types (web, mobile, big
data, new smart devices, and so on).
Go Live in Seconds
Zero to running in one click.
Development plans deploy in
seconds. Enterprise plans
deploy in 1-2 days.
DevOps
Development, monitoring,
deployment, and logging tools
allow the developer to run the
entire application.
APIs and Services
A catalog of IBM, third party,
and open source API services
allow the developer to stitch an
application together in minutes.
On-Prem Integration
Build hybrid environments.
Connect to on-premise assets
plus other public and private
clouds.
Flexible Pricing
Sign up in minutes. Pay as
you go and subscription
models offer choice and
flexibility.
Layered Security
IBM secures the platform and
infrastructure and provides
you with the tools to secure
your apps.
7 © 2015 IBM Corporation
Security and Compliance for All CDS Offerings
Vulnerability Scanning
Audit Log consolidation and analysis
Use Access management
PSRIT – Security Incident Management
Legal, Regulations and Compliance
Education, training and Awareness
Business Continuity and Disaster Recovery
Network Architecture and Design
Intrusion Prevention
Operation System security hardening
Secure Engineering Development practices: threat modeling, risk assessment, static and dynamiccode analysis
Secured development life Cycle
Security Architecture and Design
Access Control, Authentication and Authorization
Data Protection
Security Logging
Functional
Infrastructure
Development
Governance & Compliance
Operational
SoftLayer: physical security compliance
Scales & remains available to
1 billion users across Asia,
North America, Europe
Transactional
Throughput
300 million requests / day
( 3,500 / second )
Cluster Distribution Global (mobile devices)
Media Types
Ingested
Structured, semi-structured,
unstructured (logs, audio)
fully managed services in
support of massive
concurrent user growth
Transactional
Throughput
2 billion requests / day
( 20 : 1 read-writes )
Data Volume 130 TB
Cluster
Growth
From 6 to over 200 servers
(in 12 months)
©2015 IBM Corporation
A Large Investment Research & Management Firm: needed a
persistent data store to maintain and access financial analytical reports
Cloudant’s schema-less architecture and horizontal scalability enables
their users users to have real-time access to reports and analytics
generated by IBM PureData for Analytics
Use Case
©2015 IBM Corporation
Use Case
The Red 10 is a data-driven marketing analytics firm in the UK. With dashDB, they are able to provide
real-time analytics and updates to give an accurate view to the audiences
With dashDB, they can provide 1) a live view of the UK & Ireland markets, 2) new segmentation based
on live contact views, 3) an instant view of all relevant information, and 4) the right message, at the right
time, through the right medium.
This enables growth for less and increased conversion across the sales funnel for their clients.
©2015 IBM Corporation
A global pharmaceutical company with more than 90 years of innovation and leadership in
diabetes care is using IBM’s Enterprise Hadoop as a Service (EHaaS) offering
They are using BigInsights on Cloud on sets of electronic medical records (EMR) data to
analyze the relevance of a pharmacological treatment of obesity and obtain costs estimates to
build an economic model for obesity treatments.
Pharmaceutical Use Case
©2015 IBM Corporation
Common Spark use cases
1. Running large data processing batch jobs (e.g. nightly ETL from production systems, primary Hadoop use case)
2. Interactive querying of very large data sets (e.g. BI)
3. Complex analytics and data mining across various types of data
4. Building and deploying rich analytics models (e.g. risk metrics)
5. Implementing near-realtime stream event processing (e.g. fraud / security detection)
The Search Continues:
SETI Institute enhances
E.T. search with advanced
analytic platform
Need
• Allen Telescope Array (ATA) has been recording and sifting
data from the cosmos for SETI in an effort to
explore, understand, and explain the origin and nature of
life in the universe.
• Perform omni-data analysis on over a decade of radio
telescope signal with new analytic algorithm to look for
narrow band signals
Benefits
• Tens of millions of ATA signal events have been recorded in binary files, which in turn are linked to hundreds of millions of records in a structured database that provides additional information about the signal event, such as the exact date-time of the signal, the target coordinates, and other details. The IBM Spark project is linking these two data sets – in their entirety – for the first time.
• IBM Spark : By analyzing the vast archives of ATA content, new algorithms are already being developed to isolate human radio frequency interference (RFI) from external signals which deserve further scrutiny.
©2015 IBM Corporation
Systems of Insight
Systems of Engagement
(NoSQL, Mobile Apps, Social Media, IoT, others)
Systems of Record(DB2, Oracle, HDP, flat files, others;
cloud-based or on-premise)
Continuous
Synchronization
IBM & Third Party Integrations(Watson Analytics, Cognos, SPSS, SAS,
Tableau, ESRI ArcGIS, Aginity, others)
Watson
Analytics
Cloud data services for systems of engagement, insight, and record with self-service BI
– helping you understand and engage with your customers better