partner webcast – oracle big data preparation cloud service: transform to self service data...
TRANSCRIPT
Oracle Big Data Preparation Cloud Service: Transform to Self Service Data Preparation for Business Users
Jernej Kase
Cloud & Digital Partner Programs, Alliance & Channels, Oracle EMEA
Oracle Confidential – Internal/Restricted/Highly Restricted Copyright © 2015, Oracle and/or its affiliates. All rights reserved. |
Safe Harbor Statement
The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into any contract. It is not a commitment to deliver any material, code, or functionality, and should not be relied upon in making purchasing decisions. The development, release, and timing of any features or functionality described for Oracle’s products remains at the sole discretion of Oracle.
“Big Data’s dirty little secret is that 90% of time spent on a project is devoted to preparing data… After all the preparation work, there isn’t enough time left to do sophisticated analytics on it…”
Source: Thomas Davenport - Wall Street Journal, 2014
“In the past year, data preparation has become indispensable due to its overwhelming contribution to analyses and decision support.”
Source: Gartner (http://blogs.gartner.com/lakshmi-randall/2015/05/11/whats-next-data-preparation/)
Companies are struggling to derive value from big data initiatives…
Traditional methods:
• 90% of time is spent WRANGLING DATA
• MONTHS of effort spent on each new dataset
• PROGRAMMERS writing scripts or complex ETL
[Diagram labels: Internet; Logs; Enterprise ETL & Data Integration; Data Discovery & Visualization; Enterprise Reporting]
Oracle’s Solution: Big Data Preparation Cloud Service
Data Preparation
• Designed for data domain experts, not programmers
• Focused on cleansing, enriching and transforming unstructured business data
• Operationalize data flows into ETL or Business Intelligence
Key Benefits
• Easy to get started with browser based application
• Better recommendations engine combines machine learning with semantic technologies
• Integrated into Oracle Cloud
[Diagram labels: Any-Structured Business Data; Ingest from Sources; Enrich; Publish; Apache Spark ML + Hadoop + Semantic Graph; Data Visualization; ETL Processing; Enterprise Reporting]
Load Data for Oracle Business Intelligence Cloud Service
Easy Publishing to Oracle BICS
• Pre-integrated RESTful service connectivity
• Shared single sign on access across Oracle Cloud
• Common operational support from OPC services
Key Benefits
• Self Service access from non-technical users
• Cloud based applications can bypass need for extensive IT support for on premise tech
• Accelerated Value by enabling business users to quickly ingest data for operational BI
[Diagram labels: BDP-CS; BICS]
Integrated Data Verification, Transformation, and Visualization
Intuitive User Interface
• Knowledge Driven Recommendations
• Interactive Transform Script
• Metadata and Data Views
• Profile Metrics and Visualizations
Highly Differentiated from Other Data Prep Tools
Better Recommendations Engine
• Only Data Prep/Wrangling tool to combine Natural Language Processing (Apache NLP) with Machine Learning (Spark ML)
• Leverages Linked Open Data graph of domain knowledge
Key Benefits
• More Efficient Mapping by leveraging a more effective recommendation service
• Higher Quality automation “gets it right” more often
• Data Enrichment leveraging domain data for enriching sparse data sets
[Diagram labels: Hadoop; Oracle Cloud; Semantics based Knowledge Graph; Spark Machine Learning; Natural Language Processing]
Oracle Big Data Preparation Core Capabilities
• Unified solution to prepare unstructured data
• Simple to use tooling designed for non-programmers
• Unique technology approach combines Machine Learning (ML) with Natural Language Processing (NLP) engine
• Powered by Apache Spark, Hadoop, and UIMA
• Cloud operated from the Oracle Public Cloud
Ingest
• Import & Ingest
• Detect Schema
• Cleanse, Normalize & De-duplicate
• Detect & Mask Sensitive Data
Enrich
• Profile
• Annotate
• Data Classification
• Semantic Enrichments
• Missing Data Interpolation
Publish
• On Demand or Scheduled
• Source / Target Definition
• RESTful APIs
• Event Driven
Govern and Monitor
• Dashboards
• Automated Alerts
• Reusable user policies
• System Controls
• Security
• Stats & Metrics
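To make the "Detect & Mask Sensitive Data" capability listed on this slide more concrete, here is a minimal Python sketch of the general idea. It is not the service's implementation, which the slides do not describe. The names SSN_RE, CARD_RE, luhn_valid and mask_sensitive are hypothetical, and the patterns assume US-style SSNs (NNN-NN-NNNN) and 13 to 16 digit card numbers confirmed with a Luhn checksum.

```python
import re

# Assumed patterns (illustrative only, not taken from the product):
# US-style SSN and a 13-16 digit card number with optional space/dash separators.
SSN_RE = re.compile(r"\b\d{3}-\d{2}-\d{4}\b")
CARD_RE = re.compile(r"\b(?:\d[ -]?){12,15}\d\b")


def luhn_valid(digits: str) -> bool:
    """Return True if the digit string passes the Luhn checksum."""
    total = 0
    for i, ch in enumerate(reversed(digits)):
        d = int(ch)
        if i % 2 == 1:  # double every second digit, counting from the right
            d *= 2
            if d > 9:
                d -= 9
        total += d
    return total % 10 == 0


def mask_sensitive(text: str) -> str:
    """Mask SSNs entirely and card numbers except for the last four digits."""

    def mask_card(match: re.Match) -> str:
        digits = re.sub(r"\D", "", match.group())
        if not luhn_valid(digits):
            return match.group()  # digit run is probably not a card number
        return "*" * (len(digits) - 4) + digits[-4:]

    text = SSN_RE.sub("***-**-****", text)
    return CARD_RE.sub(mask_card, text)
```

The Luhn check is a design choice to reduce false positives: a long digit run such as an order number is left untouched unless it also passes the checksum.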
Big Data Preparation and Enrichment Examples
[Figure: three example use cases: Parse Click Stream Logs; Repair App Data; Classify Social Data. Labels in the figure: Structured, Unreliable; Unstructured, High Velocity; Unstructured, High Volume; Invalid and missing data; Sensitive data; Embedded information; No reliable patterns; Embedded information in unstructured text; Invalid emails; NLP; SSN; Credit Card Info; Entities]
Supported Formats
What’s New
BDP 16.1.3
• Runtime Null Checks
– Allows users to define a runtime threshold for the number of nulls in a column
– System checks for nulls at runtime and throws an error when the user-defined threshold is violated
– Job details tell users which columns violated the data quality (DQ) checks
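The runtime null check described above can be pictured with a short Python sketch. This is an illustration under assumptions, not the BDP implementation: the function check_null_thresholds, the exception NullThresholdError, and the representation of rows as dictionaries are all hypothetical.

```python
from typing import Dict, Iterable, Mapping, Optional


class NullThresholdError(Exception):
    """Raised when one or more columns exceed their allowed null count."""

    def __init__(self, violations: Dict[str, int]):
        self.violations = violations  # column name -> observed null count
        super().__init__(f"null threshold violated for columns: {sorted(violations)}")


def check_null_thresholds(
    rows: Iterable[Mapping[str, Optional[object]]],
    thresholds: Mapping[str, int],
) -> Dict[str, int]:
    """Count nulls per monitored column and raise if any threshold is exceeded.

    `thresholds` maps a column name to the maximum number of nulls tolerated.
    A missing key in a row is treated as a null (an assumption of this sketch).
    """
    counts = {col: 0 for col in thresholds}
    for row in rows:
        for col in thresholds:
            if row.get(col) is None:
                counts[col] += 1

    # Collect every violating column so the caller can report all of them,
    # mirroring the "job details" idea on the slide.
    violations = {col: n for col, n in counts.items() if n > thresholds[col]}
    if violations:
        raise NullThresholdError(violations)
    return counts
```

A caller would catch NullThresholdError and read its violations attribute to see which columns failed and by how many nulls.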
Custom Reference Knowledge
• Knowledge Import Feature
– Allows users to import custom knowledge in seconds (CSV or TSV)
– Custom Knowledge used to enhance Knowledge Service capabilities
• Knowledge Maintenance Page
– Allows users to manage reference knowledge: Create, Rename, Activate and Deactivate, Delete
[Diagram labels: Knowledge Service; Custom Knowledge Import]
Data Blending Assisted by Relationship Discovery
• Data Blending
– Empowers Business Analysts to blend datasets from multiple sources, in any format, into a single enriched file ready for downstream processes
• Cross Source Relationship Discovery
– Assists Business Analysts by discovering and recommending relations between two datasets that can be used as blend keys
– Powerful algorithm leverages deep column profile and fingerprinting
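The slides do not say how relationship discovery works beyond "column profile and fingerprinting". As a rough intuition only, the Python sketch below suggests candidate blend keys by comparing the sets of normalized values in each pair of columns (Jaccard similarity). This is a swapped-in, simplified technique, not Oracle's algorithm. The functions fingerprint and suggest_blend_keys, the min_score parameter, and the column-dictionary input format are all hypothetical.

```python
from itertools import product
from typing import Dict, List, Sequence, Set, Tuple


def fingerprint(values: Sequence[object]) -> Set[str]:
    """Reduce a column to its set of normalized, non-empty values."""
    return {
        str(v).strip().lower()
        for v in values
        if v is not None and str(v).strip()
    }


def suggest_blend_keys(
    left: Dict[str, Sequence[object]],
    right: Dict[str, Sequence[object]],
    min_score: float = 0.5,
) -> List[Tuple[str, str, float]]:
    """Rank (left column, right column) pairs by Jaccard overlap of their values.

    Each dataset is given as a mapping of column name -> list of values.
    Pairs scoring below `min_score` are dropped.
    """
    suggestions = []
    for (lcol, lvals), (rcol, rvals) in product(left.items(), right.items()):
        lset, rset = fingerprint(lvals), fingerprint(rvals)
        if not lset or not rset:
            continue  # an empty column cannot serve as a key
        score = len(lset & rset) / len(lset | rset)
        if score >= min_score:
            suggestions.append((lcol, rcol, round(score, 3)))
    return sorted(suggestions, key=lambda s: s[2], reverse=True)
```

Normalizing values to lowercase strings lets a numeric ID column in one source match a text ID column in another, which is a common situation when blending files from different systems.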
New and Improved User Interface/User Experience
• New Home Screen
– Real-time Processing Metrics
– Quick Start Menu with Video Assist
– Quick Links to Documentation
• New Simplified Creation Flows
– Asynchronous Ingestion Process
Accelerate Analytics
• Faster insights from growing data sources
• Increased collaboration between business and IT for data preparation
Lower Development Costs
• No more custom coding
• Less IT time spent on data set creation
• Data experts curate the data
Reduce Risks
• Avoid costly and error prone data curation efforts
• Data experts work directly on the data – not through requirements docs
Deeper Insights with Trustworthy Data
• Consistent, complete, trustworthy data
• Data driven decision making
• Governed data preparation
Q&A
Jernej Kase
ISV Migration Center blog: http://blogs.oracle.com/imc
ISV Migration Center email: [email protected]
Big Data Preparation Cloud Service Demo
Jernej Kase
Cloud & Digital Partner Programs, Alliance & Channels, Oracle EMEA
Getting started with Big Data Preparation Cloud Service
cloud.oracle.com/big-data-preparation
Oracle Partner Hub ISV Migration Center
Oracle.com Partner Hub
Team Info, Events/Activities Schedule, etc
Migration Center Team Blog
Webcasts, Howto, Demos, Guides, etc
Youtube: OracleIMCteam
Slideshare: Oracle_IMC_team
twitter.com/OracleIMC
plus.google.com/+OracleIMC
facebook.com/OracleIMC
linkedin.com/groups/Oracle-Partner-Hub-Migration-Center-4535240
feeds.feedburner.com/oracleimc