oracle big data discovery 994294
TRANSCRIPT
Oracle Big Data Discovery Product Overview
1
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. |
Richard Tomlinson Director, Product Management Oracle Big Data Discovery
Kip Bowes VP Services Cloudera
2
Speakers
3 © Cloudera, Inc. All rights reserved.
Strong Partnership for Enterprise Big Data
+
4 © Cloudera, Inc. All rights reserved.
Data Changes How We Work
Everything that can be measured will be measured.
Employees and customers expect more personal interactions, but not at the cost of their privacy.
The most innovative companies embrace experimentation and agility.
Instrumentation Consumerization Experimentation
5 © Cloudera, Inc. All rights reserved.
Cloudera Enterprise powered by Apache Hadoop
A new kind of data platform. • One place for unlimited data
• Unified, multi-framework data access
Only with Cloudera:
• Enterprise Security
• Data Governance
• Complete Management
• Open source, open standards
Security and Administration
Unlimited Storage
Process Discover
Model Serve
Deployment Flexibility
On-Premises Appliances Engineered Systems
Public Cloud Private Cloud Hybrid Cloud
6 © Cloudera, Inc. All rights reserved.
Data Discovery is the #1 fastest growing workload for enterprise analytics.
7 © Cloudera, Inc. All rights reserved.
Data Discovery & Analytics (DD&A) : The ability to find enterprise data and quickly uncover new insights and optimize existing analytics. (AKA: Self-service BI, BI, Data Discovery, Advance Analytics, Machine Learning)
8 © Cloudera, Inc. All rights reserved.
Discovery and Analytics is an Iterative Process
Report, Model, or Rules
Ingest
Transformation
80% of Time Preparing
Diverse Ingest Search and lineage Agile Transforms
20% of Time Analyzing
SQL Statistical
Machine Learning
Implement Point Solution Custom App
Analysis Technique
Access
Data Generatio
n
Data Discovery & Analytics
Flow
9 © Cloudera, Inc. All rights reserved.
The Challenge for Data Discovery Projects
How do we make data preparation 20% of the effort so businesses can focus 80% of their time on executing from analytics?
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. |
Requires a Fundamentally New Approach
10
quickly transform and enrich it to make
it better
unlock big data for anyone to discover
and share new value
A single intuitive and visual user interface, to...
find and explore big data to understand its
potential
find explore transform discover share
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | 11
Oracle Big Data Discovery. The Visual Face of Hadoop
find explore transform discover share
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. |
Oracle Big Data Discovery. The Visual Face of Hadoop
12
find explore transform discover share See the potential in big data
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. |
Catalog
13
• Access a rich, interactive catalog of all data in Hadoop
• Familiar search and guided navigation for ease of use
• See data set summaries, user annotation and recommendations
• Provision personal and enterprise data to Hadoop via self-service
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. |
Explore
14
• Visualize all attributes by type
• Sort attributes by information potential
• Assess attribute statistics, data quality and outliers
• Use scratch pad to uncover correlations between attributes
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. |
Oracle Big Data Discovery. The Visual Face of Hadoop
15
find explore transform discover share Quickly make big data better
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | 16 16
• Intuitive, user driven data wrangling
• Extensive library of powerful data transformations and enrichments
• Preview results, undo, commit and replay transforms
• Test on sample data then apply to full data set in Hadoop
Transform
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. |
Oracle Big Data Discovery. The Visual Face of Hadoop
17
find explore transform discover share Unlock big data for everyone
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | 18
• Join and blend data for deeper perspectives
• Compose project pages via drag and drop
• Use powerful search and guided navigation to ask questions
• See new patterns in rich, interactive data visualizations
Discover
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | 19
• Share projects, bookmarks and snapshots with others
• Build galleries and tell big data stories
• Collaborate and iterate as a team
• Publish blended data to HDFS for leverage in other tools
Share
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. |
Oracle Big Data Discovery. Technical Innovation on CDH
Oracle Confidential – Internal 20
Oracle Big Data Discovery Workloads
Hadoop Cluster (BDA or Commodity
Hardware)
BDD node
data node
data node
data node
data node
name node Data Processing, Workflow & Monitoring
• Profiling: catalog entry creation, data type &
language detection, schema configuration • Sampling: dgraph (index) file creation • Transforms: >100 functions • Enrichments: location (geo), text (cleanup,
sentiment, entity, key-phrase, whitelist tagging)
Self-Service Provisioning & Data Transfer
• Personal Data: Upload CSV and XLS to HDFS
In-Memory Discovery Indexes
• DGraph: Search, Guided Navigation, Analytics
Studio
• Web UI: Find, Explore, Transform, Discover, Share
Hadoop 2.x
Filesystem (HDFS)
Workload Mgmt (YARN)
Metadata (HCatalog)
Other Hadoop Workloads
MapReduce
Spark
Hive
Pig
Oracle Big Data SQL (BDA only)
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. |
Oracle Big Data Discovery. A Game Changing Platform
Business Benefits
• Get value faster. Rapidly turn raw data into actionable insights, leveraged across the enterprise
• Democratize value from Big Data. Increase the size, diversify the skills, and improve the efficiency of Big Data teams
21
See the Potential in Big Data, Quickly Make it Better and Unlock Value for Everyone
Technical Benefits
• Destroy existing technical barriers. Run natively on Hadoop cluster for maximum scalability and performance
• Publish, secure and leverage. Integrate with Hadoop open standards and leverage the unified Oracle big data ecosystem
Product Demo
22
www.oracle.com/bigdatadiscovery
Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | 23
Questions?
Please submit questions via the
“Q and A” box and we will
answer them live.