cloudera showcase cask

Simple Access to Powerful Technology http://cask.co

Upload: cloudera-inc

Post on 06-Aug-2015



Tom Aliotti

Big Data Technology Explosion

Hadoop Challenges

CASK DATA APP PLATFORM

ABSTRACTION: Hide Complexity and Enable Reuse

INTEGRATION: Provide Capabilities over Features

TOOLS & SERVICES: Support Applications from Dev to Prod

Open Source, Integrated Platform for Developers and Organizations to Build, Deploy, and Manage Data Applications


Hadoop is the Distributed OS

CDAP is the Distributed App Framework

PROPRIETARY & CONFIDENTIAL

Cask is delivering the Developer Platform for Big Data Applications

•  Founded by early Hadoop engineers from Facebook and Yahoo!

•  Focused on developers and enabling big data applications

We are the WebLogic of Big Data


CASK DATA APP PLATFORM

CDAP Capabilities

•  Datasets: Libraries to build reusable data access patterns spanning multiple storage technologies

•  Programs: Standardized containers providing consistency for diverse processing paradigms

•  Runtime Services: Services that let developers build richer apps with less hassle, and that let production teams manage applications and data

+ Ingestion, Egress, Tools & User Experience


CDAP Integration with Cloudera

Cask Data Application Platform (CDAP): Cloudera Integration Today

Cloudera Manager: The CDAP CSD (Custom Service Descriptor) enables installation, updates, and monitoring of CDAP within Cloudera Manager

Impala: The CDAP adapter for Impala transforms data into Impala-optimized file formats with a few simple commands

Future:

Further Impala integration

Integration with Sentry

Integration with Navigator

Support for Spark Streaming

Support for Cloudera Search

[Diagram: Cloudera’s Enterprise Data Hub. Labels: Process, Discover, Model, Serve; Security and Administration; Unlimited Storage; Deployment Flexibility (On-Premises, Appliances, Engineered Systems; Public Cloud, Private Cloud, Hybrid Cloud)]

[Diagram: Cask Data Application Platform (CDAP). Labels: Event / Data Ingestion; Programs (Batch Programs, Realtime Programs); Datasets; Runtime Services; Tools and User Experience; Egress; Adapters]

Data Application Examples: Anomaly Detection; 360° Consumer Profile; Network Analytics; Multi-log Correlation; Analytics

•  New role-based user interface with capability for user-defined dashboards

•  Code-free data ingestion, exploration, and transformations from UI and Shell

•  Pre-built, out-of-the-box support for real-time and batch ETL pipelines

•  Application templates and plugins to speed development and enable reuse

•  Addition of OLAP Cube dataset

•  Support for multi-tenancy with easy-to-configure and easy-to-manage namespaces

•  Enhanced metrics and workflow support

Integrated features and pre-built modules for new users to become instantly productive

Powerful capabilities and open source extensibility for advanced users to move fast

CDAP Demo


Cask on Cloudera Use Case: Marketing SaaS Company

Challenges

•  Technical: 15B real-time events per day, processed with consistency

•  Talent: Domain experts did not know Hadoop; Hadoop consultants did not know the domain

•  Budget: Specialized skills are expensive

•  Operational: Utilize established best practices

Goals

•  Velocity: Real-time customer response

•  Revenue: Increase ACV

•  Competitive Advantage: Differentiate with scale and data consistency

Solution

•  CDAP delivered scale while maintaining data consistency

•  CDAP abstractions enabled domain experts to deliver without learning Hadoop

•  CDAP integrated into their existing development process

Results

•  Development to production in 3 months, after a failed 9-month effort to write natively on Hadoop with consultants

•  Budget saved by avoiding Hadoop consultants

•  New service driving revenue with existing customers


CDAP on Hadoop compared to Hadoop alone

•  Lines of code: 82% reduction

•  Development time: 86% reduction

Other advantages

•  Reduced cyclomatic complexity

•  Improved testability

•  Code readability and maintenance

•  Application deployment and maintenance

•  Egress support for application data

•  Simplified knowledge transfer

Actual Developer’s Experience: Top 5 SaaS Company

Cloudera Partners: Opportunities to Engage with Cask

Who: Application ISVs, SaaS Providers

Opportunity: Build new applications and services on CDAP

Value Propositions:

•  Lower TCO

•  Better use of developer resources

•  Faster time to market

•  Enable new features or services that require real-time ingestion and processing with data integrity

Who: System Integrators, Consulting Partners

Opportunity: Incorporate CDAP as a development platform within your big data practice

Value Propositions:

•  Broaden the pool of big data developers to include Java developers

•  Build solutions beyond offline analytics to increase business value

•  Lower cost of delivery by leveraging reuse and lower-cost skill sets

•  Increase service capacity by accelerating time to market

Who: Infrastructure ISVs

Opportunity: Integration with CDAP (Datasets, Programs, Templates, etc.)

Value Propositions:

•  Address potential gaps in your customers’ ability to grow footprint on your solution

•  Easily extend integration opportunities into the rest of the ecosystem via CDAP

Download CDAP

100% Apache 2 Open Source

CDAP 3.0 with ZeroApp UI + Shell

http://cask.co/download

http://www.cloudera.com/content/cloudera/en/downloads/cdap/latest.html

Use Cases and Examples

http://cask.co/get-started

Cloudera Partners: Next Steps to Work with Cask

Live Technical Webinar

Cask hosts a 2-hour, invite-only live technical webinar for customers and partners to learn more about CDAP

The next webinar will be on Wednesday, June 3rd, 2015

To register, please e-mail [email protected]

CDAP Certification

The Cask Certified Partner Program is available to Cloudera Partners at no cost

Two-day, on-site technical training session for developers at Cask HQ in Palo Alto. Includes basic and advanced CDAP courses and labs.

To register, please e-mail [email protected]

If you have any questions…

Jonathan Gray, CEO [email protected]

Tom Aliotti, SVP Field Ops [email protected]

Yuri Bukhan, Cloudera [email protected]