cloudera showcase cask
TRANSCRIPT
CASK DATA APP PLATFORM
ABSTRACTION Hide Complexity and
Enable Reuse
INTEGRATION Provide Capabilities
over Features
TOOLS & SERVICES
Support Applications from Dev to Prod
Open Source, Integrated Platform for Developers and Organizations to Build, Deploy, and Manage Data Applications
PROPRIETARY & CONFIDENTIAL
Cask is delivering the Developer Platform for Big Data Applications
• Founded by early Hadoop engineers from Facebook and Yahoo!
• Focused on developers and enabling big data applications
We are the WebLogic of Big Data
SIMPLE ACCESS TO POWERFUL TECHNOLOGY
7
CDAP Capabilities
Datasets
Programs
+ Ingestion, Egress, Tools & User Experience
• Standardized containers providing consistency for diverse processing paradigms
• Services for developers to enable richer apps with less hassle; and production to enable application and data management
• Libraries to build reusable data access patterns spanning multiple storage technologies
Runtime Services
9
PROPRIETARY & CONFIDENTIAL
CDAP Integration with Cloudera Cask Data Application Platform (CDAP) – Cloudera Integration Today
Cloudera Manager – CDAP CSD enables install, update, monitor of CDAP within CM
Impala – CDAP adapter for Impala enables data transformation into Impala optimized file formats with just a few simple commands
Future:
Further Impala integration
Integration with Sentry
Integration with Navigator
Support for Spark Streaming
Support for Cloudera Search DeploymentFlexibility
Unlimited Storage
Security and Administration
Process Discover Model Serve
On-PremisesAppliancesEngineered Systems
Public CloudPrivate CloudHybrid Cloud
Cloudera’s Enterprise Data Hub
Programs
Batch Programs Realtime Programs
CASK DATA APPLICATION PLATFORM (CDAP)
Event /DataIngestion
Tools andUser Experience
Datasets
Runtime Services
Egress
Adapters
Data ApplicationExamples
Anomaly Detection
360o
Consumerprofile
NetworkAnalytics
Multi-logCorrelation
Analytics
• New role-based user interface with capability for user-defined dashboards • Code-free data ingestion, exploration, and transformations from UI and Shell • Pre-built, out of the box support for real-time and batch ETL pipelines • Application templates and plugins to speed development and enable reuse • Addition of OLAP Cube dataset • Support for multi-tenancy with easy to configure and manage namespaces • Enhanced metrics and workflow support
Integrated features and pre-built modules for new users to become instantly productive
Powerful capabilities and open source extensibility for advanced users to move fast
Cask on Cloudera Use case: Marketing SaaS Company
Challenges • Technical: 15B real-time events / day with consistency
• Talent: Domain experts don’t know Hadoop; Hadoop consultants didn’t know domain
• Budget: Specialized skills expensive
• Operational: Utilize established best practices
Goals
• Velocity: Real-time customer response
• Revenue: Increase ACV
• Competitive Advantage: Differentiate with scale and data consistency
Solution • CDAP delivered scale while maintaining data
consistency
• CDAP abstractions enabled domain experts to deliver without learning Hadoop
• CDAP integrated into their existing development process
Results • Development to production in 3 months after 9 month
failed effort to write natively on Hadoop with consultants
• Budget saved avoiding Hadoop consultants
• New service driving revenue with existing customers
14
CDAP on Hadoop compared to Hadoop alone
Lines of code 82% reduction
Development time 86% reduction
Other advantages
• Reduced Cyclomatic complexity • Improved Testability • Code readability and maintenance • Application deployment and maintenance • Egress support for application data • Simplified knowledge transfer
Actual Developer’s Experience Top 5 SaaS Company
Who Application ISVs SaaS Providers Opportunity Build new applications and services on CDAP Value Propositions
• Lower TCO
• Better use of developer resources
• Faster time to market
• Enable new features or services that require real-time ingestion, processing with data integrity
Cloudera Partners Opportunities to engage with Cask
Who System Integrators Consulting Partners Opportunity Incorporate CDAP as a development platform within your big data practice Value Propositions
• Broaden pool of big data developers to include Java developers
• Build solutions beyond offline analytics to increase business value
• Lower cost of delivery by leveraging reuse and lower cost skill sets
• Increase service capacity by accelerating time to market
Who Infrastructure ISVs Opportunity Integration with CDAP (Datasets, Programs, Templates, etc.) Value Propositions
• Address potential gaps in your customers’ ability to grow footprint on your solution
• Easily extend integration opportunities into the rest of the ecosystem via CDAP
Download CDAP
100% Apache 2 Open Source
CDAP 3.0 with ZeroApp UI + Shell
http://cask.co/download
http://www.cloudera.com/content/cloudera/en/downloads/cdap/
latest.html
Use Cases and Examples
http://cask.co/get-started
Cloudera Partners Next steps to work with Cask
Live Technical Webinar
Cask hosts a 2-hour, invite-only live technical webinar for customers and partners to learn more about CDAP
Next Webinar will be on
Wednesday, June 3rd, 2015
To register, please e-mail
CDAP Certification
The Cask Certified Partner Program is available to Cloudera Partners
at no cost
Two day, on-site technical training session at Cask HQ in Palo Alto for
developers. Includes basic and advanced CDAP courses and labs.
To register, please e-mail
If you have any questions…
Jonathan Gray, CEO [email protected]
Tom Aliotti, SVP Field Ops [email protected]
Yuri Bukhan, Cloudera [email protected]