cloudera sessions - clinic 2 center of excellence development

14
Fostering Hadoop Excellence Architecting a Center of Excellence <Speaker Name> <Title, Company>

Upload: cloudera-inc

Post on 20-Aug-2015

350 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: Cloudera sessions - Clinic 2 Center of Excellence Development

Fostering Hadoop ExcellenceArchitecting a Center of Excellence

<Speaker Name><Title, Company>

Page 2: Cloudera sessions - Clinic 2 Center of Excellence Development

COE Value Proposition

Identify Big Data TechnologiesLearn New SkillsDevelop and Test Processes to Lower Risk

A Center of Excellence (COE) is where organizations:

Page 3: Cloudera sessions - Clinic 2 Center of Excellence Development

Month 1 Month 2 Month 3

Formal Training

Infrastructure Deployment

Hadoop Deployment

Deployment cont’d

Monitoring IntegrationBenchmarki

ngFailure and Recovery

Month 4 Month 5

Recovery cont’d

Ingestion Architecture

Ingestion Development

Database Integration

HBase Deployment

HBase Operations

COE Development Roadmap

Page 4: Cloudera sessions - Clinic 2 Center of Excellence Development

Learn Develop

Operate

Deploy

Publish

Research

The Center of Excellence Model

Page 5: Cloudera sessions - Clinic 2 Center of Excellence Development

Integration Infrastructure

Lab and Development Clusters

Architectural Staff

Development Staff

Operations Staff

COE Resources

Page 6: Cloudera sessions - Clinic 2 Center of Excellence Development

COE Team

2 – Architect2 – Project Manager30 – Developers1 – Administrator

Project 1

Project 2

Project 3

Project 4

Project 5

Project 6

Project 7

Project 8

COE Staffing

Page 7: Cloudera sessions - Clinic 2 Center of Excellence Development

CoE Team

DesignPlanning

Project ManagementExecution

Architect

Identify Applications with Business

Business

Formal Proposal to the CoE

Architect and PM

Review and Accept Proposal

Architect and PM

Planning and Staffing

Architect and PM

Project PlanTime and Cost Est.

CoE Team

LDDOP

CoE Team andBusiness

Hand Over

CoE Team

Reference Architecture

COE Process

Page 8: Cloudera sessions - Clinic 2 Center of Excellence Development

Background in Java, Data Management, ETLKnowledge of Systems Hadoop Integrates WithRegular Training on New Versioning and Frameworks

COE Skills

Page 9: Cloudera sessions - Clinic 2 Center of Excellence Development

Architecting Center of Excellence

Analytics Services

Page 10: Cloudera sessions - Clinic 2 Center of Excellence Development

Data Science is a Central ResourceScientists are Assigned to BusinessEmbedding Scientists encourages Data Driven practices

Data Science Teams

Page 11: Cloudera sessions - Clinic 2 Center of Excellence Development

Typical Duration is 3-6 monthsFocus on Discrete Business ProblemsBring an Understanding of Data as an Asset

Embedding Data Scientists

Page 12: Cloudera sessions - Clinic 2 Center of Excellence Development

Science provides the ProofCoE and Business handle DevelopmentResearch results are Published and Shared

Data Science and Development

Page 13: Cloudera sessions - Clinic 2 Center of Excellence Development

COE cluster deployment?Multi-tenancy?Security?Performance metrics?Report generation?

Other Questions

Page 14: Cloudera sessions - Clinic 2 Center of Excellence Development

14