cloudera sessions - clinic 2 center of excellence development

Post on 20-Aug-2015

350 Views

Category:

Documents

1 Downloads

Preview:

Click to see full reader

TRANSCRIPT

Fostering Hadoop ExcellenceArchitecting a Center of Excellence

<Speaker Name><Title, Company>

COE Value Proposition

Identify Big Data TechnologiesLearn New SkillsDevelop and Test Processes to Lower Risk

A Center of Excellence (COE) is where organizations:

Month 1 Month 2 Month 3

Formal Training

Infrastructure Deployment

Hadoop Deployment

Deployment cont’d

Monitoring IntegrationBenchmarki

ngFailure and Recovery

Month 4 Month 5

Recovery cont’d

Ingestion Architecture

Ingestion Development

Database Integration

HBase Deployment

HBase Operations

COE Development Roadmap

Learn Develop

Operate

Deploy

Publish

Research

The Center of Excellence Model

Integration Infrastructure

Lab and Development Clusters

Architectural Staff

Development Staff

Operations Staff

COE Resources

COE Team

2 – Architect2 – Project Manager30 – Developers1 – Administrator

Project 1

Project 2

Project 3

Project 4

Project 5

Project 6

Project 7

Project 8

COE Staffing

CoE Team

DesignPlanning

Project ManagementExecution

Architect

Identify Applications with Business

Business

Formal Proposal to the CoE

Architect and PM

Review and Accept Proposal

Architect and PM

Planning and Staffing

Architect and PM

Project PlanTime and Cost Est.

CoE Team

LDDOP

CoE Team andBusiness

Hand Over

CoE Team

Reference Architecture

COE Process

Background in Java, Data Management, ETLKnowledge of Systems Hadoop Integrates WithRegular Training on New Versioning and Frameworks

COE Skills

Architecting Center of Excellence

Analytics Services

Data Science is a Central ResourceScientists are Assigned to BusinessEmbedding Scientists encourages Data Driven practices

Data Science Teams

Typical Duration is 3-6 monthsFocus on Discrete Business ProblemsBring an Understanding of Data as an Asset

Embedding Data Scientists

Science provides the ProofCoE and Business handle DevelopmentResearch results are Published and Shared

Data Science and Development

COE cluster deployment?Multi-tenancy?Security?Performance metrics?Report generation?

Other Questions

14

top related