enterprise data science (chief data scientist forum 2016)

29
Enterprise Data Science: Navigating the Matrix ENDA RIDGE, PHD ALGORITHMS LEAD, UK FMCG Copyright Enda Ridge 2016 #GuerrillaAnalytics http://guerrilla-analytics.net @enda_ridge

Upload: enda-ridge

Post on 17-Jan-2017

202 views

Category:

Data & Analytics


0 download

TRANSCRIPT

Page 1: Enterprise Data Science (Chief Data Scientist Forum 2016)

Enterprise Data Science:Navigating the MatrixENDA RIDGE, PHDALGORITHMS LEAD, UK FMCG

Copyright Enda Ridge 2016

#GuerrillaAnalytics http://guerrilla-analytics.net @enda_ridge

Page 2: Enterprise Data Science (Chief Data Scientist Forum 2016)

2What You Will Learn Enterprise Data Science

Reminder of how Data Science really works Challenges in a matrix organisation Navigating the matrix. 5 steps to build an Enterprise Data Science capability

How this will help you How to fit Data Science into the enterprise Why the Chief Data Scientist is critical The most important decisions in year 1

Copyright Enda Ridge 2016#GuerrillaAnalytics http://guerrilla-analytics.net @enda_ridge

Page 3: Enterprise Data Science (Chief Data Scientist Forum 2016)

#GuerrillaAnalytics http://guerrilla-analytics.net @enda_ridge

3What I’ve Learned

PhD‘Design of Experime

nts for Tuning

Algorithms’

Boutique Consultanc

y

Forensic Analytics Manager

Senior Manager

Professional

Services

Head of Algorith

ms

Copyright Enda Ridge 2016

No matter the industry, doing agile data science always faces the same challenges…

2004 2008 2010 2012 2015

Enterprises rarely have the flexibility to accommodate Data Science

Page 4: Enterprise Data Science (Chief Data Scientist Forum 2016)

#GuerrillaAnalytics http://guerrilla-analytics.net @enda_ridge

4What is Data Science?

“Data Science is the discipline of discovering data and understanding data to find opportunities, efficiencies and improvements”

Copyright Enda Ridge 2016

Page 5: Enterprise Data Science (Chief Data Scientist Forum 2016)

#GuerrillaAnalytics http://guerrilla-analytics.net @enda_ridge

5What is Data Science?

“Data Science is the discipline of discovering data and understanding data to find opportunities, efficiencies and improvements”

Copyright Enda Ridge 2016

Opportunities in new data sources, new products, new understanding

Efficiencies in automation, process changes, organisation change Improvements in product features, product offerings

Page 6: Enterprise Data Science (Chief Data Scientist Forum 2016)

#GuerrillaAnalytics http://guerrilla-analytics.net @enda_ridge

6What is Data Science?

“Data Science is the discipline of discovering data and understanding data to find opportunities, efficiencies and improvements”

Copyright Enda Ridge 2016

Data Science uses the scientific method Experiments to test hypotheses Rigorous measurement Making changes and observing effects

Page 7: Enterprise Data Science (Chief Data Scientist Forum 2016)

#GuerrillaAnalytics http://guerrilla-analytics.net @enda_ridge

7What Data Science is not…

Copyright Enda Ridge 2016

https://vimeo.com/88093956

Page 8: Enterprise Data Science (Chief Data Scientist Forum 2016)

#GuerrillaAnalytics http://guerrilla-analytics.net @enda_ridge

8What Data Science is not…

Big Data

Copyright Enda Ridge 2016

https://vimeo.com/88093956

Page 9: Enterprise Data Science (Chief Data Scientist Forum 2016)

#GuerrillaAnalytics http://guerrilla-analytics.net @enda_ridge

9What Data Science is not…

Big Data

Business Intelligence & Analytics

Copyright Enda Ridge 2016

https://vimeo.com/88093956

Page 10: Enterprise Data Science (Chief Data Scientist Forum 2016)

#GuerrillaAnalytics http://guerrilla-analytics.net @enda_ridge

10What Data Science is not…

Big Data

Business Intelligence & Analytics

Playing around with data

Copyright Enda Ridge 2016

https://vimeo.com/88093956

Page 11: Enterprise Data Science (Chief Data Scientist Forum 2016)

#GuerrillaAnalytics http://guerrilla-analytics.net @enda_ridge

11Uncertainty Data Process Questions Solutions

Copyright Enda Ridge 2016

Page 12: Enterprise Data Science (Chief Data Scientist Forum 2016)

#GuerrillaAnalytics http://guerrilla-analytics.net @enda_ridge

12New Data Disparate sources Surveys Web scrapes Logs 3rd party

Copyright Enda Ridge 2016

Page 13: Enterprise Data Science (Chief Data Scientist Forum 2016)

#GuerrillaAnalytics http://guerrilla-analytics.net @enda_ridge

13Variety Data joins Visualizations Algorithms Languages

Copyright Enda Ridge 2016

Page 14: Enterprise Data Science (Chief Data Scientist Forum 2016)

#GuerrillaAnalytics http://guerrilla-analytics.net @enda_ridge

14High maturity Enterprise Data Science

Frame a business hypothesis

Gather and generate data

AnalyseConfirm with experiment

Copyright Enda Ridge 2016

Data-driven operations

Data-driven products

Page 15: Enterprise Data Science (Chief Data Scientist Forum 2016)

#GuerrillaAnalytics http://guerrilla-analytics.net @enda_ridge

15The Enterprise Matrix

Copyright Enda Ridge 2016

Marketing Sales / Trading Logistics Other

IT, InfoSec, ArchitectureProduct Development

BI & AnalyticsHR & Recruitment

Page 16: Enterprise Data Science (Chief Data Scientist Forum 2016)

#GuerrillaAnalytics http://guerrilla-analytics.net @enda_ridge

16The Enterprise Matrix

Copyright Enda Ridge 2016

Marketing Sales / Trading Logistics Other

IT, InfoSec, ArchitectureProduct Development

BI & AnalyticsHR & Recruitment

Data Science

Page 17: Enterprise Data Science (Chief Data Scientist Forum 2016)

#GuerrillaAnalytics http://guerrilla-analytics.net @enda_ridge

175 Challenges in Enterprise Data Science

Org structure & the customer

Enabling the team

Making insights actionable

Integration with the data community

Getting and keeping people

Copyright Enda Ridge 2016

Page 18: Enterprise Data Science (Chief Data Scientist Forum 2016)

#GuerrillaAnalytics http://guerrilla-analytics.net @enda_ridge

18Challenge #1: structure and customer

Copyright Enda Ridge 2016

Marketing Sales / Trading Logistics Other

IT, InfoSec, ArchitectureProduct Development

BI & AnalyticsHR & Recruitment

Data Science

You need to demonstrate value fast

But All want to own ‘the sexiest

job of 20th century’ Rebranding Perhaps non-Agile ways of

working Perhaps not ready to execute

Page 19: Enterprise Data Science (Chief Data Scientist Forum 2016)

#GuerrillaAnalytics http://guerrilla-analytics.net @enda_ridge

19Action #1: Chief Data Scientist (CDS) Chief Data Scientist in hub and

spoke

Clear Engagement Model to help those customers ‘Ready’ ‘Done’ Pipeline Simple project artefacts

Distributed model when organisation matures

Copyright Enda Ridge 2016

Project

Project

Project

CDS

Page 20: Enterprise Data Science (Chief Data Scientist Forum 2016)

#GuerrillaAnalytics http://guerrilla-analytics.net @enda_ridge

20Challenge #2: Enabling the team

Copyright Enda Ridge 2016

Marketing Sales / Trading Logistics Other

IT, InfoSec, ArchitectureProduct Development

BI & AnalyticsHR & Recruitment

Data Science

You need Data and Technology to do Data Science

But Tradition of control Generally slow Incentivised to maintain

status quo

Page 21: Enterprise Data Science (Chief Data Scientist Forum 2016)

#GuerrillaAnalytics http://guerrilla-analytics.net @enda_ridge

21Action #2: build tactical environment Quick and simple ‘within the walls’

Copyright Enda Ridge 2016

Lab

Data store

Applications

Dev tools

Page 22: Enterprise Data Science (Chief Data Scientist Forum 2016)

#GuerrillaAnalytics http://guerrilla-analytics.net @enda_ridge

22Action #2: build tactical environment Quick and simple ‘within the walls’

Then build out Scale Permission groups Proxy access Local admin rights Licencing Tech Support Data governance Data feeds

Copyright Enda Ridge 2016

Lab

Data store

Applications

Dev tools

Page 23: Enterprise Data Science (Chief Data Scientist Forum 2016)

#GuerrillaAnalytics http://guerrilla-analytics.net @enda_ridge

23Challenge #3: making insights actionable

Copyright Enda Ridge 2016

Marketing Sales / Trading Logistics Other

IT, InfoSec, ArchitectureProduct Development

BI & AnalyticsHR & Recruitment

Data Science

You need Data Science turned into Algorithms in products

But Own opinions on tech Not familiar with Data

Science methods and code Product lens

Page 24: Enterprise Data Science (Chief Data Scientist Forum 2016)

#GuerrillaAnalytics http://guerrilla-analytics.net @enda_ridge

24Action #3: Operating Model Worst case – rewrite code in Dev Best case – push algorithms into

services

Realistic middle ground Use version control Create ‘integration tests cases’ Create Data Science user stories Keep algorithms simple See ‘Guerrilla Analytics’

Copyright Enda Ridge 2016

Lab Development

Interfaces

Page 25: Enterprise Data Science (Chief Data Scientist Forum 2016)

#GuerrillaAnalytics http://guerrilla-analytics.net @enda_ridge

25Challenge #4: data community

Copyright Enda Ridge 2016

Marketing Sales / Trading Logistics Other

IT, InfoSec, ArchitectureProduct Development

BI & AnalyticsHR & Recruitment

Data Science

You need access to data You need customers

But Gatekeepers Perceived threat Rebranding Confusion with customer

Page 26: Enterprise Data Science (Chief Data Scientist Forum 2016)

#GuerrillaAnalytics http://guerrilla-analytics.net @enda_ridge

26Action #4: Set out your stall Create Terms of Reference Quick wins Create marketing materials for Data

Science Have clear Engagement materials Engage with broader data community

(forums, talks etc)

Copyright Enda Ridge 2016

Page 27: Enterprise Data Science (Chief Data Scientist Forum 2016)

#GuerrillaAnalytics http://guerrilla-analytics.net @enda_ridge

27Challenge #5: Getting & keeping people

Copyright Enda Ridge 2016

Marketing Sales / Trading Logistics Other

IT, InfoSec, ArchitectureProduct Development

BI & AnalyticsHR & Recruitment

Data Science

You need key hires and the market is competitive

But Existing pay structures Existing job formats Hiring agencies

Page 28: Enterprise Data Science (Chief Data Scientist Forum 2016)

#GuerrillaAnalytics http://guerrilla-analytics.net @enda_ridge

28Action #5: Prioritised Hires Less genius, more resilience and

practicality Prefer experience in early days Clear progression structures and

performance management Establish training budget and guidance

Copyright Enda Ridge 2016

X

Page 29: Enterprise Data Science (Chief Data Scientist Forum 2016)

#GuerrillaAnalytics http://guerrilla-analytics.net @enda_ridge

29Enterprise Data Science Be clear on what Data Science is 5 steps

Establish Chief Data Scientist, Hub and Spoke Enable people with tactical technology Op Model to make insights actionable Set out your stall Prioritised hires

Find me on Twitter @enda_ridge on my blog http://Guerrilla-Analytics.net

Copyright Enda Ridge 2016