enterprise data science (chief data scientist forum 2016)
TRANSCRIPT
Enterprise Data Science:Navigating the MatrixENDA RIDGE, PHDALGORITHMS LEAD, UK FMCG
Copyright Enda Ridge 2016
#GuerrillaAnalytics http://guerrilla-analytics.net @enda_ridge
2What You Will Learn Enterprise Data Science
Reminder of how Data Science really works Challenges in a matrix organisation Navigating the matrix. 5 steps to build an Enterprise Data Science capability
How this will help you How to fit Data Science into the enterprise Why the Chief Data Scientist is critical The most important decisions in year 1
Copyright Enda Ridge 2016#GuerrillaAnalytics http://guerrilla-analytics.net @enda_ridge
#GuerrillaAnalytics http://guerrilla-analytics.net @enda_ridge
3What I’ve Learned
PhD‘Design of Experime
nts for Tuning
Algorithms’
Boutique Consultanc
y
Forensic Analytics Manager
Senior Manager
Professional
Services
Head of Algorith
ms
Copyright Enda Ridge 2016
No matter the industry, doing agile data science always faces the same challenges…
2004 2008 2010 2012 2015
Enterprises rarely have the flexibility to accommodate Data Science
#GuerrillaAnalytics http://guerrilla-analytics.net @enda_ridge
4What is Data Science?
“Data Science is the discipline of discovering data and understanding data to find opportunities, efficiencies and improvements”
Copyright Enda Ridge 2016
#GuerrillaAnalytics http://guerrilla-analytics.net @enda_ridge
5What is Data Science?
“Data Science is the discipline of discovering data and understanding data to find opportunities, efficiencies and improvements”
Copyright Enda Ridge 2016
Opportunities in new data sources, new products, new understanding
Efficiencies in automation, process changes, organisation change Improvements in product features, product offerings
#GuerrillaAnalytics http://guerrilla-analytics.net @enda_ridge
6What is Data Science?
“Data Science is the discipline of discovering data and understanding data to find opportunities, efficiencies and improvements”
Copyright Enda Ridge 2016
Data Science uses the scientific method Experiments to test hypotheses Rigorous measurement Making changes and observing effects
#GuerrillaAnalytics http://guerrilla-analytics.net @enda_ridge
7What Data Science is not…
Copyright Enda Ridge 2016
https://vimeo.com/88093956
#GuerrillaAnalytics http://guerrilla-analytics.net @enda_ridge
8What Data Science is not…
Big Data
Copyright Enda Ridge 2016
https://vimeo.com/88093956
#GuerrillaAnalytics http://guerrilla-analytics.net @enda_ridge
9What Data Science is not…
Big Data
Business Intelligence & Analytics
Copyright Enda Ridge 2016
https://vimeo.com/88093956
#GuerrillaAnalytics http://guerrilla-analytics.net @enda_ridge
10What Data Science is not…
Big Data
Business Intelligence & Analytics
Playing around with data
Copyright Enda Ridge 2016
https://vimeo.com/88093956
#GuerrillaAnalytics http://guerrilla-analytics.net @enda_ridge
11Uncertainty Data Process Questions Solutions
Copyright Enda Ridge 2016
#GuerrillaAnalytics http://guerrilla-analytics.net @enda_ridge
12New Data Disparate sources Surveys Web scrapes Logs 3rd party
Copyright Enda Ridge 2016
#GuerrillaAnalytics http://guerrilla-analytics.net @enda_ridge
13Variety Data joins Visualizations Algorithms Languages
Copyright Enda Ridge 2016
#GuerrillaAnalytics http://guerrilla-analytics.net @enda_ridge
14High maturity Enterprise Data Science
Frame a business hypothesis
Gather and generate data
AnalyseConfirm with experiment
Copyright Enda Ridge 2016
Data-driven operations
Data-driven products
#GuerrillaAnalytics http://guerrilla-analytics.net @enda_ridge
15The Enterprise Matrix
Copyright Enda Ridge 2016
Marketing Sales / Trading Logistics Other
IT, InfoSec, ArchitectureProduct Development
BI & AnalyticsHR & Recruitment
#GuerrillaAnalytics http://guerrilla-analytics.net @enda_ridge
16The Enterprise Matrix
Copyright Enda Ridge 2016
Marketing Sales / Trading Logistics Other
IT, InfoSec, ArchitectureProduct Development
BI & AnalyticsHR & Recruitment
Data Science
#GuerrillaAnalytics http://guerrilla-analytics.net @enda_ridge
175 Challenges in Enterprise Data Science
Org structure & the customer
Enabling the team
Making insights actionable
Integration with the data community
Getting and keeping people
Copyright Enda Ridge 2016
#GuerrillaAnalytics http://guerrilla-analytics.net @enda_ridge
18Challenge #1: structure and customer
Copyright Enda Ridge 2016
Marketing Sales / Trading Logistics Other
IT, InfoSec, ArchitectureProduct Development
BI & AnalyticsHR & Recruitment
Data Science
You need to demonstrate value fast
But All want to own ‘the sexiest
job of 20th century’ Rebranding Perhaps non-Agile ways of
working Perhaps not ready to execute
#GuerrillaAnalytics http://guerrilla-analytics.net @enda_ridge
19Action #1: Chief Data Scientist (CDS) Chief Data Scientist in hub and
spoke
Clear Engagement Model to help those customers ‘Ready’ ‘Done’ Pipeline Simple project artefacts
Distributed model when organisation matures
Copyright Enda Ridge 2016
Project
Project
Project
CDS
#GuerrillaAnalytics http://guerrilla-analytics.net @enda_ridge
20Challenge #2: Enabling the team
Copyright Enda Ridge 2016
Marketing Sales / Trading Logistics Other
IT, InfoSec, ArchitectureProduct Development
BI & AnalyticsHR & Recruitment
Data Science
You need Data and Technology to do Data Science
But Tradition of control Generally slow Incentivised to maintain
status quo
#GuerrillaAnalytics http://guerrilla-analytics.net @enda_ridge
21Action #2: build tactical environment Quick and simple ‘within the walls’
Copyright Enda Ridge 2016
Lab
Data store
Applications
Dev tools
#GuerrillaAnalytics http://guerrilla-analytics.net @enda_ridge
22Action #2: build tactical environment Quick and simple ‘within the walls’
Then build out Scale Permission groups Proxy access Local admin rights Licencing Tech Support Data governance Data feeds
Copyright Enda Ridge 2016
Lab
Data store
Applications
Dev tools
#GuerrillaAnalytics http://guerrilla-analytics.net @enda_ridge
23Challenge #3: making insights actionable
Copyright Enda Ridge 2016
Marketing Sales / Trading Logistics Other
IT, InfoSec, ArchitectureProduct Development
BI & AnalyticsHR & Recruitment
Data Science
You need Data Science turned into Algorithms in products
But Own opinions on tech Not familiar with Data
Science methods and code Product lens
#GuerrillaAnalytics http://guerrilla-analytics.net @enda_ridge
24Action #3: Operating Model Worst case – rewrite code in Dev Best case – push algorithms into
services
Realistic middle ground Use version control Create ‘integration tests cases’ Create Data Science user stories Keep algorithms simple See ‘Guerrilla Analytics’
Copyright Enda Ridge 2016
Lab Development
Interfaces
#GuerrillaAnalytics http://guerrilla-analytics.net @enda_ridge
25Challenge #4: data community
Copyright Enda Ridge 2016
Marketing Sales / Trading Logistics Other
IT, InfoSec, ArchitectureProduct Development
BI & AnalyticsHR & Recruitment
Data Science
You need access to data You need customers
But Gatekeepers Perceived threat Rebranding Confusion with customer
#GuerrillaAnalytics http://guerrilla-analytics.net @enda_ridge
26Action #4: Set out your stall Create Terms of Reference Quick wins Create marketing materials for Data
Science Have clear Engagement materials Engage with broader data community
(forums, talks etc)
Copyright Enda Ridge 2016
#GuerrillaAnalytics http://guerrilla-analytics.net @enda_ridge
27Challenge #5: Getting & keeping people
Copyright Enda Ridge 2016
Marketing Sales / Trading Logistics Other
IT, InfoSec, ArchitectureProduct Development
BI & AnalyticsHR & Recruitment
Data Science
You need key hires and the market is competitive
But Existing pay structures Existing job formats Hiring agencies
#GuerrillaAnalytics http://guerrilla-analytics.net @enda_ridge
28Action #5: Prioritised Hires Less genius, more resilience and
practicality Prefer experience in early days Clear progression structures and
performance management Establish training budget and guidance
Copyright Enda Ridge 2016
X
#GuerrillaAnalytics http://guerrilla-analytics.net @enda_ridge
29Enterprise Data Science Be clear on what Data Science is 5 steps
Establish Chief Data Scientist, Hub and Spoke Enable people with tactical technology Op Model to make insights actionable Set out your stall Prioritised hires
Find me on Twitter @enda_ridge on my blog http://Guerrilla-Analytics.net
Copyright Enda Ridge 2016