yhat - applied data science - feb 2016

47
Applied Data Science Making insights accessible and actionable PRESENTED BY Colin Ristig Product Manager [email protected] Austin Ogilvie Founder & CEO [email protected]

Upload: yhat-inc

Post on 15-Apr-2017

495 views

Category:

Software


1 download

TRANSCRIPT

Page 1: Yhat - Applied Data Science - Feb 2016

Applied Data ScienceMaking insights accessible and actionable

PRESENTED BY

Colin RistigProduct [email protected]

Austin OgilvieFounder & [email protected]

Page 2: Yhat - Applied Data Science - Feb 2016

Agenda

Quick Intro to Data Science

Understanding the Value Chain

Designing Your Data Science Process

Page 3: Yhat - Applied Data Science - Feb 2016

About Us

Page 4: Yhat - Applied Data Science - Feb 2016

We help data scientists build & deploy apps

Page 5: Yhat - Applied Data Science - Feb 2016

Founded 2013Headquarters in NYC

Page 6: Yhat - Applied Data Science - Feb 2016

You may know us from

Page 7: Yhat - Applied Data Science - Feb 2016

Data Sciencein 30 seconds

Page 8: Yhat - Applied Data Science - Feb 2016

Data Science in 30 Seconds

Broadly…

A multidisciplinary field concerning

problem solving using data,

statistics & software.

Page 9: Yhat - Applied Data Science - Feb 2016

“ What distinguishes data science itself from

the tools and techniques is the central goal

of deploying effective decision-making

models to a production environment. ”

Data Science is not “Interesting Research”

~ Nina Zumel & John Mount, Practical Data Science with R

Page 10: Yhat - Applied Data Science - Feb 2016

It’s about day-to-day problems

Carl wants to watch a good movie.

Page 11: Yhat - Applied Data Science - Feb 2016

And practical, real-world solutions

Carl wants to watch a good movie.

Hey, Carl. Check these out!

Page 12: Yhat - Applied Data Science - Feb 2016

Explanation isn’t always important

Carl wants to watch a good movie.

Carl

Cindy

http://courses.washington.edu/css490/2012.Winter/lecture_slides/08b_collaborative_filtering_1_r1.pdf

Carl would like Frozen because Cindy liked it.

Page 13: Yhat - Applied Data Science - Feb 2016

Data ScienceChallenges

Page 14: Yhat - Applied Data Science - Feb 2016

30%

Page 15: Yhat - Applied Data Science - Feb 2016

Why?

Page 16: Yhat - Applied Data Science - Feb 2016

Key obstacles data science teams face

Lack of Understanding

Page 17: Yhat - Applied Data Science - Feb 2016

Key obstacles data science teams face

Difficulty of Experimentation

Page 18: Yhat - Applied Data Science - Feb 2016

Hey, Trey. Online sales are down. What can we do to keep users engaged and shopping carts full?

Trey is asked to “look into something”

I’ll look into it.

Page 19: Yhat - Applied Data Science - Feb 2016

Hm...cool. Can you talk to the

dev team?

Here’s what we should do:

Trey uncovers a bunch of things we didn’t know

Page 20: Yhat - Applied Data Science - Feb 2016

Trey hands his work to deployment engineers

Page 21: Yhat - Applied Data Science - Feb 2016

“Throw it over the wall” projects

Execs Data Science Application Developers

Page 22: Yhat - Applied Data Science - Feb 2016

Common reasons these types of projects stall

- Unclear benefits- Skepticism about effectiveness- Too complex to operationalize- Too time-consuming- Unclear how to measure ROI

Page 23: Yhat - Applied Data Science - Feb 2016

Data ScienceValue Chain

Page 24: Yhat - Applied Data Science - Feb 2016

Making data valuable

Collect and display individual records

Structure, link, metadata, interact, share

Understand, infer, learn

Drive value,

change

Clean, aggregate, visualize

Actions

Predictions

Reports

Charts

Records

Extracting value from data is like any other value chain.

Value

Page 25: Yhat - Applied Data Science - Feb 2016

Like a raw material, data has no obvious utility to start out.

Collect and display individual records

Structure, link, metadata, interact, share

Understand, infer, learn

Drive value,

change

Clean, aggregate, visualize

Actions

Predictions

Reports

Charts

Records

Value

Making data valuable

Page 26: Yhat - Applied Data Science - Feb 2016

We make it valuable through sequential refinement.

Collect and display individual records

Structure, link, metadata, interact, share

Understand, infer, learn

Drive value,

change

Clean, aggregate, visualize

Actions

Predictions

Reports

Charts

Records

Value

Making data valuable

Page 27: Yhat - Applied Data Science - Feb 2016

Cost of Creating that Value

Building data products requires lots of work

Page 28: Yhat - Applied Data Science - Feb 2016

Cost of Creating that Value

But most of the value is generated at the end

Page 29: Yhat - Applied Data Science - Feb 2016

Cost of Creating that Value

Data Teams

Managers

Customers

Everyone has to see past a lot of challenges

Page 30: Yhat - Applied Data Science - Feb 2016

DataScienceCustomers

Page 31: Yhat - Applied Data Science - Feb 2016

- Consumers

Several types of customers

Carl wants to watch a good movie.

Page 32: Yhat - Applied Data Science - Feb 2016

- Consumers- App Developers

Cambria needs to call credit models from Salesforce.

Several types of customers

Page 33: Yhat - Applied Data Science - Feb 2016

Douglas needs 3 AM server outages to stop.

Several types of customers

- Consumers- App Developers- Infrastructure Admins

Page 34: Yhat - Applied Data Science - Feb 2016

Gordon wants sales reps calling the hottest leads.

Several types of customers

- Consumers- App Developers- Infrastructure Admins- Sales & Marketing

Page 35: Yhat - Applied Data Science - Feb 2016

DataScience5 Attributes for Success

Page 36: Yhat - Applied Data Science - Feb 2016

1. Focus on the customer

5 Attributes of Successful Data Science Teams

Page 37: Yhat - Applied Data Science - Feb 2016

1. Focus on the customer2. Identify practical constraints

5 Attributes of Successful Data Science Teams

Page 38: Yhat - Applied Data Science - Feb 2016

1. Focus on the customer2. Identify practical constraints3. Start small but ship quickly

5 Attributes of Successful Data Science Teams

Page 39: Yhat - Applied Data Science - Feb 2016

1. Focus on the customer2. Identify practical constraints3. Start small but ship quickly4. Measure the impact

5 Attributes of Successful Data Science Teams

Page 40: Yhat - Applied Data Science - Feb 2016

1. Focus on the customer2. Identify practical constraints3. Start small but ship quickly4. Measure the impact5. Relentless iteration

5 Attributes of Successful Data Science Teams

Page 41: Yhat - Applied Data Science - Feb 2016

1. Focus on the customer2. Identify practical constraints3. Start small but ship quickly4. Measure the impact5. Relentless iteration

5 Attributes of Successful Data Science Teams

Page 42: Yhat - Applied Data Science - Feb 2016

Demo

Page 43: Yhat - Applied Data Science - Feb 2016

Hm...cool. Can you talk to the

dev team?

Here’s what we should do:

Trey uncovers a bunch of things we didn’t know

Page 44: Yhat - Applied Data Science - Feb 2016

Trey hands his work to deployment engineers

Page 45: Yhat - Applied Data Science - Feb 2016

“Throw it over the wall” projects

Data Science Application Developers

Page 46: Yhat - Applied Data Science - Feb 2016

Deploy Models Faster

Data Science Application Developers

Page 47: Yhat - Applied Data Science - Feb 2016