building predictive analytics solution with azure ml (odsc workshop)

20
Building a predictive analytics solution with Azure ML Fidan Boylu Uz, Ph.D Syed Fahad Allam Shah, Ph.D Data Scientists, Microsoft

Upload: fahad-shah

Post on 07-Aug-2015

145 views

Category:

Data & Analytics


0 download

TRANSCRIPT

Page 1: Building predictive analytics solution with Azure ML (ODSC Workshop)

Building a predictive analytics solution with Azure ML

Fidan Boylu Uz, Ph.DSyed Fahad Allam Shah, Ph.DData Scientists, Microsoft

Page 2: Building predictive analytics solution with Azure ML (ODSC Workshop)

Advanced AnalyticsBeyond business intelligence

Source: Gartner

VA

LU

E

DIFFICULTY

HINDSIGHT

INSIGHT

FORESIGHT

Descriptive Analytics

DiagnosticAnalytics

Predictive Analytics

Prescriptive Analytics

What happened?

Why did it happen?

What will happen?

How can we make it happen?

Traditional BI Advanced AnalyticsINFORMATION

OPTIMIZATION

Page 3: Building predictive analytics solution with Azure ML (ODSC Workshop)

Machine LearningComputing systems that improve with experience

Page 4: Building predictive analytics solution with Azure ML (ODSC Workshop)

1 1 5 4 3

7 5 3 5 3

5 5 9 0 6

3 5 2 0 0

Training examples Training labels

Accurate digit classifier

2

Machine learning system

Bing Translator App

Page 5: Building predictive analytics solution with Azure ML (ODSC Workshop)

Predictive AnalyticsPredicting future performance from historical data

Recommenda-tion engines

Advertising analysis

Weather forecasting for business planning

Social network analysis

IT infrastructure and web app optimization

Legal discovery and document archiving

Pricing analysisFraud detection

Churn analysis

Equipment monitoring

Location-based tracking and services

Personalized Insurance

Predictive analytics should address the likelihood of something happening in the future, even if it is just an instant later…

Page 6: Building predictive analytics solution with Azure ML (ODSC Workshop)

Transformational trends

cloud computing

2011 2016 5x increase

emerging data science talent

Universities filling 300,000 US talent gap

90% of the data in the world today has been created in the last two years alone

data explosion

connected customers

1B+200M10.4M 160M

Page 7: Building predictive analytics solution with Azure ML (ODSC Workshop)

The old Advanced Analytics landscape No improvement in generations

Huge set-up costs of tools, expertise, and compute/storage capacity Expensive

Siloed and cumbersome data management restricts access to dataSiloed data

Complex and fragmented tools limit participation in exploring data and building models

Disconnected tools

Many models never achieve business value due to difficulties with deploying to production

Deployment complexity

Page 8: Building predictive analytics solution with Azure ML (ODSC Workshop)

Differentiation

Model Your Way[Data Scientist]

All Skill Levels Business-tested Algorithms

R & Python

Deploy in Minutes[Data Scientist, IT & Developers]

One Click DeploymentManage via Cloud Portal

Model accessed as a Web Service

Expand your Reach[Ecosystem & Developers]

Azure MarketplaceGlobal Scale

The Data Science “Inside”

Accessibility

Page 9: Building predictive analytics solution with Azure ML (ODSC Workshop)

Predictive MaintenanceDemo

Page 10: Building predictive analytics solution with Azure ML (ODSC Workshop)

Scenario

11

This is Karl.Karl owns a company that

operates vending machines in Washington.

His job is to make sure that his 100 vending machines are selling drinks & obtaining

revenue.

Karl wants revenue to always be high & his

business to be profitable

Page 11: Building predictive analytics solution with Azure ML (ODSC Workshop)

Scenario

12

Sadly, vending machine will occasionally break & may take up to 7 days to fix, thus hurting

sales.

To eliminate this occurrence, Karl must maintain operations & figure out the best way to utilize

resources in order to optimize revenue.

Page 12: Building predictive analytics solution with Azure ML (ODSC Workshop)

Questions & Solutions

3. Which Machines Will Soon Fail

13

1. Which Machines Have Lost Sales?

2. Which Machines Have Failed?

Page 13: Building predictive analytics solution with Azure ML (ODSC Workshop)

Cloud

ConsumptionAnalysis / ManagementAnalysis/ StorageData Collection

Stream Analytics

Demo architecture: Advanced Analytics

API Link

Event Hubs

Data Factory

Azure Machine Learning

Power BI

Excel

Field Data

MicrosoftAzure Portal

Power BI / ExcelMicrosoft Azure

Blob Storage

14

Page 14: Building predictive analytics solution with Azure ML (ODSC Workshop)

Advanced analytics architecture

Data to model to web services in minutes

Data preparation Business valueModeling Deployment

• HDFS• RDBMS• NoSQL stores• Blobs and tables

Data

• Desktop files

• Spreadsheets

• Server stores

• Sensors

Cloud

Local

Apps, dashboardsand processes

Storage space

Integrated development environment for machine

learning

MLStudio

http://studio.azureml.net

API

Model is now a web svc

Monetize this API

MMarketplace

Web

• Data factory• Stream analytics

• Machine learning• HDInsight

• Marketplace• Azure portal

• Power BI• Apps

Page 15: Building predictive analytics solution with Azure ML (ODSC Workshop)

Establish mechanisms to conduct data science activities end-to-end in the cloud or on premises, friction free.

Set up a Data Science Environment in the cloud Move data from on premise to cloud Explore and understand your data Build a model with Azure Machine Learning Deploy model as web-service and consume it End-to-End walkthroughs with real datasets

Advanced Analytics Process & Technology (ADAPT)

http://aka.ms/adapt

Page 16: Building predictive analytics solution with Azure ML (ODSC Workshop)

ADAPT Hands On Walkthroughs

Setup Cloud Environment

Load DataExplore Data

Engineer Features

Sample Data

Build Model Deploy Model Consume Model

Page 17: Building predictive analytics solution with Azure ML (ODSC Workshop)

Today: Hands On

Setup Cloud Environment

Load DataExplore Data

Engineer Features

Sample Data

Build Model Deploy Model Consume Model

Page 18: Building predictive analytics solution with Azure ML (ODSC Workshop)

All taxi trips and fares paid for trips in NYC from January 2013 to December 2013. ~20GB of compressed CSV data (~48GB uncompressed). >173 million rows.

Each record includes Pickup and drop-off location and time Anonymized hack license number Medallion number (i.e. the taxi’s unique id number).

trip_data CSV contains trip details, and trip_fare CSV contains details of the fare paid.

Unique key to join trip_data and trip_fare is: medallion, hack_licence, and pickup_datetime.

New York City Taxi Dataset

Page 19: Building predictive analytics solution with Azure ML (ODSC Workshop)

Let’s get going!

Page 20: Building predictive analytics solution with Azure ML (ODSC Workshop)

Demo

Setup Cloud Environment

Load DataExplore Data

Engineer Features

Sample Data

Build Model Deploy Model Consume Model