Session 703: Introducing Core ML


#WWDC17

© 2017 Apple Inc. All rights reserved. Redistribution or public display not permitted without written permission from Apple.

Gaurav Kapoor, Core ML Michael Siracusa, Core ML Lizi Ottens, Core ML

•Introducing Core ML

System Frameworks

Personalization Face Detection Emotion Detection

Search Ranking Machine Translation Image Captioning

Real Time Image Recognition Text Prediction Entity Recognition

Speaker Identification Music Tagging Text Summarization

Sentiment Analysis Handwriting Recognition Style Transfer


•Why?

Task: Show all images of roses

// Use color if color == "reddish"
// Use shape if shape == ???

Machine Learning

Training

Learning Algorithm + Labels → Model

Offline

Inference

Model → Label: Rose, Confidence: 95%

Challenging!

#include <math.h>
#include <string.h>

void convolutionLayer(int kernelWidth, int kernelHeight,
                      int inputFeatureChannels, int outputFeatureChannels,
                      int strideX, int strideY, int numRows, int numCols,
                      float *input, float *output, float *weights,
                      int widthPadding, int heightPadding,
                      float alpha, float beta) {
    int numRowsOut = (numRows - kernelWidth + 2 * widthPadding) / strideX + 1;
    int numColsOut = (numCols - kernelHeight + 2 * heightPadding) / strideY + 1;
    memset(output, 0, numRowsOut * numColsOut * outputFeatureChannels * sizeof(float));
    for (int depthInd = 0; depthInd < outputFeatureChannels; depthInd++) {
        // loop over input (color) channels
        for (int colorInd = 0; colorInd < inputFeatureChannels; colorInd++) {
            // loop over the pixels of the output image
            for (int i = 0; i < numRowsOut; i++) {
                for (int j = 0; j < numColsOut; j++) {
                    // loop over this kernel (flipped for true convolution)
                    for (int m = 0; m < kernelWidth; m++) {
                        int mm = kernelWidth - 1 - m;
                        for (int n = 0; n < kernelHeight; n++) {
                            int nn = kernelHeight - 1 - n;
                            int ii = i * strideX + m - widthPadding;
                            int jj = j * strideY + n - heightPadding;
                            if (ii >= 0 && ii < numRows && jj >= 0 && jj < numCols) {
                                float weight = weights[nn + mm * kernelHeight +
                                    colorInd * kernelHeight * kernelWidth +
                                    depthInd * kernelHeight * kernelWidth * inputFeatureChannels];
                                float value = input[jj + ii * numCols +
                                    colorInd * numRows * numCols] * weight;
                                output[j + i * numColsOut +
                                    depthInd * numRowsOut * numColsOut] += value;
                            }
                        }
                    }
                }
            }
        }
    }
    // loop and apply nonlinearity
    for (int i = 0; i < numRowsOut * numColsOut * outputFeatureChannels; i++) {
        output[i] = alpha * tanh(beta * output[i]);
    }
}

Challenges

Correctness

Performance

Energy Efficiency

•ML Frameworks

ML Frameworks

Your app

Domain Specific Frameworks (NEW): Vision, NLP

ML Framework (NEW): Core ML

ML Performance Primitives: Accelerate, MPS

Vision Framework

Object Tracking, Face Detection

Natural Language Processing

"Pablo y yo ya regresamos de nuestras vacaciones en Finlandia." ("Pablo and I are back from Finland.")

Language Identification → Language: Spanish

Named Entity Recognition → Place Identified: Finland

Core ML

Music Tagging, Image Captioning

Accelerate and MPS

High performance math

Inference for custom ML models


Run on Device

User Privacy

Data Cost

Server Cost

Always Available 24/7


Michael Siracusa, Core ML

•Core ML

•Overview •Models •Development Flow

Focus on the experience you are trying to enable

Simple: Unified inference API, Xcode integration

Performant: Fine tuned inference engines, built on Accelerate and Metal

Compatible: Public model format, support for popular training libraries

•Overview •Models •Development Flow

Model

Function learned from data

Observed inputs → Predicts outputs

photo → flowerType

Scene Classification Style TransferTranslation

Sentiment Analysis Handwriting Recognition

Music Tagging Predicting Text

Underlying Function

Scene Classification Style TransferTranslation

Sentiment Analysis Handwriting Recognition

Music Tagging Predicting Text

😃That was totally awesome Leo!

I love you mom 사랑해 엄마

7

Do you know the way to San Jose

Beach

Rock

Model Types

Scene Classification, Translation, Style Transfer

Sentiment Analysis, Handwriting Recognition

Music Tagging, Predicting Text

Convolutional Neural Networks

Feed Forward Neural Networks

Recurrent Neural Networks

Support Vector Machines, Tree Ensembles, Generalized Linear Models

Focus on Use Cases


Core ML Model

Single document

Public format

Core ML in Depth Hall 3 Thursday 9:00AM

Where do models come from?

Sample Models https://developer.apple.com/machine-learning

Core ML models

Ready to use

Task specific

Explore!

Thriving communities

Popular ML libraries

Many models

Tap Into ML Community

Convert to Core ML

Core ML Tools

Open Source

Core ML in Depth Hall 3 Thursday 9:00AM

•Overview •Models •Development Flow

Model as Code

Xcode → Your App

•Development Flow

Lizi Ottens, Core ML

Getting the Model

Convert


•Demo •Image based flower identifier

•Demo Recap •Image based flower identifier

Demo Recap: Xcode integration

Demo Recap: Simple usage

let flowerModel = FlowerClassifier()
if let prediction = try? flowerModel.prediction(flowerImage: image) {
    return prediction.flowerType
}

Type of model abstracted

Input/output strongly typed

Generated Source

class FlowerClassifierInput {
    var flowerImage: CVPixelBuffer
}

class FlowerClassifierOutput {
    let flowerType: String
    let flowerTypeProbs: [String: Double]
}

class FlowerClassifier {
    convenience init()
    func prediction(flowerImage: CVPixelBuffer) throws -> FlowerClassifierOutput
}

More Advanced Underlying API

Programmatic access to model for power users

class FlowerClassifier {
    convenience init()
    let model: MLModel
    func prediction(flowerImage: CVPixelBuffer) throws -> FlowerClassifierOutput
}

MLModel

class MLModel {
    var modelDescription: MLModelDescription
    func prediction(from input: MLFeatureProvider) throws -> MLFeatureProvider
}

Access to model description

Flexibility in how input is provided

Xcode → Your App

Behind the Scenes: Model compilation

Quick initialization

Optimized prediction

Model Goals (NEW)

Reduce size

Improve accuracy

Decrease prediction times

Summary

Machine learning frameworks

Core ML

Development flow in action

More Information: https://developer.apple.com/wwdc17/703

Related Sessions

Natural Language Processing and your Apps Hall 3 Wednesday 9:00AM

Vision Framework: Building on Core ML Hall 2 Wednesday 3:10PM

Core ML in depth Hall 3 Thursday 9:00AM

Accelerate and Sparse Solvers Executive Ballroom Thursday 10:00AM

Using Metal 2 for Compute Grand Ballroom A Thursday 4:10PM

Labs

Core ML and Natural Language Processing Lab Technology Lab D Thu 11:00AM-3:30PM

Core ML and Natural Language Processing Lab Technology Lab D Fri 1:50PM-4:00PM
