deep neural networks for mobile platforms · pruning basics unimportant criteria yann le cun, et...

Post on 16-Jun-2020

0 Views

Category:

Documents

0 Downloads

Preview:

Click to see full reader

TRANSCRIPT

Deep Neural Networks for Mobile Platforms

Oleksandr Baiev

PhD, Sr. EngineerSamsung R&D Instutute Ukraine

AI Ukraine, 2016

Outline

Problems overviewKey problemsDeeper problem overview

TopologyOverview

QuantizationOverview

PruningOverviewExamples

Oleksandr Baiev, PhD, Sr. Engineer Samsung R&D Instutute Ukraine DNN for Mobile Platforms 1 / 27

Outline

Problems overviewKey problemsDeeper problem overview

TopologyOverview

QuantizationOverview

PruningOverviewExamples

Oleksandr Baiev, PhD, Sr. Engineer Samsung R&D Instutute Ukraine DNN for Mobile Platforms 2 / 27

Neural networks

Cutting edge results for CV, NLP, Signal Processing,Recommendations.

Unified solution for different problems.

Oleksandr Baiev, PhD, Sr. Engineer Samsung R&D Instutute Ukraine DNN for Mobile Platforms 3 / 27

Neural networksConvolution neural network

Oleksandr Baiev, PhD, Sr. Engineer Samsung R&D Instutute Ukraine DNN for Mobile Platforms 4 / 27

Deployment stage problemModels quality

Alex Krizhevsky, et al. ImageNet Classification with Deep Convolutional Neural Networks, 2012Christian Szegedy, et al. Going Deeper with Convolutions, 2014K. Simonyan, et al. Very Deep Convolutional Networks for Large-Scale Image Recognition, 2014Kaiming He, et al. Deep Residual Learning for Image Recognition, 2015

Oleksandr Baiev, PhD, Sr. Engineer Samsung R&D Instutute Ukraine DNN for Mobile Platforms 5 / 27

Deployment stage problemModels size

Application more than 100MB requires WiFi for downloading via app stores

Oleksandr Baiev, PhD, Sr. Engineer Samsung R&D Instutute Ukraine DNN for Mobile Platforms 6 / 27

Deployment stage problemModels execution

Test with https://github.com/sh1r0/caffe-android-lib

Oleksandr Baiev, PhD, Sr. Engineer Samsung R&D Instutute Ukraine DNN for Mobile Platforms 7 / 27

Outline

Problems overviewKey problemsDeeper problem overview

TopologyOverview

QuantizationOverview

PruningOverviewExamples

Oleksandr Baiev, PhD, Sr. Engineer Samsung R&D Instutute Ukraine DNN for Mobile Platforms 8 / 27

Problem overviewApproximate layers distribution

Fully Connected are bigger thanConvolution layers in terms ofMB

Convolution takes much moretime for forward pass

Target device have to storelayer’s feature maps in RAM forat least one layer

Oleksandr Baiev, PhD, Sr. Engineer Samsung R&D Instutute Ukraine DNN for Mobile Platforms 9 / 27

Problem overviewExecution time and size by layer

Oleksandr Baiev, PhD, Sr. Engineer Samsung R&D Instutute Ukraine DNN for Mobile Platforms 10 / 27

Problem overviewParameters importance

Feature map’s width, height influence on execution time

Feature map’s depth influences on model size

Oleksandr Baiev, PhD, Sr. Engineer Samsung R&D Instutute Ukraine DNN for Mobile Platforms 11 / 27

Outline

Problems overviewKey problemsDeeper problem overview

TopologyOverview

QuantizationOverview

PruningOverviewExamples

Oleksandr Baiev, PhD, Sr. Engineer Samsung R&D Instutute Ukraine DNN for Mobile Platforms 12 / 27

Bottlenecks

Min Lin, et al. Network In Network, 2013

Oleksandr Baiev, PhD, Sr. Engineer Samsung R&D Instutute Ukraine DNN for Mobile Platforms 13 / 27

Inception module

Compose different kernel sizes

Christian Szegedy, et. al., Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning, 2016

Christian Szegedy, et. al., Rethinking the Inception Architecture for Computer Vision, 2015

Oleksandr Baiev, PhD, Sr. Engineer Samsung R&D Instutute Ukraine DNN for Mobile Platforms 14 / 27

Outline

Problems overviewKey problemsDeeper problem overview

TopologyOverview

QuantizationOverview

PruningOverviewExamples

Oleksandr Baiev, PhD, Sr. Engineer Samsung R&D Instutute Ukraine DNN for Mobile Platforms 15 / 27

Quantization

Pete Warden, How to Quantize Neural Networks with TensorFlow, 2016

Matthieu Courbariaux, et. al., BinaryConnect: Training Deep Neural Networks with binary weights during propagations,2015

Oleksandr Baiev, PhD, Sr. Engineer Samsung R&D Instutute Ukraine DNN for Mobile Platforms 16 / 27

Quantization Schema

All operations use precalculatedMin and Max values which areused for rescaling

Min and Max values selectedfrom function behavior and realnumber values

Oleksandr Baiev, PhD, Sr. Engineer Samsung R&D Instutute Ukraine DNN for Mobile Platforms 17 / 27

Outline

Problems overviewKey problemsDeeper problem overview

TopologyOverview

QuantizationOverview

PruningOverviewExamples

Oleksandr Baiev, PhD, Sr. Engineer Samsung R&D Instutute Ukraine DNN for Mobile Platforms 18 / 27

Pruning basics

The idea of pruning is removing unimportant weights. The one question ishow to define “unimportant”.

Oleksandr Baiev, PhD, Sr. Engineer Samsung R&D Instutute Ukraine DNN for Mobile Platforms 19 / 27

Pruning basicsUnimportant criteria

Yann Le Cun, et al. Optimal Brain Damage, 1990

Babak Hassibi, et al. Second Order Derivatives for Network Pruning: Optimal Brain Surgeon, 1992

Song Han, et al., Learning both Weights and Connections for Efficient Neural Networks, 2015

Oleksandr Baiev, PhD, Sr. Engineer Samsung R&D Instutute Ukraine DNN for Mobile Platforms 20 / 27

Pruning basicsIterative Process

Oleksandr Baiev, PhD, Sr. Engineer Samsung R&D Instutute Ukraine DNN for Mobile Platforms 21 / 27

Outline

Problems overviewKey problemsDeeper problem overview

TopologyOverview

QuantizationOverview

PruningOverviewExamples

Oleksandr Baiev, PhD, Sr. Engineer Samsung R&D Instutute Ukraine DNN for Mobile Platforms 22 / 27

Pruning example VGG

Song Han, DSD: Regularizing Deep Neural Networks with Dense-Sparse-Dense Training Flow, 2016

Oleksandr Baiev, PhD, Sr. Engineer Samsung R&D Instutute Ukraine DNN for Mobile Platforms 23 / 27

Pruning example VGG

Oleksandr Baiev, PhD, Sr. Engineer Samsung R&D Instutute Ukraine DNN for Mobile Platforms 24 / 27

Pruning example GoogLeNet

Hao Li, et.al., Pruning Filters for Efficient ConvNets, 2016

Oleksandr Baiev, PhD, Sr. Engineer Samsung R&D Instutute Ukraine DNN for Mobile Platforms 25 / 27

Pruning example ResNet-152

Oleksandr Baiev, PhD, Sr. Engineer Samsung R&D Instutute Ukraine DNN for Mobile Platforms 26 / 27

Summary

Deep neural networks provides excellent quality, but requires powerfulcomputation instances

There are several simple and useful approaches for reducing requiredmemory size and execution time without increasing hardware cost

Oleksandr Baiev, PhD, Sr. Engineer Samsung R&D Instutute Ukraine DNN for Mobile Platforms 27 / 27

top related