
Advances in Deep Learning
by Wojciech Zaremba

Grad student at … · Intern at Google Brain · Ex-intern at …

Outline

● Success stories
● Neural networks
● Convolutional neural networks
● Recurrent neural networks
● Flaws

House Number Identification in Street View*

*"Multi-digit Number Recognition from Street View Imagery using Deep Convolutional Neural Networks" by Ian J. Goodfellow, Yaroslav Bulatov, Julian Ibarz, Sacha Arnoud, Vinay Shet

Image Classification (query animal)

ImageNet classification results
1M training images, 1K categories, top-5 error

Best deep-learning models: ~9%

Non-deep-learning models (ISI, Japan; Oxford, England; INRIA, France; University of Amsterdam, etc.): ~26%

Speech recognition

Text understanding*

*“Efficient estimation of word representations in vector space” by Tomas Mikolov, Kai Chen, Greg Corrado, Jeffrey Dean

Translation*

*"Exploiting similarities among languages for machine translation" by Tomas Mikolov, Quoc V. Le, Ilya Sutskever

Neural networks

● Depth is essential for good performance (a minimal sketch of a deep feedforward net follows below)

● Large amounts of data show their power

● Success due to fast GPUs
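
To fix what "depth" means here, a minimal sketch of a deep feedforward network in plain NumPy (layer sizes and names are illustrative, not from the talk): each additional weight matrix is one more layer.

```python
import numpy as np

def relu(z):
    return np.maximum(0.0, z)

def mlp_forward(x, weights, biases):
    """Forward pass of a simple feedforward net: every extra (W, b)
    pair in the lists is one more layer of depth."""
    h = x
    for W, b in zip(weights[:-1], biases[:-1]):
        h = relu(W @ h + b)               # hidden layers with nonlinearity
    return weights[-1] @ h + biases[-1]   # final linear layer (logits)

# Illustrative deep network: 784 -> 256 -> 128 -> 10
sizes = [784, 256, 128, 10]
rng = np.random.default_rng(0)
weights = [rng.normal(scale=0.01, size=(o, i)) for i, o in zip(sizes, sizes[1:])]
biases = [np.zeros(o) for o in sizes[1:]]
logits = mlp_forward(rng.normal(size=784), weights, biases)
print(logits.shape)  # (10,)
```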

Convolutional neural networks

Assumptions:
● At a very small scale, every piece of an image should be processed the same way, i.e. the same filter weights are applied at every location (see the convolution sketch below)
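
A minimal sketch of that weight-sharing assumption, using a single small kernel slid over a grayscale image (plain NumPy, "valid" convolution; all names and sizes here are illustrative):

```python
import numpy as np

def conv2d_valid(image, kernel):
    """Apply the SAME small kernel to every patch of the image
    (weight sharing): the output at (i, j) depends only on the local
    patch at (i, j) and one shared set of weights."""
    kh, kw = kernel.shape
    H, W = image.shape
    out = np.zeros((H - kh + 1, W - kw + 1))
    for i in range(out.shape[0]):
        for j in range(out.shape[1]):
            out[i, j] = np.sum(image[i:i + kh, j:j + kw] * kernel)
    return out

# Example: one 3x3 kernel shared across an 8x8 image.
image = np.random.rand(8, 8)
kernel = np.array([[1., 0., -1.],
                   [2., 0., -2.],
                   [1., 0., -1.]])
print(conv2d_valid(image, kernel).shape)  # (6, 6)
```

A convolutional layer is just many such kernels, so its parameter count stays small no matter how large the input image is.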

Convolutional networks - setting

● 60 million parameters
● Training takes ~5 days on a GTX Titan
● ~90% of the time is spent in convolutions, ~10% in the fully connected layers
● ~5% of the parameters are in convolutions, ~95% in the fully connected layers (see the parameter-count sketch below)
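
The split is easy to reproduce by counting weights and biases layer by layer. A rough sketch, assuming torchvision-style AlexNet layer shapes (an assumption for illustration; exact numbers depend on the architecture variant):

```python
# (out_channels, in_channels, kernel_h, kernel_w) for each conv layer
conv_layers = [
    (64, 3, 11, 11), (192, 64, 5, 5), (384, 192, 3, 3),
    (256, 384, 3, 3), (256, 256, 3, 3),
]
# (out_features, in_features) for each fully connected layer
fc_layers = [(4096, 256 * 6 * 6), (4096, 4096), (1000, 4096)]

conv_params = sum(o * i * kh * kw + o for o, i, kh, kw in conv_layers)
fc_params = sum(o * i + o for o, i in fc_layers)
total = conv_params + fc_params

print(f"conv: {conv_params / 1e6:.1f}M ({conv_params / total:.0%})")
print(f"fc:   {fc_params / 1e6:.1f}M ({fc_params / total:.0%})")
# conv: ~2.5M (~4%), fc: ~58.6M (~96%), total ~61M
```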

Recurrent neural networks - schema

[Diagram: an RNN unrolled over a sentence; the inputs "My", "name", "is" produce the predicted next words "name", "is", "Wojciech".]
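
The recurrence behind the diagram, as a minimal NumPy sketch (dimensions and variable names are illustrative): the same weights are reused at every time step, and the hidden state carries context forward.

```python
import numpy as np

def rnn_step(x_t, h_prev, W_xh, W_hh, b_h):
    """One step of a vanilla RNN: h_t = tanh(W_xh x_t + W_hh h_{t-1} + b_h)."""
    return np.tanh(W_xh @ x_t + W_hh @ h_prev + b_h)

embed_dim, hidden_dim, T = 50, 32, 3   # e.g. "My", "name", "is"
rng = np.random.default_rng(0)
W_xh = rng.normal(scale=0.1, size=(hidden_dim, embed_dim))
W_hh = rng.normal(scale=0.1, size=(hidden_dim, hidden_dim))
b_h = np.zeros(hidden_dim)

h = np.zeros(hidden_dim)
for t in range(T):
    x_t = rng.normal(size=embed_dim)        # stand-in for a word embedding
    h = rnn_step(x_t, h, W_xh, W_hh, b_h)   # same weights at every step
# A softmax over a vocabulary-sized projection of h would predict the next word.
```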

Long short-term memory (LSTM)

● Variant of an RNN
● Differentiable memory unit (cell update sketched below)
● Equivalent to an RNN, but easier to train
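
For reference, a minimal sketch of one common LSTM cell formulation (gate ordering and bias handling vary between implementations; this is not tied to the experiments in the talk):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def lstm_step(x_t, h_prev, c_prev, W, b):
    """One LSTM step. W maps [x_t; h_prev] to the stacked pre-activations
    of the input (i), forget (f), output (o) gates and candidate cell (g)."""
    z = W @ np.concatenate([x_t, h_prev]) + b
    i, f, o, g = np.split(z, 4)
    i, f, o, g = sigmoid(i), sigmoid(f), sigmoid(o), np.tanh(g)
    c_t = f * c_prev + i * g        # the differentiable memory cell
    h_t = o * np.tanh(c_t)
    return h_t, c_t

x_dim, h_dim = 10, 8
rng = np.random.default_rng(0)
W = rng.normal(scale=0.1, size=(4 * h_dim, x_dim + h_dim))
b = np.zeros(4 * h_dim)
h, c = np.zeros(h_dim), np.zeros(h_dim)
h, c = lstm_step(rng.normal(size=x_dim), h, c, W, b)
```

The additive cell update c_t = f * c_prev + i * g is what makes gradients easier to propagate over long sequences than in a plain RNN.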

Language modeling (Penn Treebank)

Model Perplexity (lower is better)

Previous state of the art 73

Our result 68
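
Perplexity is the exponential of the average per-word negative log-likelihood on the test set; a minimal sketch with made-up probabilities, only to show the computation:

```python
import math

def perplexity(word_probs):
    """exp of the average negative log-likelihood over the test words."""
    nll = -sum(math.log(p) for p in word_probs) / len(word_probs)
    return math.exp(nll)

# Illustrative: probabilities the model assigned to each actual next word.
print(perplexity([0.1, 0.02, 0.3, 0.05]))  # lower is better
```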

Recurrent networks - applications

● Text understanding
● Video modeling through consecutive-frame prediction
● Translation
● Speech recognition
● Autocompletion

Recurrent neural networks

Prompt: "The meaning of life is"

Model continuation: "the tradition of the ancient human reproduction: it is less favorable to the good boy for when to remove her bigger."

Flaws*

[Image pair: the original image, whose object is correctly predicted, and a slightly distorted version that the network predicts as "ostrich".]

*"Intriguing properties of neural networks", joint work with Christian Szegedy, Ilya Sutskever, Joan Bruna, Dumitru Erhan, Ian Goodfellow, and Rob Fergus

How to?

● Negative examples generated with backpropagation with respect to the input (a simplified sketch follows below)

● Constrained to lie in the feasible set (valid color range)
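
The paper itself finds minimal distortions with a box-constrained L-BFGS optimization; the sketch below is a deliberately simplified one-gradient-step variant, assuming a generic PyTorch classifier with inputs in [0, 1], only to illustrate "backpropagate to the input, then clip to the feasible set".

```python
import torch
import torch.nn.functional as F

def negative_example(model, x, true_label, eps=0.05):
    """Simplified sketch: take one gradient step on the INPUT that increases
    the loss on the true label, then clip back to the valid pixel range.
    x: batch of images in [0, 1]; true_label: batch of class indices.
    (The paper uses box-constrained L-BFGS to find minimal distortions.)"""
    x_adv = x.clone().detach().requires_grad_(True)
    loss = F.cross_entropy(model(x_adv), true_label)
    loss.backward()
    with torch.no_grad():
        x_adv = x_adv + eps * x_adv.grad.sign()  # move against the model
        x_adv = x_adv.clamp(0.0, 1.0)            # stay in the feasible set
    return x_adv.detach()
```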

Cross-model transfer

Training error: Model A 0%, Model B 0%

Error rate on distorted test inputs:

            Negative examples   Negative examples   Gaussian noise
            for Model A         for Model B         (std = 0.1)
Model A     100%                6.6%                0%
Model B     20.3%               100%                0%

Different fully connected networks trained on the MNIST dataset. Average distortion ~6%.

Cross training-data transfer

Error rate:

            Training P1   Training P2
Model A     0%            2.4%
Model B     2.5%          0%

Different fully connected networks trained on the MNIST dataset (A on P1, B on P2). Distortion ~6%.

Error rate on negative examples:

            Test distortion for A   Test distortion for B
Model A     100%                    6.25%
Model B     26.2%                   100%

Flaws - conclusions

● Different networks share properties that depend on the statistics of the training sets (not only on particular samples).

● Can negative examples be used to improve generalization?

Q&A

● Success stories
● Neural networks
● Convolutional neural networks
● Recurrent neural networks
● Flaws

I am happy to take any questions.
