cpsc340 - computer science at ubcnando/340-2012/lectures/l23.pdf · cpsc340 nando de freitas ......

52
CPSC340 Nando de Freitas November, 2012 University of British Columbia State-of-the-art Neural Networks

Upload: builiem

Post on 31-Aug-2018

221 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: CPSC340 - Computer Science at UBCnando/340-2012/lectures/l23.pdf · CPSC340 Nando de Freitas ... Neural Networks . Outline of the lecture This lecture provides an overview of two

CPSC340

Nando de FreitasNovember, 2012University of British Columbia

State-of-the-art

Neural Networks

Page 2: CPSC340 - Computer Science at UBCnando/340-2012/lectures/l23.pdf · CPSC340 Nando de Freitas ... Neural Networks . Outline of the lecture This lecture provides an overview of two

Outline of the lecture

This lecture provides an overview of two state-of-the-art neural networks: The google net and dropout nets. It discusses:

� Unsupervised learning with NNs (autoencoders)� Pooling� Local contrast normalizationLocal contrast normalization� Randomization for regularization

Page 3: CPSC340 - Computer Science at UBCnando/340-2012/lectures/l23.pdf · CPSC340 Nando de Freitas ... Neural Networks . Outline of the lecture This lecture provides an overview of two

MLP – Regressionu o

θ

1

2

u o

1

θ

θ

3

4

θ

6

7

15θ

11

12

21

12

11

xiyi = E(yi | xi ,θ θ θ θ )^

Page 4: CPSC340 - Computer Science at UBCnando/340-2012/lectures/l23.pdf · CPSC340 Nando de Freitas ... Neural Networks . Outline of the lecture This lecture provides an overview of two

MLP – Regressionu o

θ

1

2

u o

1

θ

θ

3

4

θ

6

7

15θ

11

12

21

12

11

xiyi = E(yi | xi ,θ θ θ θ )^

Page 5: CPSC340 - Computer Science at UBCnando/340-2012/lectures/l23.pdf · CPSC340 Nando de Freitas ... Neural Networks . Outline of the lecture This lecture provides an overview of two

Backpropagationu o

x

θ

1

2

u o

1

θ

θ

3

4

u oθ

θ

6

7

15θ

11

12

21 21

12

11

Page 6: CPSC340 - Computer Science at UBCnando/340-2012/lectures/l23.pdf · CPSC340 Nando de Freitas ... Neural Networks . Outline of the lecture This lecture provides an overview of two

Backpropagationu o

x

θ

1

2

u o

1

θ

θ

3

4

u oθ

θ

6

7

15θ

11

12

21 21

12

11

Page 7: CPSC340 - Computer Science at UBCnando/340-2012/lectures/l23.pdf · CPSC340 Nando de Freitas ... Neural Networks . Outline of the lecture This lecture provides an overview of two

How do we deal with unsupervised data?

In life, we get little supervisionsupervision

e.g., no one teaches us how to see

Page 8: CPSC340 - Computer Science at UBCnando/340-2012/lectures/l23.pdf · CPSC340 Nando de Freitas ... Neural Networks . Outline of the lecture This lecture provides an overview of two
Page 9: CPSC340 - Computer Science at UBCnando/340-2012/lectures/l23.pdf · CPSC340 Nando de Freitas ... Neural Networks . Outline of the lecture This lecture provides an overview of two
Page 10: CPSC340 - Computer Science at UBCnando/340-2012/lectures/l23.pdf · CPSC340 Nando de Freitas ... Neural Networks . Outline of the lecture This lecture provides an overview of two
Page 11: CPSC340 - Computer Science at UBCnando/340-2012/lectures/l23.pdf · CPSC340 Nando de Freitas ... Neural Networks . Outline of the lecture This lecture provides an overview of two
Page 12: CPSC340 - Computer Science at UBCnando/340-2012/lectures/l23.pdf · CPSC340 Nando de Freitas ... Neural Networks . Outline of the lecture This lecture provides an overview of two

More layers:

deep learningdeep learning

Page 13: CPSC340 - Computer Science at UBCnando/340-2012/lectures/l23.pdf · CPSC340 Nando de Freitas ... Neural Networks . Outline of the lecture This lecture provides an overview of two
Page 14: CPSC340 - Computer Science at UBCnando/340-2012/lectures/l23.pdf · CPSC340 Nando de Freitas ... Neural Networks . Outline of the lecture This lecture provides an overview of two
Page 15: CPSC340 - Computer Science at UBCnando/340-2012/lectures/l23.pdf · CPSC340 Nando de Freitas ... Neural Networks . Outline of the lecture This lecture provides an overview of two

Google autoencoder

Page 16: CPSC340 - Computer Science at UBCnando/340-2012/lectures/l23.pdf · CPSC340 Nando de Freitas ... Neural Networks . Outline of the lecture This lecture provides an overview of two
Page 17: CPSC340 - Computer Science at UBCnando/340-2012/lectures/l23.pdf · CPSC340 Nando de Freitas ... Neural Networks . Outline of the lecture This lecture provides an overview of two
Page 18: CPSC340 - Computer Science at UBCnando/340-2012/lectures/l23.pdf · CPSC340 Nando de Freitas ... Neural Networks . Outline of the lecture This lecture provides an overview of two
Page 19: CPSC340 - Computer Science at UBCnando/340-2012/lectures/l23.pdf · CPSC340 Nando de Freitas ... Neural Networks . Outline of the lecture This lecture provides an overview of two
Page 20: CPSC340 - Computer Science at UBCnando/340-2012/lectures/l23.pdf · CPSC340 Nando de Freitas ... Neural Networks . Outline of the lecture This lecture provides an overview of two
Page 21: CPSC340 - Computer Science at UBCnando/340-2012/lectures/l23.pdf · CPSC340 Nando de Freitas ... Neural Networks . Outline of the lecture This lecture provides an overview of two
Page 22: CPSC340 - Computer Science at UBCnando/340-2012/lectures/l23.pdf · CPSC340 Nando de Freitas ... Neural Networks . Outline of the lecture This lecture provides an overview of two
Page 23: CPSC340 - Computer Science at UBCnando/340-2012/lectures/l23.pdf · CPSC340 Nando de Freitas ... Neural Networks . Outline of the lecture This lecture provides an overview of two
Page 24: CPSC340 - Computer Science at UBCnando/340-2012/lectures/l23.pdf · CPSC340 Nando de Freitas ... Neural Networks . Outline of the lecture This lecture provides an overview of two

from Quiroga et al. “Invariant visual representation by single neurons in the human brain” Nature 2005

“Here we report on a remarkable subset of MTL neurons

that are selectively activated by strikingly different

pictures of given individuals, landmarks or objects and in

some cases even by letter strings with their names.”

24

Halle Berry neuron

Page 25: CPSC340 - Computer Science at UBCnando/340-2012/lectures/l23.pdf · CPSC340 Nando de Freitas ... Neural Networks . Outline of the lecture This lecture provides an overview of two

Unsupervised Learning With 1B Parameters

DATA: 10M youtube (unlabeled) frames of size 200x200.

Ranzato

Page 26: CPSC340 - Computer Science at UBCnando/340-2012/lectures/l23.pdf · CPSC340 Nando de Freitas ... Neural Networks . Outline of the lecture This lecture provides an overview of two

Deep Net:

– 3 stages

– each stage consists of local filtering, L2 pooling, LCN

- 18x18 filters

- 8 filters at each location

Unsupervised Learning With 1B Parameters

- 8 filters at each location

- L2 pooling and LCN over 5x5 neighborhoods

– training jointly the three layers by:

- reconstructing the input of each layer

- sparsity on the code

RanzatoLe et al. “Building high-level features using large-scale unsupervised learning” ICML 2012

Page 27: CPSC340 - Computer Science at UBCnando/340-2012/lectures/l23.pdf · CPSC340 Nando de Freitas ... Neural Networks . Outline of the lecture This lecture provides an overview of two

Unsupervised Learning With 1B Parameters

Filtering L2Pooling LCN

One stage (zoom)

Ranzato

Whole system

1st stage 2nd stage 3rd stage

Input Image

Page 28: CPSC340 - Computer Science at UBCnando/340-2012/lectures/l23.pdf · CPSC340 Nando de Freitas ... Neural Networks . Outline of the lecture This lecture provides an overview of two

Validating Unsupervised Learning

The network has seen lots of objects during training, but

without any label.

Q.: how can we validate unsupervised learning?

Q.: Did the network form any high-level representation?

Ranzato

Q.: Did the network form any high-level representation?

E.g., does it have any neuron responding for faces?

– build validation set with 50% faces, 50% random images

- study properties of neurons

Page 29: CPSC340 - Computer Science at UBCnando/340-2012/lectures/l23.pdf · CPSC340 Nando de Freitas ... Neural Networks . Outline of the lecture This lecture provides an overview of two

Validating Unsupervised Learningn

euro

n resp

on

ses

1st stage 2nd stage 3rd stage

neu

ron

respo

nses

Ranzato

Face / No face

Page 30: CPSC340 - Computer Science at UBCnando/340-2012/lectures/l23.pdf · CPSC340 Nando de Freitas ... Neural Networks . Outline of the lecture This lecture provides an overview of two

Top Images For Best Face Neuron

Page 31: CPSC340 - Computer Science at UBCnando/340-2012/lectures/l23.pdf · CPSC340 Nando de Freitas ... Neural Networks . Outline of the lecture This lecture provides an overview of two

Best Input For Face Neuron

Ranzato

Page 32: CPSC340 - Computer Science at UBCnando/340-2012/lectures/l23.pdf · CPSC340 Nando de Freitas ... Neural Networks . Outline of the lecture This lecture provides an overview of two

Invariance Properties

Ranzato

Page 33: CPSC340 - Computer Science at UBCnando/340-2012/lectures/l23.pdf · CPSC340 Nando de Freitas ... Neural Networks . Outline of the lecture This lecture provides an overview of two

Invariance Properties

Ranzato

Page 34: CPSC340 - Computer Science at UBCnando/340-2012/lectures/l23.pdf · CPSC340 Nando de Freitas ... Neural Networks . Outline of the lecture This lecture provides an overview of two

Cat Neuron

Ranzato

Page 35: CPSC340 - Computer Science at UBCnando/340-2012/lectures/l23.pdf · CPSC340 Nando de Freitas ... Neural Networks . Outline of the lecture This lecture provides an overview of two

Top Images for Cat Neuron

Ranzato

Page 36: CPSC340 - Computer Science at UBCnando/340-2012/lectures/l23.pdf · CPSC340 Nando de Freitas ... Neural Networks . Outline of the lecture This lecture provides an overview of two

Object Recognition on ImageNet

IMAGENET v.2011 (16M images, 20K categories)

METHOD ACCURACY %

Weston & Bengio 2011 9.3

Deep Net (from random)

Deep Net (from unsup.)

13.6

15.8

Linear Classifier on deep features 13.1

Ranzato

Page 37: CPSC340 - Computer Science at UBCnando/340-2012/lectures/l23.pdf · CPSC340 Nando de Freitas ... Neural Networks . Outline of the lecture This lecture provides an overview of two
Page 38: CPSC340 - Computer Science at UBCnando/340-2012/lectures/l23.pdf · CPSC340 Nando de Freitas ... Neural Networks . Outline of the lecture This lecture provides an overview of two

• Classification: For 20 classes, predict presence/absence of an example of that class in the test image.• Detection: Predict the bounding box and label of each object from the 20 target classes in the test image.

Visual Object Classes Challenge 2012 (VOC2012)

Person: personAnimal: bird, cat, cow, dog, horse, sheepVehicle: aeroplane, bicycle, boat, bus, car, motorbike, trainIndoor: bottle, chair, dining table, potted plant, sofa, tv/monitor

Page 39: CPSC340 - Computer Science at UBCnando/340-2012/lectures/l23.pdf · CPSC340 Nando de Freitas ... Neural Networks . Outline of the lecture This lecture provides an overview of two

Team name Error (5 guesses) Description

SuperVision 0.15315 Using extra training data from ImageNet Fall 2011 release

SuperVision 0.16422 Using only supplied training data

ISI 0.26172Weighted sum of scores from each classifier with SIFT+FV, LBP+FV, GIST+FV, and CSIFT+FV, respectively.

ISI 0.26602 Weighted sum of scores from classifiers using each FV.

ISI 0.26646 Naive sum of scores from classifiers using each FV.

ISI 0.26952Naive sum of scores from each classifier with SIFT+FV, LBP+FV, GIST+FV, and CSIFT+FV, respectively.

OXFORD_VGG 0.26979Mixed selection from High-Level SVM scores and Baseline Scores, decision is

Task 1 (classification)

OXFORD_VGG 0.26979performed by looking at the validation performance

XRCE/INRIA 0.27058

OXFORD_VGG 0.27079High-Level SVM over Fine Level Classification score, DPM score and Baseline Classification scores (Fisher Vectors over Dense SIFT and Color Statistics)

OXFORD_VGG 0.27302 Baseline: SVM trained on Fisher Vectors over Dense SIFT and Color Statistics

University of Amsterdam 0.29576 See text aboveXRCE/INRIA 0.33419

LEAR-XRCE 0.34464 Trained on ILSVRC'12 - using a mixture of NCM classifiers

LEAR-XRCE 0.36184 Trained on ILSVRC'12 - using NCM

LEAR-XRCE 0.38006 Trained on ILSVRC'10 - using a mixture of NCM classifiers

LEAR-XRCE 0.41048 Trained on ILSVRC'10 - using NCM

Page 40: CPSC340 - Computer Science at UBCnando/340-2012/lectures/l23.pdf · CPSC340 Nando de Freitas ... Neural Networks . Outline of the lecture This lecture provides an overview of two

Team name Error (5 guesses) Description

SuperVision 0.335463 Using extra training data for classification from ImageNet Fall 2011 release

SuperVision 0.341905 Using only supplied training data

OXFORD_VGG 0.500342Re-ranked DPM detection over Mixed selection from High-Level SVM scores and Baseline Scores, decision is performed by looking at the validation performance

Task 2 (localization)

OXFORD_VGG 0.50139 Re-ranked DPM detection over High-Level SVM Scores

OXFORD_VGG 0.522189 Re-ranked DPM detection over High-Level SVM Scores - First bbox selection heuristic

OXFORD_VGG 0.529482 DPM detection over baseline classification scores

ISI 0.536474We use the cascade object detection with deformable part models, restricting the sizes of bounding boxes.

ISI 0.536546We use the cascade object detection with deformable part models, restricting the sizes of bounding boxes.

Page 41: CPSC340 - Computer Science at UBCnando/340-2012/lectures/l23.pdf · CPSC340 Nando de Freitas ... Neural Networks . Outline of the lecture This lecture provides an overview of two

Training on Small Image Patches

Pick patches at random

locations for training

Ranzato

Page 42: CPSC340 - Computer Science at UBCnando/340-2012/lectures/l23.pdf · CPSC340 Nando de Freitas ... Neural Networks . Outline of the lecture This lecture provides an overview of two

This is not a good way to extend the model

to big images: block artifacts

From Patches to High-Resolution Images

But we could also take

them from a grid

Ranzato

Page 43: CPSC340 - Computer Science at UBCnando/340-2012/lectures/l23.pdf · CPSC340 Nando de Freitas ... Neural Networks . Outline of the lecture This lecture provides an overview of two

IDEA: have one subset of filters applied to these locations,

From Patches to High-Resolution Images

Ranzato

Page 44: CPSC340 - Computer Science at UBCnando/340-2012/lectures/l23.pdf · CPSC340 Nando de Freitas ... Neural Networks . Outline of the lecture This lecture provides an overview of two

IDEA: have one subset of filters applied to these locations,

another subset to these locations

From Patches to High-Resolution Images

Ranzato

Page 45: CPSC340 - Computer Science at UBCnando/340-2012/lectures/l23.pdf · CPSC340 Nando de Freitas ... Neural Networks . Outline of the lecture This lecture provides an overview of two

IDEA: have one subset of filters applied to these locations,

another subset to these locations, etc.

From Patches to High-Resolution Images

Gregor LeCun arXiv 2010Ranzato, Mnih, Hinton NIPS 2010

Train jointly all parameters.

No block artifacts

Reduced redundancy

Page 46: CPSC340 - Computer Science at UBCnando/340-2012/lectures/l23.pdf · CPSC340 Nando de Freitas ... Neural Networks . Outline of the lecture This lecture provides an overview of two

Hinton

Page 47: CPSC340 - Computer Science at UBCnando/340-2012/lectures/l23.pdf · CPSC340 Nando de Freitas ... Neural Networks . Outline of the lecture This lecture provides an overview of two

Hinton

Page 48: CPSC340 - Computer Science at UBCnando/340-2012/lectures/l23.pdf · CPSC340 Nando de Freitas ... Neural Networks . Outline of the lecture This lecture provides an overview of two

Hinton

Page 49: CPSC340 - Computer Science at UBCnando/340-2012/lectures/l23.pdf · CPSC340 Nando de Freitas ... Neural Networks . Outline of the lecture This lecture provides an overview of two

Hinton

Page 50: CPSC340 - Computer Science at UBCnando/340-2012/lectures/l23.pdf · CPSC340 Nando de Freitas ... Neural Networks . Outline of the lecture This lecture provides an overview of two
Page 51: CPSC340 - Computer Science at UBCnando/340-2012/lectures/l23.pdf · CPSC340 Nando de Freitas ... Neural Networks . Outline of the lecture This lecture provides an overview of two
Page 52: CPSC340 - Computer Science at UBCnando/340-2012/lectures/l23.pdf · CPSC340 Nando de Freitas ... Neural Networks . Outline of the lecture This lecture provides an overview of two

Next lecture

The next lecture introduces random forests.