
Visualizing and Understanding Convolutional Networks

Matthew D. Zeiler, Rob Fergus

Presented by Huan Jin

Overview

• What are the models learning?

• Which part of the model is key to performance?

• Do the features generalize?

Convolutional Neural Network

Visualization with a Deconvnet

• Unpooling: max-pooling is non-invertible; a "switch" records the location of the maximum within each pooling region, so unpooling can place each value back where it came from.

• Rectification: pass the reconstructed signal through a ReLU.

• Filtering: apply the transposed version of the corresponding convnet filter to the rectified maps.
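The unpooling step above, with its recorded switches, can be sketched in NumPy. This is a minimal illustration, not the paper's implementation; the function names and the 2×2 pooling size are assumptions for the example:

```python
import numpy as np

def maxpool_with_switches(x, k=2):
    """k-by-k max-pool that also records the 'switch' (argmax) locations."""
    h, w = x.shape
    pooled = np.zeros((h // k, w // k))
    switches = np.zeros((h // k, w // k), dtype=int)  # flat index within each region
    for i in range(h // k):
        for j in range(w // k):
            region = x[i*k:(i+1)*k, j*k:(j+1)*k]
            switches[i, j] = region.argmax()
            pooled[i, j] = region.max()
    return pooled, switches

def unpool(pooled, switches, k=2):
    """Deconvnet unpooling: place each value back at its recorded switch location."""
    h, w = pooled.shape
    out = np.zeros((h * k, w * k))
    for i in range(h):
        for j in range(w):
            r, c = divmod(switches[i, j], k)
            out[i*k + r, j*k + c] = pooled[i, j]
    return out
```

The rectification step is then just `np.maximum(out, 0)`; the key idea is that the switches make the otherwise lossy pooling approximately invertible for the strongest activations.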

Convnet Visualization: Feature Visualization

Layer 2: corners and other edge/color conjunctions.

Layer 3: more complex invariances, capturing similar textures (e.g., mesh patterns, text).

Convnet Visualization: Feature Visualization

Layer 4: significant variation, more class-specific.

Layer 5: entire objects with significant pose variation.

Convnet Visualization: Feature Evolution during Training

The lower layers of the model converge within a few epochs; the upper layers only develop after a considerable number of epochs (40-50). The model should therefore be trained until it has fully converged.

[Figure: feature visualizations of each layer at epochs 1, 2, 5, 10, 20, 30, 40, and 64]

Convnet Visualization: Feature Invariance

Translation (Horizontal)

[Figure: output, Layer 1, and Layer 7 responses as the input is translated horizontally: dramatic effect at Layer 1, lesser impact at Layer 7]

Convnet Visualization: Feature Invariance

Scale Invariance

[Figure: output, Layer 1, and Layer 7 responses as the input is scaled: dramatic effect at Layer 1, lesser impact at Layer 7]

Convnet Visualization: Feature Invariance

Rotation Variance

[Figure: output, Layer 1, and Layer 7 responses as the input is rotated]

Convnet Visualization: Occlusion Sensitivity

• The model is identifying the location of the object in the image rather than relying on the surrounding context.

Convnet Visualization: Occlusion Sensitivity

• The strongest feature map does not necessarily correspond to the class label: there are multiple feature maps.
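The occlusion experiment can be sketched as a gray square slid over the image, recording the model's score at each occluder position; low scores mark regions the model relies on. Here `score_fn` stands in for the trained classifier's probability of the true class, and all names and sizes are hypothetical:

```python
import numpy as np

def occlusion_map(image, score_fn, patch=2, stride=2):
    """Slide a gray patch over the image and record the classifier score
    for the true class at each occluder position."""
    h, w = image.shape
    heat = np.zeros(((h - patch) // stride + 1, (w - patch) // stride + 1))
    for i in range(heat.shape[0]):
        for j in range(heat.shape[1]):
            occluded = image.copy()
            # gray square: replace the region with the mean pixel value
            occluded[i*stride:i*stride+patch, j*stride:j*stride+patch] = image.mean()
            heat[i, j] = score_fn(occluded)
    return heat
```

Wherever the heat map dips, the occluder has covered something the classifier depends on, which is how the paper localizes the evidence for a class.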

Convnet Visualization: Correspondence Analysis

• $\epsilon_i^l = x_i^l - \tilde{x}_i^l$, where $x_i^l$ and $\tilde{x}_i^l$ are the feature vectors at layer $l$ for the original and occluded images.

• $\Delta_l = \sum_{i,j=1,\, i \neq j}^{5} \mathcal{H}\big(\mathrm{sign}(\epsilon_i^l), \mathrm{sign}(\epsilon_j^l)\big)$, where $\mathcal{H}$ is the Hamming distance.

• The lower $\Delta_l$, the greater the consistency.
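The consistency measure $\Delta_l$ transcribes directly into NumPy. In this sketch `feats` and `feats_occluded` (hypothetical names) hold the layer-$l$ feature vectors for the five original and five occluded images, one per row:

```python
import numpy as np

def correspondence_delta(feats, feats_occluded):
    """Delta_l: sum over ordered image pairs (i != j) of the Hamming
    distance between sign(eps_i) and sign(eps_j), eps_i = x_i - x~_i."""
    eps = np.sign(feats - feats_occluded)          # (n_images, dim) sign patterns
    n = eps.shape[0]
    delta = 0
    for i in range(n):
        for j in range(n):
            if i != j:
                delta += int(np.sum(eps[i] != eps[j]))  # Hamming distance
    return delta
```

If occluding the same object part changes the features in the same direction for every image, the sign patterns agree, the Hamming distances vanish, and $\Delta_l$ is small.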

Layer 5 implicitly establishes some form of correspondence of parts.

Layer 7 tries to discriminate between the different breeds of dog.

Convnet Visualization: Architecture Selection

Visualizing the first layers reveals problems: dead filters in layer 1, first-layer features that are too specific (low-level), and block/aliasing artifacts in layer 2 that leave the mid-level features too simple.

Fix: smaller filters (7×7) and a smaller stride (2) in the first layer.
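One way to sanity-check the revised first layer is the standard convolution output-size formula, floor((size + 2·pad - filter) / stride) + 1. A small helper (the name is hypothetical) computes it:

```python
def conv_out(size, filt, stride, pad=0):
    """Spatial output size of a convolution layer."""
    return (size + 2 * pad - filt) // stride + 1

# With a 224-pixel input, the revised 7x7 stride-2 first layer (with 1 pixel
# of padding) yields a 110x110 map; the smaller stride halves the subsampling
# of the 11x11 stride-4 layer it replaces and reduces aliasing artifacts.
```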

ImageNet 2012 Revisit

The overall depth of the model matters: removing layers degrades performance. Adjusting the architecture (e.g., enlarging the middle convolutional layers) can improve performance, but enlarging the fully connected layers leads to overfitting.

Feature Generalization

Caltech-101

Caltech-256

• Keep layers 1-7 of the ImageNet-trained model fixed, and train a new softmax classifier on top.
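With the convolutional layers frozen, this transfer setup reduces to multinomial logistic regression on pre-extracted features. A minimal NumPy sketch of training only the new softmax classifier (the function name and hyperparameters are assumptions, not the paper's training details):

```python
import numpy as np

def train_softmax(features, labels, n_classes, lr=0.1, epochs=200):
    """Train a softmax classifier on frozen (pre-extracted) features
    by full-batch gradient descent on the cross-entropy loss."""
    n, d = features.shape
    W = np.zeros((d, n_classes))
    Y = np.eye(n_classes)[labels]                       # one-hot targets
    for _ in range(epochs):
        logits = features @ W
        logits -= logits.max(axis=1, keepdims=True)     # numerical stability
        p = np.exp(logits)
        p /= p.sum(axis=1, keepdims=True)               # softmax probabilities
        W -= lr * features.T @ (p - Y) / n              # cross-entropy gradient step
    return W
```

Only `W` is learned here; the ImageNet-trained feature extractor never changes, which is exactly what makes the Caltech results a test of feature generalization.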

Feature Generalization: Caltech-256

Feature Generalization: PASCAL 2012

The model wins on only 5 classes; the PASCAL and ImageNet images are quite different.

Feature Analysis

As the feature hierarchies become deeper, the model can learn increasingly powerful features.

Summary

• Deconvolutional networks for visualization.

• Application to convolutional neural networks.

• Better understanding of what is learned.

• Insight into model construction.