visualizing and understanding convolution...

18
Visualizing and Understanding Convolution Networks Authors: Mathew Zeiler and Rob Fergus New York University Select slides from Hamid Izadinia Presentation by Jason Driver 10/13/2016

Upload: others

Post on 08-Oct-2020

0 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Visualizing and Understanding Convolution Networksweb.cs.ucdavis.edu/~yjlee/teaching/ecs289g-fall2016/jason.pdf · Visualizing and Understanding Convolution Networks Authors: Mathew

Visualizing and Understanding Convolution

NetworksAuthors: Mathew Zeiler and Rob Fergus

New York UniversitySelect slides from Hamid Izadinia

Presentation by Jason Driver

10/13/2016

Page 2: Visualizing and Understanding Convolution Networksweb.cs.ucdavis.edu/~yjlee/teaching/ecs289g-fall2016/jason.pdf · Visualizing and Understanding Convolution Networks Authors: Mathew

Problem and Contributions

• Greatly improved performance using convolution neural network architecture over the previous state of the art

• No clear understanding why CNN work• Technique for visualizing individual layers with a Deconvolution Network• Troubleshoot the output of each layer• Introduces an image occlusion experiment for spatial understanding• Optimal architecture

Page 3: Visualizing and Understanding Convolution Networksweb.cs.ucdavis.edu/~yjlee/teaching/ecs289g-fall2016/jason.pdf · Visualizing and Understanding Convolution Networks Authors: Mathew

Hierarchical ConvolutionNonlinear operations (max pooling, ReLu)

Page 4: Visualizing and Understanding Convolution Networksweb.cs.ucdavis.edu/~yjlee/teaching/ecs289g-fall2016/jason.pdf · Visualizing and Understanding Convolution Networks Authors: Mathew
Page 5: Visualizing and Understanding Convolution Networksweb.cs.ucdavis.edu/~yjlee/teaching/ecs289g-fall2016/jason.pdf · Visualizing and Understanding Convolution Networks Authors: Mathew
Page 6: Visualizing and Understanding Convolution Networksweb.cs.ucdavis.edu/~yjlee/teaching/ecs289g-fall2016/jason.pdf · Visualizing and Understanding Convolution Networks Authors: Mathew
Page 7: Visualizing and Understanding Convolution Networksweb.cs.ucdavis.edu/~yjlee/teaching/ecs289g-fall2016/jason.pdf · Visualizing and Understanding Convolution Networks Authors: Mathew
Page 8: Visualizing and Understanding Convolution Networksweb.cs.ucdavis.edu/~yjlee/teaching/ecs289g-fall2016/jason.pdf · Visualizing and Understanding Convolution Networks Authors: Mathew

AlexNet

Page 9: Visualizing and Understanding Convolution Networksweb.cs.ucdavis.edu/~yjlee/teaching/ecs289g-fall2016/jason.pdf · Visualizing and Understanding Convolution Networks Authors: Mathew
Page 10: Visualizing and Understanding Convolution Networksweb.cs.ucdavis.edu/~yjlee/teaching/ecs289g-fall2016/jason.pdf · Visualizing and Understanding Convolution Networks Authors: Mathew

Feature Visualization

Page 11: Visualizing and Understanding Convolution Networksweb.cs.ucdavis.edu/~yjlee/teaching/ecs289g-fall2016/jason.pdf · Visualizing and Understanding Convolution Networks Authors: Mathew

Feature Visualization

Page 12: Visualizing and Understanding Convolution Networksweb.cs.ucdavis.edu/~yjlee/teaching/ecs289g-fall2016/jason.pdf · Visualizing and Understanding Convolution Networks Authors: Mathew

Feature Visualization

Page 13: Visualizing and Understanding Convolution Networksweb.cs.ucdavis.edu/~yjlee/teaching/ecs289g-fall2016/jason.pdf · Visualizing and Understanding Convolution Networks Authors: Mathew

Feature Visualization

Lower Order Features Higher Order Features

Page 14: Visualizing and Understanding Convolution Networksweb.cs.ucdavis.edu/~yjlee/teaching/ecs289g-fall2016/jason.pdf · Visualizing and Understanding Convolution Networks Authors: Mathew

Feature Invariance in Layers 1, 7, 8

Euclidean distance between feature of transformed and original

Rota

tion

Scal

eTr

ansla

tion

Page 15: Visualizing and Understanding Convolution Networksweb.cs.ucdavis.edu/~yjlee/teaching/ecs289g-fall2016/jason.pdf · Visualizing and Understanding Convolution Networks Authors: Mathew

Architecture Selection

Layer 1 Layer 2

• Smaller filters (7x7 vs. 11x11) and smaller stride (2 vs. 4)• 1st Layer: Increased coverage of mid-frequencies (Original is b)• 2nd Layer: no aliasing, no “dead” features (Original is d)

Page 16: Visualizing and Understanding Convolution Networksweb.cs.ucdavis.edu/~yjlee/teaching/ecs289g-fall2016/jason.pdf · Visualizing and Understanding Convolution Networks Authors: Mathew

Occlusion Sensitivity

Page 17: Visualizing and Understanding Convolution Networksweb.cs.ucdavis.edu/~yjlee/teaching/ecs289g-fall2016/jason.pdf · Visualizing and Understanding Convolution Networks Authors: Mathew

Correspondence Analysis

• 5th Layer preserves correspondence

• 7th Layer discriminates different classes of dog

Page 18: Visualizing and Understanding Convolution Networksweb.cs.ucdavis.edu/~yjlee/teaching/ecs289g-fall2016/jason.pdf · Visualizing and Understanding Convolution Networks Authors: Mathew

Results