image representation usage guide
DESCRIPTION
Use the deepness to view image representation.TRANSCRIPT
![Page 1: Image representation usage guide](https://reader034.vdocument.in/reader034/viewer/2022042607/558a60f6d8b42a6e6e8b463a/html5/thumbnails/1.jpg)
Lu, Wang-Chou
Image Representation Usage Guide
2014/10/01 @ 林⼜⼝口
![Page 2: Image representation usage guide](https://reader034.vdocument.in/reader034/viewer/2022042607/558a60f6d8b42a6e6e8b463a/html5/thumbnails/2.jpg)
Why Image Representation?
Machine Learning Course @ Caltech
Xi = 500 x 375 D
Low Dimensional Vector
![Page 3: Image representation usage guide](https://reader034.vdocument.in/reader034/viewer/2022042607/558a60f6d8b42a6e6e8b463a/html5/thumbnails/3.jpg)
Map of Representation
Robust
Computation Cost
Color Histogram, !PCA, !
Sparse Coding, !Bag of Visual Word, !
DPM, !Deep Learning…
![Page 4: Image representation usage guide](https://reader034.vdocument.in/reader034/viewer/2022042607/558a60f6d8b42a6e6e8b463a/html5/thumbnails/4.jpg)
Outline❖ Hand Crafted Features!
❖ Machine Learning approach!
❖ Hierarchical Approaches!
❖ When to use? Real-time vs Precision
![Page 5: Image representation usage guide](https://reader034.vdocument.in/reader034/viewer/2022042607/558a60f6d8b42a6e6e8b463a/html5/thumbnails/5.jpg)
Hand Crafted Features❖ Color Histogram, Template, Haar Features!
❖ Interested Point Detector + HOG!
❖ Bag of Visual Word
![Page 6: Image representation usage guide](https://reader034.vdocument.in/reader034/viewer/2022042607/558a60f6d8b42a6e6e8b463a/html5/thumbnails/6.jpg)
Simple FeaturesHistogram Based Haar Features
Template Based
![Page 7: Image representation usage guide](https://reader034.vdocument.in/reader034/viewer/2022042607/558a60f6d8b42a6e6e8b463a/html5/thumbnails/7.jpg)
SIFT Like Approach
HOG, 3780D, !overlapped 7 x 15 cells * !
( normalized 2x2 grid) * 9 binsSIFT 128D,|V| = 1!4 X 4 grid * 8 bins
David Lowe [IJCV 2004] N Dalal et al [CVPR2005]
![Page 8: Image representation usage guide](https://reader034.vdocument.in/reader034/viewer/2022042607/558a60f6d8b42a6e6e8b463a/html5/thumbnails/8.jpg)
Bag of Visual Word + SPM
SVM
SIFT!Descriptor
S. Lazebnik et al [cvpr06]
![Page 9: Image representation usage guide](https://reader034.vdocument.in/reader034/viewer/2022042607/558a60f6d8b42a6e6e8b463a/html5/thumbnails/9.jpg)
Machine Learning Approach❖ Dimensionality Reduction!
❖ PCA, Manifold Learning, Sparse Coding, LSH!
❖ Deformable Part Model!
❖ Neural Network!
❖ Convolution Neural Network
![Page 10: Image representation usage guide](https://reader034.vdocument.in/reader034/viewer/2022042607/558a60f6d8b42a6e6e8b463a/html5/thumbnails/10.jpg)
Principle Component Analysis
MA Turk et al [cvpr91]
![Page 11: Image representation usage guide](https://reader034.vdocument.in/reader034/viewer/2022042607/558a60f6d8b42a6e6e8b463a/html5/thumbnails/11.jpg)
Manifold Learning
[ISOMAP, LLE 2003]
![Page 12: Image representation usage guide](https://reader034.vdocument.in/reader034/viewer/2022042607/558a60f6d8b42a6e6e8b463a/html5/thumbnails/12.jpg)
Sparse Coding
H Lee et al. [NIPS 2007]
reconstruction error sparsity
Y: Input Vector!B: Basis Matrix!
Z: weight
![Page 13: Image representation usage guide](https://reader034.vdocument.in/reader034/viewer/2022042607/558a60f6d8b42a6e6e8b463a/html5/thumbnails/13.jpg)
Locality Sensitive Hashing Embedding
![Page 14: Image representation usage guide](https://reader034.vdocument.in/reader034/viewer/2022042607/558a60f6d8b42a6e6e8b463a/html5/thumbnails/14.jpg)
Deformable Part Model
Pedro F. Felzenszwalb et al [PAMI 2010]
![Page 15: Image representation usage guide](https://reader034.vdocument.in/reader034/viewer/2022042607/558a60f6d8b42a6e6e8b463a/html5/thumbnails/15.jpg)
Neural Network
Tanh & Sigmoid !nonlinear function
![Page 16: Image representation usage guide](https://reader034.vdocument.in/reader034/viewer/2022042607/558a60f6d8b42a6e6e8b463a/html5/thumbnails/16.jpg)
Convolution Neural Network
LeCun 1989
Krizhevsky et al. [NIPS2012]
ReLu
![Page 17: Image representation usage guide](https://reader034.vdocument.in/reader034/viewer/2022042607/558a60f6d8b42a6e6e8b463a/html5/thumbnails/17.jpg)
State of the Art
GoogleNet 2014
MSRA2014
![Page 18: Image representation usage guide](https://reader034.vdocument.in/reader034/viewer/2022042607/558a60f6d8b42a6e6e8b463a/html5/thumbnails/18.jpg)
Deepness Table
Convolution Neural Network & Deformable Part Model use max pooling, !others use sum pooling or say histogram pooling
![Page 19: Image representation usage guide](https://reader034.vdocument.in/reader034/viewer/2022042607/558a60f6d8b42a6e6e8b463a/html5/thumbnails/19.jpg)
Image Representation Usage Guide
1
1.5
2
2.5
3
3.5
4
4.5
5
iPhone 5s Tegra K1 or PC Geforce Titan HPC
Real Time ApplicationInteractive Application
Color Histogram
SIFT/HOG
Bag of Visual Word
BoW+ SPM, Deformable Part Model
Convolution Neural Network
45 gflops 370 gflops 5.1 tera flops
Deepness
gflops for Single Precision, PC: i7 3.5G 4 cores parellel computing
TRAINING TIME NOT INCLUDED
![Page 20: Image representation usage guide](https://reader034.vdocument.in/reader034/viewer/2022042607/558a60f6d8b42a6e6e8b463a/html5/thumbnails/20.jpg)
Some Tips❖ GPU ~= 50 CPU Cores!
❖ Hand Crafted Feature is shallow, higher feature template need to be learnt.!
❖ Do Dimensionality Reduction!
❖ Deeper Features, More Training Data!
❖ Handle Invariance: Registration vs Spatial Pooling!
❖ The Learnt Deep Representation(CNN) is shareable