ima generative short - ucla statisticssczhu/talks/ima_generative_short.pdf · 2012-11-02 · 5 ima...

15
IMA workshop on Visual Learning and Recognition, May 2006, Generative models Song-Chun Zhu IMA workshop on Visual Learning and Recognition, May 2006, Modeling One fundamental problem, in my opinion, that we are facing today is to represent the enormous amount of visual knowledge needed at all levels of vision. modeling = visual knowledge representation More specifically, it is to distill data (raw images, video) into visual knowledge in terms of various kind of models.

Upload: others

Post on 03-Mar-2020

8 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: IMA generative short - UCLA Statisticssczhu/Talks/IMA_generative_short.pdf · 2012-11-02 · 5 IMA workshop on Visual Learning and Recognition, May 2006, Learning generative models

1

IMA workshop on Visual Learning and Recognition, May 2006,

Generative models

Song-Chun Zhu

IMA workshop on Visual Learning and Recognition, May 2006,

Modeling

One fundamental problem, in my opinion, that we are facing today is to

represent the enormous amount of visual knowledge

needed at all levels of vision.

modeling = visual knowledge representation

More specifically, it is to distill data (raw images, video) into visual knowledge in terms of various kind of models.

Page 2: IMA generative short - UCLA Statisticssczhu/Talks/IMA_generative_short.pdf · 2012-11-02 · 5 IMA workshop on Visual Learning and Recognition, May 2006, Learning generative models

2

IMA workshop on Visual Learning and Recognition, May 2006,

Visual Knowledge

There are two kinds of visual knowledge

1. Representational knowledge:---hierarchical generative models

--- descriptive models, such as MRF

2. Computational knowledge: --- discriminative models

(a) Visual vocabulary --- dictionaries at all levels of vision

(b) Spatial relations and context

discriminative features for various variables ordering of bottom-up tests …

IMA workshop on Visual Learning and Recognition, May 2006,

Three families of models in vision

Generative modelsHierarchic models, Grammar (SCFG) Wavelet/sparse coding…

Descriptive modelsMarkov random fieldsGraphical models,Mixed Markov fields

Discriminative methodsClustering/detection,Adaboosting,SVM, …

context sensitive graph grammar etc.

Data-Driven

Markov chain

Monte Carloconditionalrandom fields

Page 3: IMA generative short - UCLA Statisticssczhu/Talks/IMA_generative_short.pdf · 2012-11-02 · 5 IMA workshop on Visual Learning and Recognition, May 2006, Learning generative models

3

IMA workshop on Visual Learning and Recognition, May 2006,

Descriptive models1. A flat graph representation G=<V, E>.

G could be directed, undirected, such as chain, tree, DAG, lattice, etc.the nodes in the graph are at the same semantic level.

2. hard constraints or soft “energy” between vertices for regularity and context

Examples, constraint-satisfaction, line drawing interpretation, scene labeling, deformable templates,image restoration, segmentation, graph partition/coloring, shape from stereo/motion/shading

IMA workshop on Visual Learning and Recognition, May 2006,

Learning descriptive models

Conceptualization and Modeling with Descriptive models

Examples:Gibbs,MRF, FRAME,Mixed Markov model

Model pursuit: choose informative features and statistics to minimize the log-volume of the set or Shannon entropy of the model --- minimax entropy principle.

Page 4: IMA generative short - UCLA Statisticssczhu/Talks/IMA_generative_short.pdf · 2012-11-02 · 5 IMA workshop on Visual Learning and Recognition, May 2006, Learning generative models

4

IMA workshop on Visual Learning and Recognition, May 2006,

Example

Iobs Isyn ~ Ω(h) k=0 Isyn ~ Ω(h) k=1 Isyn ~ Ω(h) k=3 Isyn ~ Ω(h) k=7Isyn ~ Ω(h) k=4

The same procedure has been applied to various of levels of context models:

textures, texton process, simple shapes, scene context, …

Markov random fields and Gibbs models on pixels

IMA workshop on Visual Learning and Recognition, May 2006,

Generative model

A hierarchic graph representationthe nodes in the graph are at different semantic levels.the edges show the decomposition.

hidden variables

observables

Examples, wavelets, sparse coding, stochastic context free grammar, …,

levels of dictionaries

Page 5: IMA generative short - UCLA Statisticssczhu/Talks/IMA_generative_short.pdf · 2012-11-02 · 5 IMA workshop on Visual Learning and Recognition, May 2006, Learning generative models

5

IMA workshop on Visual Learning and Recognition, May 2006,

Learning generative models

Model pursuit: choose optimal dictionary (epsilon-balls) to cover the maximum probability mass or minimize the Kolmogorov entropy

--- again, minimax entropyIt is closely related to binding with maximum mutual information.

A generative model is a joint probability with a series of dictionaries

It has to be a descriptive model

Key concepts:A dictionary is a set of epsilon balls (manifolds)A hidden variable is an index to the dictionary.

IMA workshop on Visual Learning and Recognition, May 2006,

Integrated generative model

Generative methods Descriptive methodscontext sensitive graph grammar etc.

Necessity is obvious: we need both hierarchic composition and context.

But how?1. When do we represent regularity by a constraint (descriptive)

or by a production rule (generative)?2. How to handle the continuous transition between

texture (descriptive) and structures/shapes (generative)?

Page 6: IMA generative short - UCLA Statisticssczhu/Talks/IMA_generative_short.pdf · 2012-11-02 · 5 IMA workshop on Visual Learning and Recognition, May 2006, Learning generative models

6

IMA workshop on Visual Learning and Recognition, May 2006,

Leaves at a range of scales

DB

A

C

This picture contains trees/leavesat four ranges of distance, overwhich our perception changes.

A: see individual leaves with sharp edge/boundary(occlusion model)

B: see leaves but blurry edge(additive model)

C: see a texture impression(MRF)

D: see constant area(iid Gaussian)

IMA workshop on Visual Learning and Recognition, May 2006,

Model regime transitions in scale space

We need a seamless transition between different regimes of models

scale 1 scale 2 scale 3 scale 4

scale 5 scale 6 scale 7

Page 7: IMA generative short - UCLA Statisticssczhu/Talks/IMA_generative_short.pdf · 2012-11-02 · 5 IMA workshop on Visual Learning and Recognition, May 2006, Learning generative models

7

IMA workshop on Visual Learning and Recognition, May 2006,

By analogy: A picture of the universe

At different temperatureswe observe differententropy patterns.

At different scales,different forces rulethe systems.

A photo from Cosmology. Our image space is very much like this, it contains patterns of wide range of entropy regimes.

IMA workshop on Visual Learning and Recognition, May 2006,

Integrated learning

Page 8: IMA generative short - UCLA Statisticssczhu/Talks/IMA_generative_short.pdf · 2012-11-02 · 5 IMA workshop on Visual Learning and Recognition, May 2006, Learning generative models

8

IMA workshop on Visual Learning and Recognition, May 2006,

MRF0pixels

∆2

MRF2parts

∆3MRF3objects

∆4scene

generative

descriptive

Entro

pyor

codin

g len

gthlearning steps

Augmentation of the Integrated generative model

The pursuit of an integrated model is to minimize the Shannon and Kolmogorov entropy in turns,

1

1MRF∆

primitives

IMA workshop on Visual Learning and Recognition, May 2006,

A low level example: primal sketch

sketching pursuit process

sketch imagesynthesized textures

org image

syn image

+=

sketches

Page 9: IMA generative short - UCLA Statisticssczhu/Talks/IMA_generative_short.pdf · 2012-11-02 · 5 IMA workshop on Visual Learning and Recognition, May 2006, Learning generative models

9

IMA workshop on Visual Learning and Recognition, May 2006,

Examples of the dictionary of image primitives

IMA workshop on Visual Learning and Recognition, May 2006,

Example: primal sketch

Spatial MRF

Texture MRF

Page 10: IMA generative short - UCLA Statisticssczhu/Talks/IMA_generative_short.pdf · 2012-11-02 · 5 IMA workshop on Visual Learning and Recognition, May 2006, Learning generative models

10

IMA workshop on Visual Learning and Recognition, May 2006,

Middle-level example: the dictionary of graphlets

A graphlet is a graph composed of 2-8 image primitives with open bonds.

IMA workshop on Visual Learning and Recognition, May 2006,

Middle level example

Page 11: IMA generative short - UCLA Statisticssczhu/Talks/IMA_generative_short.pdf · 2012-11-02 · 5 IMA workshop on Visual Learning and Recognition, May 2006, Learning generative models

11

IMA workshop on Visual Learning and Recognition, May 2006,

High level examples: the dictionary of human figure

IMA workshop on Visual Learning and Recognition, May 2006,

Composing the parts

Each sub-template is a vertex in a composite template, and verticesare connected through “bonds”.

g2

g1

11β

12β13β

21β

33β32β

31β23β

22β

g3

24β25β

Page 12: IMA generative short - UCLA Statisticssczhu/Talks/IMA_generative_short.pdf · 2012-11-02 · 5 IMA workshop on Visual Learning and Recognition, May 2006, Learning generative models

12

IMA workshop on Visual Learning and Recognition, May 2006,

The ideas of binding, composition, hierarchical, and reusable parts

A

B C

a ccb

Or-node

And-node

leaf -node

A grammar rule:

IMA workshop on Visual Learning and Recognition, May 2006,

Embedding the integrated models into an And-Or graph

6

8

1 10

some graph configurations generated by the And-Or graph

<B,C>

<C,D>

<C,D>

<B,C>

<N,O

>

A

B

KJI

P

GFE

DC

NL

T

H

SRQ

O

1 8765432 11109

and-nodeor-node

leaf node

M

U

<B,C>

<L,M>

2

4

9

9

<C, D

>

Page 13: IMA generative short - UCLA Statisticssczhu/Talks/IMA_generative_short.pdf · 2012-11-02 · 5 IMA workshop on Visual Learning and Recognition, May 2006, Learning generative models

13

IMA workshop on Visual Learning and Recognition, May 2006,

IMA workshop on Visual Learning and Recognition, May 2006,

Examples of new configurations (synthesis)

Note that the number of sub-templates and their connections change.Each is a possible “configuration”.

Page 14: IMA generative short - UCLA Statisticssczhu/Talks/IMA_generative_short.pdf · 2012-11-02 · 5 IMA workshop on Visual Learning and Recognition, May 2006, Learning generative models

14

IMA workshop on Visual Learning and Recognition, May 2006,

Examples of human upper-cloth representation

IMA workshop on Visual Learning and Recognition, May 2006,

Examples of human upper-cloth representation

Page 15: IMA generative short - UCLA Statisticssczhu/Talks/IMA_generative_short.pdf · 2012-11-02 · 5 IMA workshop on Visual Learning and Recognition, May 2006, Learning generative models

15

IMA workshop on Visual Learning and Recognition, May 2006,

Back to history

In the 1980s, we have a popular model for low and middle level vision(Geman and Geman, Blake and Zisserman, Koch and Poggio, Mumford and Shah)

Three questions:1. Why is the potential quadratic?

2. Why is the gradient operator?

3. Where is the concept of edge from?