recognizing and learning object categories: year...

47
Li Fei-Fei, Princeton Rob Fergus, MIT Antonio Torralba, MIT Recognizing and Learning Recognizing and Learning Object Categories: Year 2007 Object Categories: Year 2007 CVPR 2007 Minneapolis, Short Course, June 17 CVPR 2007 Minneapolis, Short Course, June 17

Upload: others

Post on 26-Apr-2020

8 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Recognizing and Learning Object Categories: Year …cs.haifa.ac.il/~dkeren/recognition/categories.pdfLi Fei-Fei, Princeton Rob Fergus, MIT Antonio Torralba, MIT Recognizing and Learning

Li Fei-Fei, Princeton

Rob Fergus, MITAntonio Torralba, MIT

Recognizing and Learning Recognizing and Learning Object Categories: Year 2007Object Categories: Year 2007

CVPR 2007 Minneapolis, Short Course, June 17CVPR 2007 Minneapolis, Short Course, June 17

Page 2: Recognizing and Learning Object Categories: Year …cs.haifa.ac.il/~dkeren/recognition/categories.pdfLi Fei-Fei, Princeton Rob Fergus, MIT Antonio Torralba, MIT Recognizing and Learning

AgendaAgenda

• Introduction

• Bag-of-words models

• Part-based models

• Discriminative methods

• Segmentation and recognition

• Datasets & Conclusions

Page 3: Recognizing and Learning Object Categories: Year …cs.haifa.ac.il/~dkeren/recognition/categories.pdfLi Fei-Fei, Princeton Rob Fergus, MIT Antonio Torralba, MIT Recognizing and Learning
Page 4: Recognizing and Learning Object Categories: Year …cs.haifa.ac.il/~dkeren/recognition/categories.pdfLi Fei-Fei, Princeton Rob Fergus, MIT Antonio Torralba, MIT Recognizing and Learning

perceptibleperceptible visionvision materialmaterialthingthing

Page 5: Recognizing and Learning Object Categories: Year …cs.haifa.ac.il/~dkeren/recognition/categories.pdfLi Fei-Fei, Princeton Rob Fergus, MIT Antonio Torralba, MIT Recognizing and Learning

Plato said…• Ordinary objects are classified together if they

`participate' in the same abstract Form, such as the Form of a Human or the Form of Quartz.

• Forms are proper subjects of philosophical investigation, for they have the highest degree of reality.

• Ordinary objects, such as humans, trees, and stones, have a lower degree of reality than the Forms.

• Fictions, shadows, and the like have a still lower degree of reality than ordinary objects and so are not proper subjects of philosophical enquiry.

Page 6: Recognizing and Learning Object Categories: Year …cs.haifa.ac.il/~dkeren/recognition/categories.pdfLi Fei-Fei, Princeton Rob Fergus, MIT Antonio Torralba, MIT Recognizing and Learning

Bruegel, 1564

Page 7: Recognizing and Learning Object Categories: Year …cs.haifa.ac.il/~dkeren/recognition/categories.pdfLi Fei-Fei, Princeton Rob Fergus, MIT Antonio Torralba, MIT Recognizing and Learning

How many object categories are there?

Biederman 1987

Page 8: Recognizing and Learning Object Categories: Year …cs.haifa.ac.il/~dkeren/recognition/categories.pdfLi Fei-Fei, Princeton Rob Fergus, MIT Antonio Torralba, MIT Recognizing and Learning

So what does object recognition involve?

Page 9: Recognizing and Learning Object Categories: Year …cs.haifa.ac.il/~dkeren/recognition/categories.pdfLi Fei-Fei, Princeton Rob Fergus, MIT Antonio Torralba, MIT Recognizing and Learning

Verification: is that a lamp?

Page 10: Recognizing and Learning Object Categories: Year …cs.haifa.ac.il/~dkeren/recognition/categories.pdfLi Fei-Fei, Princeton Rob Fergus, MIT Antonio Torralba, MIT Recognizing and Learning

Detection: are there people?

Page 11: Recognizing and Learning Object Categories: Year …cs.haifa.ac.il/~dkeren/recognition/categories.pdfLi Fei-Fei, Princeton Rob Fergus, MIT Antonio Torralba, MIT Recognizing and Learning

Identification: is that Potala Palace?

Page 12: Recognizing and Learning Object Categories: Year …cs.haifa.ac.il/~dkeren/recognition/categories.pdfLi Fei-Fei, Princeton Rob Fergus, MIT Antonio Torralba, MIT Recognizing and Learning

Object categorization

mountain

building

tree

banner

vendorpeople

street lamp

Page 13: Recognizing and Learning Object Categories: Year …cs.haifa.ac.il/~dkeren/recognition/categories.pdfLi Fei-Fei, Princeton Rob Fergus, MIT Antonio Torralba, MIT Recognizing and Learning

Scene and context categorization

• outdoor

• city

• …

Page 14: Recognizing and Learning Object Categories: Year …cs.haifa.ac.il/~dkeren/recognition/categories.pdfLi Fei-Fei, Princeton Rob Fergus, MIT Antonio Torralba, MIT Recognizing and Learning

Computational photography

Page 15: Recognizing and Learning Object Categories: Year …cs.haifa.ac.il/~dkeren/recognition/categories.pdfLi Fei-Fei, Princeton Rob Fergus, MIT Antonio Torralba, MIT Recognizing and Learning

Assisted driving

meters

met

ers

Ped

Ped

Car

Lane detection

Pedestrian and car detection

• Collision warning systems with adaptive cruise control, • Lane departure warning systems, • Rear object detection systems,

Page 16: Recognizing and Learning Object Categories: Year …cs.haifa.ac.il/~dkeren/recognition/categories.pdfLi Fei-Fei, Princeton Rob Fergus, MIT Antonio Torralba, MIT Recognizing and Learning

Improving online search

Query:STREET

Organizing photo collections

Page 17: Recognizing and Learning Object Categories: Year …cs.haifa.ac.il/~dkeren/recognition/categories.pdfLi Fei-Fei, Princeton Rob Fergus, MIT Antonio Torralba, MIT Recognizing and Learning

Challenges 1: view point variation

Michelangelo 1475-1564

Page 18: Recognizing and Learning Object Categories: Year …cs.haifa.ac.il/~dkeren/recognition/categories.pdfLi Fei-Fei, Princeton Rob Fergus, MIT Antonio Torralba, MIT Recognizing and Learning

Challenges 2: illumination

slide credit: S. Ullman

Page 19: Recognizing and Learning Object Categories: Year …cs.haifa.ac.il/~dkeren/recognition/categories.pdfLi Fei-Fei, Princeton Rob Fergus, MIT Antonio Torralba, MIT Recognizing and Learning

Challenges 3: occlusion

Magritte, 1957

Page 20: Recognizing and Learning Object Categories: Year …cs.haifa.ac.il/~dkeren/recognition/categories.pdfLi Fei-Fei, Princeton Rob Fergus, MIT Antonio Torralba, MIT Recognizing and Learning

Challenges 4: scale

Page 21: Recognizing and Learning Object Categories: Year …cs.haifa.ac.il/~dkeren/recognition/categories.pdfLi Fei-Fei, Princeton Rob Fergus, MIT Antonio Torralba, MIT Recognizing and Learning

Challenges 5: deformation

Xu, Beihong 1943

Page 22: Recognizing and Learning Object Categories: Year …cs.haifa.ac.il/~dkeren/recognition/categories.pdfLi Fei-Fei, Princeton Rob Fergus, MIT Antonio Torralba, MIT Recognizing and Learning

Challenges 6: background clutter

Klimt, 1913

Page 23: Recognizing and Learning Object Categories: Year …cs.haifa.ac.il/~dkeren/recognition/categories.pdfLi Fei-Fei, Princeton Rob Fergus, MIT Antonio Torralba, MIT Recognizing and Learning

History: single object recognition

Page 24: Recognizing and Learning Object Categories: Year …cs.haifa.ac.il/~dkeren/recognition/categories.pdfLi Fei-Fei, Princeton Rob Fergus, MIT Antonio Torralba, MIT Recognizing and Learning

History: single object recognition

• Lowe, et al. 1999, 2003

• Mahamud and Herbert, 2000• Ferrari, Tuytelaars, and Van Gool, 2004• Rothganger, Lazebnik, and Ponce, 2004• Moreels and Perona, 2005• …

Page 25: Recognizing and Learning Object Categories: Year …cs.haifa.ac.il/~dkeren/recognition/categories.pdfLi Fei-Fei, Princeton Rob Fergus, MIT Antonio Torralba, MIT Recognizing and Learning

Challenges 7: intra-class variation

Page 26: Recognizing and Learning Object Categories: Year …cs.haifa.ac.il/~dkeren/recognition/categories.pdfLi Fei-Fei, Princeton Rob Fergus, MIT Antonio Torralba, MIT Recognizing and Learning

History: early object categorization

Page 27: Recognizing and Learning Object Categories: Year …cs.haifa.ac.il/~dkeren/recognition/categories.pdfLi Fei-Fei, Princeton Rob Fergus, MIT Antonio Torralba, MIT Recognizing and Learning

• Turk and Pentland, 1991• Belhumeur, Hespanha, &

Kriegman, 1997• Schneiderman & Kanade 2004• Viola and Jones, 2000

• Amit and Geman, 1999• LeCun et al. 1998• Belongie and Malik, 2002

• Schneiderman & Kanade, 2004• Argawal and Roth, 2002• Poggio et al. 1993

Page 28: Recognizing and Learning Object Categories: Year …cs.haifa.ac.il/~dkeren/recognition/categories.pdfLi Fei-Fei, Princeton Rob Fergus, MIT Antonio Torralba, MIT Recognizing and Learning
Page 29: Recognizing and Learning Object Categories: Year …cs.haifa.ac.il/~dkeren/recognition/categories.pdfLi Fei-Fei, Princeton Rob Fergus, MIT Antonio Torralba, MIT Recognizing and Learning

Object categorization: Object categorization: the statistical viewpointthe statistical viewpoint

)|( imagezebrap

)( ezebra|imagnopvs.

• Bayes rule:

)(

)(

)|(

)|(

)|(

)|(

zebranop

zebrap

zebranoimagep

zebraimagep

imagezebranop

imagezebrap⋅=

posterior ratio likelihood ratio prior ratio

Page 30: Recognizing and Learning Object Categories: Year …cs.haifa.ac.il/~dkeren/recognition/categories.pdfLi Fei-Fei, Princeton Rob Fergus, MIT Antonio Torralba, MIT Recognizing and Learning

Object categorization: Object categorization: the statistical viewpointthe statistical viewpoint

)(

)(

)|(

)|(

)|(

)|(

zebranop

zebrap

zebranoimagep

zebraimagep

imagezebranop

imagezebrap⋅=

posterior ratio likelihood ratio prior ratio

• Discriminative methods model posterior

• Generative methods model likelihood and prior

Page 31: Recognizing and Learning Object Categories: Year …cs.haifa.ac.il/~dkeren/recognition/categories.pdfLi Fei-Fei, Princeton Rob Fergus, MIT Antonio Torralba, MIT Recognizing and Learning

Discriminative

• Direct modeling of

Zebra

Non-zebra

Decisionboundary

)|(

)|(

imagezebranop

imagezebrap

Page 32: Recognizing and Learning Object Categories: Year …cs.haifa.ac.il/~dkeren/recognition/categories.pdfLi Fei-Fei, Princeton Rob Fergus, MIT Antonio Torralba, MIT Recognizing and Learning

• Model and

Generative)|( zebraimagep ) |( zebranoimagep

Middle�LowHigh

MiddleLow

)|( zebranoimagep)|( zebraimagep

Page 33: Recognizing and Learning Object Categories: Year …cs.haifa.ac.il/~dkeren/recognition/categories.pdfLi Fei-Fei, Princeton Rob Fergus, MIT Antonio Torralba, MIT Recognizing and Learning

Three main issuesThree main issues

• Representation– How to represent an object category

• Learning– How to form the classifier, given training data

• Recognition– How the classifier is to be used on novel data

Page 34: Recognizing and Learning Object Categories: Year …cs.haifa.ac.il/~dkeren/recognition/categories.pdfLi Fei-Fei, Princeton Rob Fergus, MIT Antonio Torralba, MIT Recognizing and Learning

Representation

– Generative / discriminative / hybrid

Page 35: Recognizing and Learning Object Categories: Year …cs.haifa.ac.il/~dkeren/recognition/categories.pdfLi Fei-Fei, Princeton Rob Fergus, MIT Antonio Torralba, MIT Recognizing and Learning

Representation

– Generative / discriminative / hybrid

– Appearance only or location and appearance

Page 36: Recognizing and Learning Object Categories: Year …cs.haifa.ac.il/~dkeren/recognition/categories.pdfLi Fei-Fei, Princeton Rob Fergus, MIT Antonio Torralba, MIT Recognizing and Learning

Representation

– Generative / discriminative / hybrid

– Appearance only or location and appearance

– Invariances• View point• Illumination• Occlusion• Scale• Deformation• Clutter• etc.

Page 37: Recognizing and Learning Object Categories: Year …cs.haifa.ac.il/~dkeren/recognition/categories.pdfLi Fei-Fei, Princeton Rob Fergus, MIT Antonio Torralba, MIT Recognizing and Learning

Representation

– Generative / discriminative / hybrid

– Appearance only or location and appearance

– invariances– Part-based or global

w/sub-window

Page 38: Recognizing and Learning Object Categories: Year …cs.haifa.ac.il/~dkeren/recognition/categories.pdfLi Fei-Fei, Princeton Rob Fergus, MIT Antonio Torralba, MIT Recognizing and Learning

Representation

– Generative / discriminative / hybrid

– Appearance only or location and appearance

– invariances– Parts or global w/sub-

window– Use set of features or

each pixel in image

Page 39: Recognizing and Learning Object Categories: Year …cs.haifa.ac.il/~dkeren/recognition/categories.pdfLi Fei-Fei, Princeton Rob Fergus, MIT Antonio Torralba, MIT Recognizing and Learning

– Unclear how to model categories, so we learn what distinguishes them rather than manually specify the difference -- hence current interest in machine learning

Learning

Page 40: Recognizing and Learning Object Categories: Year …cs.haifa.ac.il/~dkeren/recognition/categories.pdfLi Fei-Fei, Princeton Rob Fergus, MIT Antonio Torralba, MIT Recognizing and Learning

– Unclear how to model categories, so we learn what distinguishes them rather than manually specify the difference -- hence current interest in machine learning)

– Methods of training: generative vs. discriminative

Learning

0 0.2 0.4 0.6 0.8 10

1

2

3

4

5

clas

s de

nsiti

es

p(x|C1)

p(x|C2)

x0 0.2 0.4 0.6 0.8 1

0

0.2

0.4

0.6

0.8

1

post

erio

r pr

obab

ilitie

s

x

p(C1|x) p(C

2|x)

Page 41: Recognizing and Learning Object Categories: Year …cs.haifa.ac.il/~dkeren/recognition/categories.pdfLi Fei-Fei, Princeton Rob Fergus, MIT Antonio Torralba, MIT Recognizing and Learning

– Unclear how to model categories, so we learn what distinguishes them rather than manually specify the difference -- hence current interest in machine learning)

– What are you maximizing? Likelihood (Gen.) or performances on train/validation set (Disc.)

– Level of supervision• Manual segmentation; bounding box; image

labels; noisy labels

Learning

Contains a motorbike

Page 42: Recognizing and Learning Object Categories: Year …cs.haifa.ac.il/~dkeren/recognition/categories.pdfLi Fei-Fei, Princeton Rob Fergus, MIT Antonio Torralba, MIT Recognizing and Learning

– Unclear how to model categories, so we learn what distinguishes them rather than manually specify the difference -- hence current interest in machine learning)

– What are you maximizing? Likelihood (Gen.) or performances on train/validation set (Disc.)

– Level of supervision• Manual segmentation; bounding box; image

labels; noisy labels– Batch/incremental (on category and image

level; user-feedback )

Learning

Page 43: Recognizing and Learning Object Categories: Year …cs.haifa.ac.il/~dkeren/recognition/categories.pdfLi Fei-Fei, Princeton Rob Fergus, MIT Antonio Torralba, MIT Recognizing and Learning

– Unclear how to model categories, so we learn what distinguishes them rather than manually specify the difference -- hence current interest in machine learning)

– What are you maximizing? Likelihood (Gen.) or performances on train/validation set (Disc.)

– Level of supervision• Manual segmentation; bounding box; image

labels; noisy labels– Batch/incremental (on category and image

level; user-feedback ) – Training images:

• Issue of overfitting• Negative images for discriminative methods

Priors

Learning

Page 44: Recognizing and Learning Object Categories: Year …cs.haifa.ac.il/~dkeren/recognition/categories.pdfLi Fei-Fei, Princeton Rob Fergus, MIT Antonio Torralba, MIT Recognizing and Learning

– Unclear how to model categories, so we learn what distinguishes them rather than manually specify the difference -- hence current interest in machine learning)

– What are you maximizing? Likelihood (Gen.) or performances on train/validation set (Disc.)

– Level of supervision• Manual segmentation; bounding box; image

labels; noisy labels– Batch/incremental (on category and image

level; user-feedback ) – Training images:

• Issue of overfitting• Negative images for discriminative methods

– Priors

Learning

Page 45: Recognizing and Learning Object Categories: Year …cs.haifa.ac.il/~dkeren/recognition/categories.pdfLi Fei-Fei, Princeton Rob Fergus, MIT Antonio Torralba, MIT Recognizing and Learning

– Scale / orientation range to search over

– Speed– Context

Recognition

Page 46: Recognizing and Learning Object Categories: Year …cs.haifa.ac.il/~dkeren/recognition/categories.pdfLi Fei-Fei, Princeton Rob Fergus, MIT Antonio Torralba, MIT Recognizing and Learning

Hoiem, Efros, Herbert, 2006

Page 47: Recognizing and Learning Object Categories: Year …cs.haifa.ac.il/~dkeren/recognition/categories.pdfLi Fei-Fei, Princeton Rob Fergus, MIT Antonio Torralba, MIT Recognizing and Learning

OBJECTS

ANIMALS INANIMATEPLANTS

MAN-MADENATURALVERTEBRATE…..

MAMMALS BIRDS

GROUSEBOARTAPIR CAMERA