object recognition: history and overvie · 2008. 9. 9. · object recognition: history and overview...

45
Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce

Upload: others

Post on 27-Aug-2020

4 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Object Recognition: History and Overvie · 2008. 9. 9. · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce. Outline

Object Recognition: History and Overview

Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce

Page 2: Object Recognition: History and Overvie · 2008. 9. 9. · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce. Outline

OutlineWhat is object category recognition?History of object representation

• Geometric• Global appearance-based• Sliding window• Indexing with local features• Constellation models• Bags of features

Issues in statistical recognition• Generative vs. discriminative models• Supervised vs. unsupervised• Different kinds of recognition tasks• Datasets

Page 3: Object Recognition: History and Overvie · 2008. 9. 9. · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce. Outline

How many object categories are there?

Biederman 1987

Page 4: Object Recognition: History and Overvie · 2008. 9. 9. · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce. Outline
Page 5: Object Recognition: History and Overvie · 2008. 9. 9. · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce. Outline

OBJECTS

ANIMALS INANIMATEPLANTS

MAN-MADENATURALVERTEBRATE…..

MAMMALS BIRDS

GROUSEBOARTAPIR CAMERA

Page 6: Object Recognition: History and Overvie · 2008. 9. 9. · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce. Outline

So what does object recognition involve?

Page 7: Object Recognition: History and Overvie · 2008. 9. 9. · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce. Outline

Verification: is that a lamp?

Page 8: Object Recognition: History and Overvie · 2008. 9. 9. · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce. Outline

Detection: are there people?

Page 9: Object Recognition: History and Overvie · 2008. 9. 9. · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce. Outline

Identification: is that Potala Palace?

Page 10: Object Recognition: History and Overvie · 2008. 9. 9. · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce. Outline

Object categorization

mountain

buildingtree

banner

vendorpeople

street lamp

Page 11: Object Recognition: History and Overvie · 2008. 9. 9. · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce. Outline

Scene and context categorization

• outdoor• city• …

Page 12: Object Recognition: History and Overvie · 2008. 9. 9. · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce. Outline

Variability: Camera positionIlluminationInternal parameters

Within-class variations

Modeling variability

Page 13: Object Recognition: History and Overvie · 2008. 9. 9. · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce. Outline

Within-class variations

Page 14: Object Recognition: History and Overvie · 2008. 9. 9. · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce. Outline

Variability:Camera positionIlluminationInternal parameters

θ

Alignment

Roberts (1965); Lowe (1987); Faugeras & Hebert (1986); Grimson & Lozano-Perez (1986); Huttenlocher & Ullman (1987)

Shape: assumed known

Page 15: Object Recognition: History and Overvie · 2008. 9. 9. · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce. Outline

Recall: Alignment

• Alignment: fitting a model to a transformation between pairs of features (matches) in two images

∑ ′i

ii xxT )),((residual

Find transformation Tthat minimizesT

xixi

'

Page 16: Object Recognition: History and Overvie · 2008. 9. 9. · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce. Outline

Recall: Origins of computer vision

L. G. Roberts, Machine Perception of Three Dimensional Solids,Ph.D. thesis, MIT Department of Electrical Engineering, 1963.

Page 17: Object Recognition: History and Overvie · 2008. 9. 9. · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce. Outline

Alignment: Huttenlocher & Ullman (1987)

Page 18: Object Recognition: History and Overvie · 2008. 9. 9. · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce. Outline

Variability Camera positionIlluminationInternal parameters

Invariance to:

Duda & Hart ( 1972); Weiss (1987); Mundy et al. (1992-94);Rothwell et al. (1992); Burns et al. (1993)

Page 19: Object Recognition: History and Overvie · 2008. 9. 9. · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce. Outline

General 3D objects do not admit monocular viewpoint invariants (Burns et al., 1993)

Projective invariants (Rothwell et al., 1992):

Example: invariant to similarity transformations computed from four points

A

B

CD

Page 20: Object Recognition: History and Overvie · 2008. 9. 9. · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce. Outline

ACRONYM (Brooks and Binford, 1981)

Representing and recognizing object categoriesis harder...

Binford (1971), Nevatia & Binford (1972), Marr & Nishihara (1978)

Page 21: Object Recognition: History and Overvie · 2008. 9. 9. · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce. Outline

Recognition by components

Geons (Biederman 1987)

???

Page 22: Object Recognition: History and Overvie · 2008. 9. 9. · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce. Outline

Zisserman et al. (1995)

Generalized cylindersPonce et al. (1989)

Forsyth (2000)

General shape primitives?

Page 23: Object Recognition: History and Overvie · 2008. 9. 9. · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce. Outline

Empirical models of image variability

Appearance-based techniques

Turk & Pentland (1991); Murase & Nayar (1995); etc.

Page 24: Object Recognition: History and Overvie · 2008. 9. 9. · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce. Outline

Eigenfaces (Turk & Pentland, 1991)

Page 25: Object Recognition: History and Overvie · 2008. 9. 9. · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce. Outline

Color Histograms

Swain and Ballard, Color Indexing, IJCV 1991.

Page 26: Object Recognition: History and Overvie · 2008. 9. 9. · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce. Outline

H. Murase and S. Nayar, Visual learning and recognition of 3-d objects from appearance, IJCV 1995

Appearance manifolds

Page 27: Object Recognition: History and Overvie · 2008. 9. 9. · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce. Outline

Limitations of global appearance models

• Can work on relatively simple patterns

• Not robust to clutter, occlusion, lighting changes

Page 28: Object Recognition: History and Overvie · 2008. 9. 9. · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce. Outline

Sliding window approaches

• Turk and Pentland, 1991• Belhumeur, Hespanha, &

Kriegman, 1997• Schneiderman & Kanade 2004• Viola and Jones, 2000

• Schneiderman & Kanade, 2004• Argawal and Roth, 2002• Poggio et al. 1993

Page 29: Object Recognition: History and Overvie · 2008. 9. 9. · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce. Outline

– Scale / orientation range to search over – Speed– Context

Sliding window approaches

Page 30: Object Recognition: History and Overvie · 2008. 9. 9. · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce. Outline

Hoi

em, E

fros,

Her

bert,

200

6

Context

Page 31: Object Recognition: History and Overvie · 2008. 9. 9. · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce. Outline

Lowe’02

Mahamud & Hebert’03

Local featuresCombining local appearance, spatial constraints, invariants, and classification techniques from machine learning.

Schmid & Mohr’97

Page 32: Object Recognition: History and Overvie · 2008. 9. 9. · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce. Outline

Local features for recognition of object instances

Page 33: Object Recognition: History and Overvie · 2008. 9. 9. · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce. Outline

• Lowe, et al. 1999, 2003• Mahamud and Hebert, 2000• Ferrari, Tuytelaars, and Van Gool, 2004• Rothganger, Lazebnik, and Ponce, 2004• Moreels and Perona, 2005• …

Local features for recognition of object instances

Page 34: Object Recognition: History and Overvie · 2008. 9. 9. · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce. Outline

Representing categories: Parts and Structure

Weber, Welling & Perona (2000), Fergus, Perona & Zisserman (2003)

Page 35: Object Recognition: History and Overvie · 2008. 9. 9. · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce. Outline

Parts-and-shape representation• Model:

– Object as a set of parts– Relative locations between parts– Appearance of part

Figure from [Fischler & Elschlager 73]

Page 36: Object Recognition: History and Overvie · 2008. 9. 9. · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce. Outline

ObjectObject Bag of Bag of ‘‘wordswords’’

Bag-of-features models

Page 37: Object Recognition: History and Overvie · 2008. 9. 9. · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce. Outline

Objects as texture• All of these are treated as being the same

• No distinction between foreground and background: scene recognition?

Page 38: Object Recognition: History and Overvie · 2008. 9. 9. · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce. Outline

Today: A comeback for global models?

• The “gist” of a scene: Oliva & Torralba (2001)

Page 39: Object Recognition: History and Overvie · 2008. 9. 9. · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce. Outline

J. Hays and A. Efros, Scene Completion using Millions of Photographs, SIGGRAPH 2007

Page 40: Object Recognition: History and Overvie · 2008. 9. 9. · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce. Outline

NIPS 2007

Page 41: Object Recognition: History and Overvie · 2008. 9. 9. · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce. Outline

Timeline of recognition• 1965-late 1980s: alignment, geometric primitives• Early 1990s: invariants, appearance-based

methods• Mid-late 1990s: sliding window approaches• Late 1990s: feature-based methods• Early 2000s: parts-and-shape models• 2003 – present: bags of features• Present trends: combination of local and global

methods, modeling context, integrating recognition and segmentation

Page 42: Object Recognition: History and Overvie · 2008. 9. 9. · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce. Outline

What “works” today

• Reading license plates, zip codes, checks

Page 43: Object Recognition: History and Overvie · 2008. 9. 9. · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce. Outline

What “works” today

• Reading license plates, zip codes, checks• Fingerprint recognition

Page 44: Object Recognition: History and Overvie · 2008. 9. 9. · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce. Outline

What “works” today

• Reading license plates, zip codes, checks• Fingerprint recognition• Face detection

Page 45: Object Recognition: History and Overvie · 2008. 9. 9. · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce. Outline

What “works” today

• Reading license plates, zip codes, checks• Fingerprint recognition• Face detection• Recognition of flat textured objects (CD covers,

book covers, etc.)