Transcript
Page 1: Object Recognition: History and Overvielazebnik/spring10/lec16_recognition_intro.pdf · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio

Object Recognition: History and Overview

Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce

Page 2: Object Recognition: History and Overvielazebnik/spring10/lec16_recognition_intro.pdf · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio

How many visual object categories are there?

Biederman 1987

Page 3: Object Recognition: History and Overvielazebnik/spring10/lec16_recognition_intro.pdf · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio
Page 4: Object Recognition: History and Overvielazebnik/spring10/lec16_recognition_intro.pdf · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio

OBJECTS

ANIMALS INANIMATEPLANTS

MAN-MADENATURALVERTEBRATE…..

MAMMALS BIRDS

GROUSEBOARTAPIR CAMERA

Page 5: Object Recognition: History and Overvielazebnik/spring10/lec16_recognition_intro.pdf · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio

So what does object recognition involve?

Page 6: Object Recognition: History and Overvielazebnik/spring10/lec16_recognition_intro.pdf · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio

Scene categorization

• outdoor• city• …

Page 7: Object Recognition: History and Overvielazebnik/spring10/lec16_recognition_intro.pdf · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio

Image-level annotation: are there people?

• outdoor• city• …

Page 8: Object Recognition: History and Overvielazebnik/spring10/lec16_recognition_intro.pdf · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio

Object detection: where are the people?

Page 9: Object Recognition: History and Overvielazebnik/spring10/lec16_recognition_intro.pdf · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio

Image parsing

mountain

treebuilding

tree

bbanner

street lamp

vendor

street lamp

vendorpeople

Page 10: Object Recognition: History and Overvielazebnik/spring10/lec16_recognition_intro.pdf · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio

Modeling variability

Variability: Camera positionIlluminationShape parametersShape parameters

Within-class variations?

Page 11: Object Recognition: History and Overvielazebnik/spring10/lec16_recognition_intro.pdf · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio

Within-class variationsWithin class variations

Page 12: Object Recognition: History and Overvielazebnik/spring10/lec16_recognition_intro.pdf · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio

θθ

Variability: Camera positionIllumination Alignment

Shape: assumed known

Roberts (1965); Lowe (1987); Faugeras & Hebert (1986); Grimson & Lozano-Perez (1986); Huttenlocher & Ullman (1987)

p

Page 13: Object Recognition: History and Overvielazebnik/spring10/lec16_recognition_intro.pdf · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio

Recall: Alignment

• Alignment: fitting a model to a transformationAlignment: fitting a model to a transformation between pairs of features (matches) in two images

∑ ′

Find transformation Tthat minimizesT

xixi

'

∑ ′i

ii xxT )),((residual

Page 14: Object Recognition: History and Overvielazebnik/spring10/lec16_recognition_intro.pdf · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio

Recall: Origins of computer vision

L. G. Roberts, Machine Perception of Three Dimensional Solids,Ph.D. thesis, MIT Department of pElectrical Engineering, 1963.

Page 15: Object Recognition: History and Overvielazebnik/spring10/lec16_recognition_intro.pdf · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio

Alignment: Huttenlocher & Ullman (1987)

Page 16: Object Recognition: History and Overvielazebnik/spring10/lec16_recognition_intro.pdf · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio

Variability Camera positionInvariance to:y pIlluminationInternal parameters

Duda & Hart ( 1972); Weiss (1987); Mundy et al. (1992-94);Rothwell et al. (1992); Burns et al. (1993)

Page 17: Object Recognition: History and Overvielazebnik/spring10/lec16_recognition_intro.pdf · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio

Recall: invariant to similarity transformations computed from four Btransformations computed from four points C

D

A

D

Projective invariants (Rothwell et al., 1992):

General 3D objects do not admit monocular viewpoint invariants (Burns et al., 1993)

Page 18: Object Recognition: History and Overvielazebnik/spring10/lec16_recognition_intro.pdf · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio

Representing and recognizing object categoriesi h dis harder...

ACRONYM (Brooks and Binford 1981)ACRONYM (Brooks and Binford, 1981)Binford (1971), Nevatia & Binford (1972), Marr & Nishihara (1978)

Page 19: Object Recognition: History and Overvielazebnik/spring10/lec16_recognition_intro.pdf · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio

Recognition by componentsRecognition by components

???

Geons (Biederman 1987)

Page 20: Object Recognition: History and Overvielazebnik/spring10/lec16_recognition_intro.pdf · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio

General shape primitives?

Generali ed c lindersGeneralized cylindersPonce et al. (1989)

Zisserman et al. (1995) Forsyth (2000)

Page 21: Object Recognition: History and Overvielazebnik/spring10/lec16_recognition_intro.pdf · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio

Empirical models of image variability

Appearance-based techniques

Turk & Pentland (1991); Murase & Nayar (1995); etc.

Page 22: Object Recognition: History and Overvielazebnik/spring10/lec16_recognition_intro.pdf · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio

Eigenfaces (Turk & Pentland, 1991)

Page 23: Object Recognition: History and Overvielazebnik/spring10/lec16_recognition_intro.pdf · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio

Color HistogramsColor Histograms

Swain and Ballard, Color Indexing, IJCV 1991.

Page 24: Object Recognition: History and Overvielazebnik/spring10/lec16_recognition_intro.pdf · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio

Appearance manifoldspp

H. Murase and S. Nayar, Visual learning and recognition of 3-d objects from appearance, IJCV 1995

Page 25: Object Recognition: History and Overvielazebnik/spring10/lec16_recognition_intro.pdf · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio

Limitations of global appearance d lmodels

• Requires global registration of patternsRequires global registration of patterns• Not robust to clutter, occlusion, geometric

transformations

Page 26: Object Recognition: History and Overvielazebnik/spring10/lec16_recognition_intro.pdf · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio

Sliding window approachesSliding window approaches

• Turk and Pentland 1991Turk and Pentland, 1991• Belhumeur, Hespanha, &

Kriegman, 1997• Schneiderman & Kanade 2004Schneiderman & Kanade 2004• Viola and Jones, 2000

• Schneiderman & Kanade, 2004• Argawal and Roth, 2002• Poggio et al. 1993

Page 27: Object Recognition: History and Overvielazebnik/spring10/lec16_recognition_intro.pdf · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio

Sliding window approaches

– Scale / orientation range to search over

g pp

– Speed– Context

Page 28: Object Recognition: History and Overvielazebnik/spring10/lec16_recognition_intro.pdf · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio

Local featuresCombining local appearance, spatial constraints, invariants,

L ’02

Combining local appearance, spatial constraints, invariants, and classification techniques from machine learning.

Lowe’02

Mahamud & Hebert’03

Schmid & Mohr’97

Page 29: Object Recognition: History and Overvielazebnik/spring10/lec16_recognition_intro.pdf · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio

Local features for recognition of object instances

Page 30: Object Recognition: History and Overvielazebnik/spring10/lec16_recognition_intro.pdf · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio

Local features for recognition of object instances

• Lowe, et al. 1999, 2003• Mahamud and Hebert, 2000• Ferrari, Tuytelaars, and Van Gool, 2004• Rothganger, Lazebnik, and Ponce, 2004

M l d P 2005• Moreels and Perona, 2005• …

Page 31: Object Recognition: History and Overvielazebnik/spring10/lec16_recognition_intro.pdf · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio

Representing categories: Parts and Structure

Weber, Welling & Perona (2000), Fergus, Perona & Zisserman (2003)

Page 32: Object Recognition: History and Overvielazebnik/spring10/lec16_recognition_intro.pdf · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio

Parts-and-shape representationParts and shape representation• Model:

– Object as a set of parts– Relative locations between parts– Appearance of part

Figure from [Fischler & Elschlager 73]

Page 33: Object Recognition: History and Overvielazebnik/spring10/lec16_recognition_intro.pdf · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio

Bag-of-features models

Obj tObj t Bag of Bag of

g

ObjectObject gg‘words’‘words’

Page 34: Object Recognition: History and Overvielazebnik/spring10/lec16_recognition_intro.pdf · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio

Objects as textureObjects as texture• All of these are treated as being the sameg

• No distinction between foreground and o d st ct o bet ee o eg ou d a dbackground: scene recognition?

Page 35: Object Recognition: History and Overvielazebnik/spring10/lec16_recognition_intro.pdf · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio

Timeline of recognition• 1965-late 1980s: alignment, geometric primitives• Early 1990s: invariants appearance-based• Early 1990s: invariants, appearance-based

methods• Mid-late 1990s: sliding window approachesMid late 1990s: sliding window approaches• Late 1990s: feature-based methods• Early 2000s: parts-and-shape models• Early 2000s: parts-and-shape models• 2003 – present: bags of features• Present trends: combination of local and global• Present trends: combination of local and global

methods, modeling context, emphasis on “image parsing”parsing

Page 36: Object Recognition: History and Overvielazebnik/spring10/lec16_recognition_intro.pdf · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio

Global scene contextGlobal scene context

• The “gist” of a scene: Oliva & Torralba (2001)The gist of a scene: Oliva & Torralba (2001)

http://people.csail.mit.edu/torralba/code/spatialenvelope/

Page 37: Object Recognition: History and Overvielazebnik/spring10/lec16_recognition_intro.pdf · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio

J. Hays and A. Efros, Scene Completion using Millions of Photographs SIGGRAPH 2007Millions of Photographs, SIGGRAPH 2007

Page 38: Object Recognition: History and Overvielazebnik/spring10/lec16_recognition_intro.pdf · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio

Scene-level context for image parsing

J. Tighe and S. Lazebnik, ECCV 2010 submission

Page 39: Object Recognition: History and Overvielazebnik/spring10/lec16_recognition_intro.pdf · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio

Geometric context

D Hoiem A Efros and M Herbert Putting Objects inD. Hoiem, A. Efros, and M. Herbert. Putting Objects in Perspective. CVPR 2006.

Page 40: Object Recognition: History and Overvielazebnik/spring10/lec16_recognition_intro.pdf · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio

What “works” todayWhat works today

• Reading license plates, zip codes, checksReading license plates, zip codes, checks

Page 41: Object Recognition: History and Overvielazebnik/spring10/lec16_recognition_intro.pdf · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio

What “works” todayWhat works today

• Reading license plates, zip codes, checksReading license plates, zip codes, checks• Fingerprint recognition

Page 42: Object Recognition: History and Overvielazebnik/spring10/lec16_recognition_intro.pdf · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio

What “works” todayWhat works today

• Reading license plates, zip codes, checksReading license plates, zip codes, checks• Fingerprint recognition• Face detectionFace detection

Page 43: Object Recognition: History and Overvielazebnik/spring10/lec16_recognition_intro.pdf · Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio

What “works” todayWhat works today

• Reading license plates, zip codes, checksReading license plates, zip codes, checks• Fingerprint recognition• Face detectionFace detection• Recognition of flat textured objects (CD covers,

book covers etc )book covers, etc.)


Top Related