3d layoutcrf derek hoiem carsten rother john winn

3D LayoutCRF

Derek Hoiem

Carsten Rother

John Winn

Goal 1: Object Description

Object Description:

• Bounding Box

• Viewpoint

• Color

• Pose

• Subclass

Goal 2: Object Segmentation

• Combine object-level and pixel-level reasoning

Key Idea

Recognition Requires Object-Level Reasoning

• Position

• Shape/Size

• Viewpoint/Pose

• Style/Color

Recognition Requires Object-Level Reasoning

Solution: Window Detector?

• 45 degree range of viewpoints

• Minor scale/position variation

What if we have a really good model?

Recognition Requires Part-Level Reasoning

• Propose good global model

Recognition Requires Part-Level Reasoning

• Propose good global model

• Occlusions

Context Requires Both Object and Part-Level Info

• Size relationships require object model

Context Requires Both Object and Part-Level Info

• Surface relationships require occlusion info

Visibly sitting on ground

Not visibly sitting on ground

Our Object/Part Model

Ti = {

hj object parts

bounding box, viewpoint, color model, instance cost }

part consistency

occlusions

h1 h2 h3 h4

h5 h6 h7 h8

h9 h10 h11 hn

Extension from [Winn Shotton 2006]

Modeling Viewpoint

Parameterized by Bounding Box and Corner

Assigning Parts from Model

Training Image

Training Annotation

Assigned Parts3D Parts Model

Part Assignment Consistency

Relabeling

• Allowing slight deformations, relabel training data

Training Image

Original Labels

New Labels

Eight Viewpoint/Scale Ranges

Height Range

• Appearance (but not location) constant within each range

Modeling Part Appearance

• Template patches (normalized xcorr)

• Intensity / Color

Image Edges (DT)

Modeling Part Appearance

• Randomized decision trees– 25 trees, 250 leaf nodes

• Once:– Learn structure on 50,000 object / 50,000 background

pixels

• For each appearance model:– Learn parameters on all pixels (850 LabelMe images)

Inference

Input Image

Inference

Input Image

Proposals

• One per appearance model

• Objects proposed by connected components

Proposal Stage Model

hi object parts

part consistency

occlusions

h1 h2 h3 h4

h5 h6 h7 h8

h9 h10 h11 hn

• CRF Inference (TRW-BP)

Inference

Refinement

• One per proposal

• Incorporate viewpoint, size information

Proposals

Input Image

Refinement Stage Model

Ti = {

hi object parts

bounding box, viewpoint }

part consistency

occlusions

h1 h2 h3 h4

h5 h6 h7 h8

h9 h10 h11 hn

Inference

Refinement

Proposals

Arbitration

• Includes color model, instance penalty (graph cuts)

Input Image

Preliminary Results on UIUC

• Trained on 20, tested on rest• Quantitatively comparable to best

Preliminary Results on UIUC

Without Instance Cost

With Instance Cost

h1 h2 h3 h4

h5 h6 h7 h8

h9 h10 h11 hn

Preliminary Results on PASCAL’06

• 25 images– One proposal (viewpoint within 45 degrees,

scale of 26-38 pixels)

Preliminary Results on PASCAL’06

Without Color Model

With Color Model

Conclusion

• Combined object-level and pixel-level reasoning – Object-level: Position/Size, Viewpoint, Color– Pixel-level: Part appearance, Occlusion

reasoning

• Good preliminary results

3d layoutcrf derek hoiem carsten rother john winn

objectlevel reasoningsolution

object modelcontext

good model

pascalevaluate color

hj object partsbounding

level infosize relationships

pascal06preliminary

bestpreliminary results

Documents

rother valley railway proposed level...

2011 - rother tourism economic impact estimates

carsten rother, dmitrij schlesinger

computational photography cs498dh derek hoiem 8/25/11

textonboost : joint appearance, shape and context modeling...

learning jigsaws for clustering appearance and shape john...

visual scene understanding (cs 598) derek hoiem course...

winn dixie project

winn$tock 2010

computational photography cs498dwh derek hoiem 8/24/10

winn their hearts!

eastbourne borough council€¦ · web view1. rother...

rother valley railway economic impacts report

clustering appearance and shape by learning jigsaws anitha...

winn portfolio

rother baron: the pulse of god

summary experiment - rother district web viewme an even...

rother district council elections

winn - unsuspected eloquence

for the winn