introduction to computer vision - brown universitycs.brown.edu/courses/cs143/2009/lecture21.pdfcs143...

Post on 24-Apr-2018

236 Views

Category:

Documents

1 Downloads

Preview:

Click to see full reader

TRANSCRIPT

©Michael J. Black CS143 Intro to Computer Vision

Introduction to Computer Vision

Michael J. Black

Nov 2009

Perspective projection and affine

motion

©Michael J. Black CS143 Intro to Computer Vision

©Michael J. Black CS143 Intro to Computer Vision

Goals

• Today

– Perspective projection

– 3D motion

• Wed

– Projects

• Friday

– Regularization and robust statistics

©Michael J. Black

Reading

• Szeliski

– 2.2.2: 2D tranformations including affine

– 2.1.3: 3D transformations including rotation

matrices (more detailed than we need)

– 2.1.4: 3D to 2D projections

CS143 Intro to Computer Vision

©Michael J. Black CS143 Intro to Computer Vision

Affine Flow

©Michael J. Black CS143 Intro to Computer Vision

Why Affine?

• Where does this affine approximation come from?

• All our models are approximations to the world. What are the assumptions in the affine approximation?

• For this we need some geometry.

©Michael J. Black CS143 Intro to Computer Vision

Pinhole cameras

• Abstract camera model - box with a small hole in it.

• Easy to build but needs a lot of light.

©Michael J. Black CS143 Intro to Computer Vision

The equation of projection

Image plane

©Michael J. Black CS143 Intro to Computer Vision

Perspective Projection

f

(x,y,z)

(x’,y’)

X

Z

Y

O

Move image plane in

front of focal point

(image not inverted)

Optical Axis

Camera

coordinate

frame

Many to one

©Michael J. Black CS143 Intro to Computer Vision

Perspective Projection

f

(x,y,z)

(x’,y’)

X

Z

Y

O

Short focal length

means wider field of

view.

©Michael J. Black CS143 Intro to Computer Vision

Perspective Projection

f

(x,y,z)

(x’,y’)

X

Z

Y

r

r’

O

©Michael J. Black CS143 Intro to Computer Vision

Perspective Projection

f

(x,y,z)

(x’,y’)

X

Z

Y

r

r’

z

O

©Michael J. Black CS143 Intro to Computer Vision

Perspective Projection

f

(x,y,z)

(x’,y’)

X

Z

Y

r

r’

z

O

©Michael J. Black CS143 Intro to Computer Vision

Perspective Projection

f

(x,y,z)

(x’,y’)

X

Z

Y

r

r’

z

=f

z

x ' =fx

z

y ' =fy

z

O

(important slide)

©Michael J. Black CS143 Intro to Computer Vision

Distant objects are smaller

©Michael J. Black CS143 Intro to Computer Vision

Orthographic Projection

X

Z

Y

Assume an infinite focal

length and that the world is

infinitely far away.

©Michael J. Black CS143 Intro to Computer Vision

Weak Perspective (scaled orthography)

X

Z

Y

Assume variation in depth is small relative to the distance from the camera.

Approximate scene as a fronto-parallel plane

©Michael J. Black CS143 Intro to Computer Vision

Weak Perspective (scaled orthography)

X

Z

Y

©Michael J. Black CS143 Intro to Computer Vision

Claim

For small motions, affine flow approximates

the motion of a plane viewed under

orthographic projection.

u = vx = a1x + a2y + a3v = vy = a4x + a5y + a6

uv

=x y 1 0 0 0

0 0 0 x y 1

a1a2a3a4a5a6

©Michael J. Black CS143 Intro to Computer Vision

Recall: 3D motion

©Michael J. Black CS143 Intro to Computer Vision

Motion Models

3D Rigid Motion

©Michael J. Black CS143 Intro to Computer Vision

Rotation

Review: Ch 2.1.3, Euclidean Geometry

Rotation matrix: orthogonal with determinant =1

RT= R 1 det(R) = 1

©Michael J. Black CS143 Intro to Computer Vision

Rotation

Review: Ch 2.1.3, Euclidean Geometry

©Michael J. Black CS143 Intro to Computer Vision

Assumption: Small Motion

(If is small)

©Michael J. Black CS143 Intro to Computer Vision

3D Motion

©Michael J. Black CS143 Intro to Computer Vision

Assumption: Planar World +

Orthographic Projection

©Michael J. Black CS143 Intro to Computer Vision

Assumption: Planar World

a1 = Yb

a3 = Y a + TXa2 = Yc Z

a4 = Z Xb

a5 = Xc

a6 = TY X a

Substitute:

u = vx = a1x + a2y + a3v = vy = a4x + a5y + a6

©Michael J. Black CS143 Intro to Computer Vision

Affine Flow

Small motion assumption

* e.g. at video frame rate

Planar surface

* look at only a small region of the scene

Orthographic projection

* surface distant from camera

* long focal length

u = vx = a1x + a2y + a3v = vy = a4x + a5y + a6

©Michael J. Black CS143 Intro to Computer Vision

Assumptions

What might be wrong with this?

Is there a probabilistic interpretation?

Minimize the negative log.

©Michael J. Black CS143 Intro to Computer Vision

Multiple Motions

©Michael J. Black CS143 Intro to Computer Vision

Occlusion

occlusion disocclusion shear

Multiple motions within a finite region.

©Michael J. Black CS143 Intro to Computer Vision

Coherent Motion

Possibly Gaussian.

©Michael J. Black CS143 Intro to Computer Vision

Multiple Motions

Definitely not Gaussian.

©Michael J. Black CS143 Intro to Computer Vision

Multiple Motions

What is the “best” fitting translational motion?

©Michael J. Black CS143 Intro to Computer Vision

Multiple Motions

Least squares fit.

X

©Michael J. Black CS143 Intro to Computer Vision

Simpler problem: fitting a line to data

y

x

©Michael J. Black CS143 Intro to Computer Vision

y

x

©Michael J. Black CS143 Intro to Computer Vision

y

x

©Michael J. Black CS143 Intro to Computer Vision

y

x

©Michael J. Black CS143 Intro to Computer Vision

Robust Statistics

• Recover the best fit to the majority of the

data.

• Detect and reject outliers.

History.

©Michael J. Black CS143 Intro to Computer Vision

Estimating the mean

0 2 4 6

Gaussian distribution

3

Mean is the optimal solution to:

residual

©Michael J. Black CS143 Intro to Computer Vision

Estimating the Mean

The mean maximizes this likelihood:

The negative log gives (with sigma=1):

“least squares” estimate

©Michael J. Black CS143 Intro to Computer Vision

Estimating the mean

0 2 4 6

©Michael J. Black CS143 Intro to Computer Vision

Estimating the mean

0 2 4

What happens if we change just one measurement?

With a single “bad” data point I can move the mean

arbitrarily far.

6+

top related