cap 6412 advanced computer vision - …bgong/cap6412/lec22.pdfa brief introduction to 3d computer...

76
CAP 6412 Advanced Computer Vision http://www.cs.ucf.edu/~bgong/CAP6412.html Boqing Gong March 31, 2016

Upload: lyduong

Post on 19-Jul-2019

214 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?

CAP6412AdvancedComputerVision

http://www.cs.ucf.edu/~bgong/CAP6412.html

Boqing GongMarch 31,2016

Page 2: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?

Today

• Administrivia• RNN(Review:model,learning,challenge&solution)• LSTM

Page 3: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?

ProjectIIposted,dueTuesday04/26, 11:59pm

• http://www.cs.ucf.edu/~bgong/CAP6412/proj2.pdf

• NextTuesday:lastdaytoacquirepermissionfortakingoption2

Page 4: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?

Nextweek

Tuesday(04/05)

Abdullah Jamal

Thursday(04/07)

Samer Iskander

Page 5: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?

Today

• Administrivia• RNN(Review:model,learning,challenge&solution)• LSTM

Page 6: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?

RecurrentNeuralNetwork

• Threetimestepsandbeyond

Imagecredits:RichardSocher

Page 7: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?

RNN

• Threetimestepsandbeyond • Alayeredfeedforward net• Tiedweights fordifferenttimesteps• Conditioning (memorizing?)onallpreviousinput• Cheap to save memoryinRAM

Imagecredits:RichardSocher

Page 8: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?
Page 9: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?
Page 10: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?
Page 11: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?
Page 12: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?

LSTMslidesborrowedfromHinton

Page 13: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?
Page 14: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?
Page 15: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?
Page 16: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?

ComparingwithPlainRNN

• Threetimestepsandbeyond

Imagecredits:RichardSocher

Page 17: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?

A BriefIntroduction

to 3DComputerVision

Presented byKaran

Daei-Mojdehi

3D ComputerVision

What is 3DComputerVision?

3D ComputerVisionApplications

A short historyof Approaches to3D VisionProblems

A newChallenge:Single ImageDepthEstimation ofGeneral Scenes

”LearningDepth fromSingleMonocularImages UsingDeepConvolutionalNeural Fields”

About the paper

ProblemStatement

ApproachOutline

ApproachDetails

ExperimentResults

Strength andWeakness of thepaper

FutureDirections

References

A Brief Introduction to 3D Computer Vision

Presented by Karan Daei-Mojdehi

Department of Computer ScienceUniversity of Central Florida

March 31, 2016

Page 18: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?

A BriefIntroduction

to 3DComputerVision

Presented byKaran

Daei-Mojdehi

3D ComputerVision

What is 3DComputerVision?

3D ComputerVisionApplications

A short historyof Approaches to3D VisionProblems

A newChallenge:Single ImageDepthEstimation ofGeneral Scenes

”LearningDepth fromSingleMonocularImages UsingDeepConvolutionalNeural Fields”

About the paper

ProblemStatement

ApproachOutline

ApproachDetails

ExperimentResults

Strength andWeakness of thepaper

FutureDirections

References

Outline

1 3D Computer VisionWhat is 3D Computer Vision?3D Computer Vision ApplicationsA short history of Approaches to 3D Vision ProblemsA new Challenge: Single Image Depth Estimation ofGeneral Scenes

2 ”Learning Depth from Single Monocular Images Using DeepConvolutional Neural Fields”

About the paperProblem StatementApproach OutlineApproach DetailsExperiment ResultsStrength and Weakness of the paperFuture Directions

Page 19: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?

A BriefIntroduction

to 3DComputerVision

Presented byKaran

Daei-Mojdehi

3D ComputerVision

What is 3DComputerVision?

3D ComputerVisionApplications

A short historyof Approaches to3D VisionProblems

A newChallenge:Single ImageDepthEstimation ofGeneral Scenes

”LearningDepth fromSingleMonocularImages UsingDeepConvolutionalNeural Fields”

About the paper

ProblemStatement

ApproachOutline

ApproachDetails

ExperimentResults

Strength andWeakness of thepaper

FutureDirections

References

Outline

1 3D Computer VisionWhat is 3D Computer Vision?3D Computer Vision ApplicationsA short history of Approaches to 3D Vision ProblemsA new Challenge: Single Image Depth Estimation ofGeneral Scenes

2 ”Learning Depth from Single Monocular Images Using DeepConvolutional Neural Fields”

About the paperProblem StatementApproach OutlineApproach DetailsExperiment ResultsStrength and Weakness of the paperFuture Directions

Page 20: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?

A BriefIntroduction

to 3DComputerVision

Presented byKaran

Daei-Mojdehi

3D ComputerVision

What is 3DComputerVision?

3D ComputerVisionApplications

A short historyof Approaches to3D VisionProblems

A newChallenge:Single ImageDepthEstimation ofGeneral Scenes

”LearningDepth fromSingleMonocularImages UsingDeepConvolutionalNeural Fields”

About the paper

ProblemStatement

ApproachOutline

ApproachDetails

ExperimentResults

Strength andWeakness of thepaper

FutureDirections

References

Outline

1 3D Computer VisionWhat is 3D Computer Vision?3D Computer Vision ApplicationsA short history of Approaches to 3D Vision ProblemsA new Challenge: Single Image Depth Estimation ofGeneral Scenes

2 ”Learning Depth from Single Monocular Images Using DeepConvolutional Neural Fields”

About the paperProblem StatementApproach OutlineApproach DetailsExperiment ResultsStrength and Weakness of the paperFuture Directions

Page 21: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?

A BriefIntroduction

to 3DComputerVision

Presented byKaran

Daei-Mojdehi

3D ComputerVision

What is 3DComputerVision?

3D ComputerVisionApplications

A short historyof Approaches to3D VisionProblems

A newChallenge:Single ImageDepthEstimation ofGeneral Scenes

”LearningDepth fromSingleMonocularImages UsingDeepConvolutionalNeural Fields”

About the paper

ProblemStatement

ApproachOutline

ApproachDetails

ExperimentResults

Strength andWeakness of thepaper

FutureDirections

References

A Hint About 3D Vision

image source: http://www.markedbyteachers.com/as-and-a-level/psychology/perception-cognitive-psychology-a-revision-categories.html

Page 22: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?

A BriefIntroduction

to 3DComputerVision

Presented byKaran

Daei-Mojdehi

3D ComputerVision

What is 3DComputerVision?

3D ComputerVisionApplications

A short historyof Approaches to3D VisionProblems

A newChallenge:Single ImageDepthEstimation ofGeneral Scenes

”LearningDepth fromSingleMonocularImages UsingDeepConvolutionalNeural Fields”

About the paper

ProblemStatement

ApproachOutline

ApproachDetails

ExperimentResults

Strength andWeakness of thepaper

FutureDirections

References

What is 3D Computer Vision?

Definition: extraction of 3D information from digitalimages

i.e. we want to infer a scene’s geometry from takenimages of that scene

common representations of 3D information:depth maps,meshes, point clouds, volumetric models

example of depth map(source:[4]) sample point cloud

Motivation?

Page 23: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?

A BriefIntroduction

to 3DComputerVision

Presented byKaran

Daei-Mojdehi

3D ComputerVision

What is 3DComputerVision?

3D ComputerVisionApplications

A short historyof Approaches to3D VisionProblems

A newChallenge:Single ImageDepthEstimation ofGeneral Scenes

”LearningDepth fromSingleMonocularImages UsingDeepConvolutionalNeural Fields”

About the paper

ProblemStatement

ApproachOutline

ApproachDetails

ExperimentResults

Strength andWeakness of thepaper

FutureDirections

References

What is 3D Computer Vision?

Definition: extraction of 3D information from digitalimages

i.e. we want to infer a scene’s geometry from takenimages of that scene

common representations of 3D information:depth maps,meshes, point clouds, volumetric models

example of depth map(source:[4]) sample point cloud

Motivation?

Page 24: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?

A BriefIntroduction

to 3DComputerVision

Presented byKaran

Daei-Mojdehi

3D ComputerVision

What is 3DComputerVision?

3D ComputerVisionApplications

A short historyof Approaches to3D VisionProblems

A newChallenge:Single ImageDepthEstimation ofGeneral Scenes

”LearningDepth fromSingleMonocularImages UsingDeepConvolutionalNeural Fields”

About the paper

ProblemStatement

ApproachOutline

ApproachDetails

ExperimentResults

Strength andWeakness of thepaper

FutureDirections

References

What is 3D Computer Vision?

Definition: extraction of 3D information from digitalimages

i.e. we want to infer a scene’s geometry from takenimages of that scene

common representations of 3D information:depth maps,meshes, point clouds, volumetric models

example of depth map(source:[4]) sample point cloud

Motivation?

Page 25: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?

A BriefIntroduction

to 3DComputerVision

Presented byKaran

Daei-Mojdehi

3D ComputerVision

What is 3DComputerVision?

3D ComputerVisionApplications

A short historyof Approaches to3D VisionProblems

A newChallenge:Single ImageDepthEstimation ofGeneral Scenes

”LearningDepth fromSingleMonocularImages UsingDeepConvolutionalNeural Fields”

About the paper

ProblemStatement

ApproachOutline

ApproachDetails

ExperimentResults

Strength andWeakness of thepaper

FutureDirections

References

Outline

1 3D Computer VisionWhat is 3D Computer Vision?3D Computer Vision ApplicationsA short history of Approaches to 3D Vision ProblemsA new Challenge: Single Image Depth Estimation ofGeneral Scenes

2 ”Learning Depth from Single Monocular Images Using DeepConvolutional Neural Fields”

About the paperProblem StatementApproach OutlineApproach DetailsExperiment ResultsStrength and Weakness of the paperFuture Directions

Page 26: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?

A BriefIntroduction

to 3DComputerVision

Presented byKaran

Daei-Mojdehi

3D ComputerVision

What is 3DComputerVision?

3D ComputerVisionApplications

A short historyof Approaches to3D VisionProblems

A newChallenge:Single ImageDepthEstimation ofGeneral Scenes

”LearningDepth fromSingleMonocularImages UsingDeepConvolutionalNeural Fields”

About the paper

ProblemStatement

ApproachOutline

ApproachDetails

ExperimentResults

Strength andWeakness of thepaper

FutureDirections

References

3D Reconstruction

3D reconstruction is the process of capturing the shapeand appearance of real objectsGoogle Earth 3D Reconstruction of Aerial Images (click)

UCF Campus 3D Reconstruction from Google Earth

Page 27: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?

A BriefIntroduction

to 3DComputerVision

Presented byKaran

Daei-Mojdehi

3D ComputerVision

What is 3DComputerVision?

3D ComputerVisionApplications

A short historyof Approaches to3D VisionProblems

A newChallenge:Single ImageDepthEstimation ofGeneral Scenes

”LearningDepth fromSingleMonocularImages UsingDeepConvolutionalNeural Fields”

About the paper

ProblemStatement

ApproachOutline

ApproachDetails

ExperimentResults

Strength andWeakness of thepaper

FutureDirections

References

Image Registration

Image Registration is the process of alignment of differentviews of a scene with overlapping viewsOne familiar application is in taking panorama pictures

another interesting application is Microsoft Synth R© (click)which emerged from Photo Tourism[5]

Page 28: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?

A BriefIntroduction

to 3DComputerVision

Presented byKaran

Daei-Mojdehi

3D ComputerVision

What is 3DComputerVision?

3D ComputerVisionApplications

A short historyof Approaches to3D VisionProblems

A newChallenge:Single ImageDepthEstimation ofGeneral Scenes

”LearningDepth fromSingleMonocularImages UsingDeepConvolutionalNeural Fields”

About the paper

ProblemStatement

ApproachOutline

ApproachDetails

ExperimentResults

Strength andWeakness of thepaper

FutureDirections

References

Image Registration

Image Registration is the process of alignment of differentviews of a scene with overlapping viewsOne familiar application is in taking panorama pictures

another interesting application is Microsoft Synth R© (click)which emerged from Photo Tourism[5]

Page 29: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?

A BriefIntroduction

to 3DComputerVision

Presented byKaran

Daei-Mojdehi

3D ComputerVision

What is 3DComputerVision?

3D ComputerVisionApplications

A short historyof Approaches to3D VisionProblems

A newChallenge:Single ImageDepthEstimation ofGeneral Scenes

”LearningDepth fromSingleMonocularImages UsingDeepConvolutionalNeural Fields”

About the paper

ProblemStatement

ApproachOutline

ApproachDetails

ExperimentResults

Strength andWeakness of thepaper

FutureDirections

References

Other 3D Computer Vision Applications

Recognition From 3D Reconstruction

Robotics

Navigationobject manipulation with collision detection

Stereoscopy: Creating illusion of depth by creating twoviews of a scene for binocular vision

Page 30: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?

A BriefIntroduction

to 3DComputerVision

Presented byKaran

Daei-Mojdehi

3D ComputerVision

What is 3DComputerVision?

3D ComputerVisionApplications

A short historyof Approaches to3D VisionProblems

A newChallenge:Single ImageDepthEstimation ofGeneral Scenes

”LearningDepth fromSingleMonocularImages UsingDeepConvolutionalNeural Fields”

About the paper

ProblemStatement

ApproachOutline

ApproachDetails

ExperimentResults

Strength andWeakness of thepaper

FutureDirections

References

Other 3D Computer Vision Applications

Recognition From 3D Reconstruction

Robotics

Navigationobject manipulation with collision detection

Stereoscopy: Creating illusion of depth by creating twoviews of a scene for binocular vision

Page 31: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?

A BriefIntroduction

to 3DComputerVision

Presented byKaran

Daei-Mojdehi

3D ComputerVision

What is 3DComputerVision?

3D ComputerVisionApplications

A short historyof Approaches to3D VisionProblems

A newChallenge:Single ImageDepthEstimation ofGeneral Scenes

”LearningDepth fromSingleMonocularImages UsingDeepConvolutionalNeural Fields”

About the paper

ProblemStatement

ApproachOutline

ApproachDetails

ExperimentResults

Strength andWeakness of thepaper

FutureDirections

References

Outline

1 3D Computer VisionWhat is 3D Computer Vision?3D Computer Vision ApplicationsA short history of Approaches to 3D Vision ProblemsA new Challenge: Single Image Depth Estimation ofGeneral Scenes

2 ”Learning Depth from Single Monocular Images Using DeepConvolutional Neural Fields”

About the paperProblem StatementApproach OutlineApproach DetailsExperiment ResultsStrength and Weakness of the paperFuture Directions

Page 32: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?

A BriefIntroduction

to 3DComputerVision

Presented byKaran

Daei-Mojdehi

3D ComputerVision

What is 3DComputerVision?

3D ComputerVisionApplications

A short historyof Approaches to3D VisionProblems

A newChallenge:Single ImageDepthEstimation ofGeneral Scenes

”LearningDepth fromSingleMonocularImages UsingDeepConvolutionalNeural Fields”

About the paper

ProblemStatement

ApproachOutline

ApproachDetails

ExperimentResults

Strength andWeakness of thepaper

FutureDirections

References

Shape From X series

Pioneering work: Shape From Shading[3] (also known asphotoclinometry)

Downside: strong assumptions on imaging conditions:

Lambertian Surface (glossy coatings void this)

Orthographic projection (assuming projection into imageplane is affine)a single infinitely distant and known light source

other similar Shape from X approaches: shape fromtexture, shape from

Page 33: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?

A BriefIntroduction

to 3DComputerVision

Presented byKaran

Daei-Mojdehi

3D ComputerVision

What is 3DComputerVision?

3D ComputerVisionApplications

A short historyof Approaches to3D VisionProblems

A newChallenge:Single ImageDepthEstimation ofGeneral Scenes

”LearningDepth fromSingleMonocularImages UsingDeepConvolutionalNeural Fields”

About the paper

ProblemStatement

ApproachOutline

ApproachDetails

ExperimentResults

Strength andWeakness of thepaper

FutureDirections

References

Shape From X series

Pioneering work: Shape From Shading[3] (also known asphotoclinometry)

Downside: strong assumptions on imaging conditions:

Lambertian Surface (glossy coatings void this)Orthographic projection (assuming projection into imageplane is affine)

a single infinitely distant and known light source

other similar Shape from X approaches: shape fromtexture, shape from

Page 34: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?

A BriefIntroduction

to 3DComputerVision

Presented byKaran

Daei-Mojdehi

3D ComputerVision

What is 3DComputerVision?

3D ComputerVisionApplications

A short historyof Approaches to3D VisionProblems

A newChallenge:Single ImageDepthEstimation ofGeneral Scenes

”LearningDepth fromSingleMonocularImages UsingDeepConvolutionalNeural Fields”

About the paper

ProblemStatement

ApproachOutline

ApproachDetails

ExperimentResults

Strength andWeakness of thepaper

FutureDirections

References

Shape From X series

Pioneering work: Shape From Shading[3] (also known asphotoclinometry)

Downside: strong assumptions on imaging conditions:

Lambertian Surface (glossy coatings void this)Orthographic projection (assuming projection into imageplane is affine)a single infinitely distant and known light source

other similar Shape from X approaches: shape fromtexture, shape from

Page 35: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?

A BriefIntroduction

to 3DComputerVision

Presented byKaran

Daei-Mojdehi

3D ComputerVision

What is 3DComputerVision?

3D ComputerVisionApplications

A short historyof Approaches to3D VisionProblems

A newChallenge:Single ImageDepthEstimation ofGeneral Scenes

”LearningDepth fromSingleMonocularImages UsingDeepConvolutionalNeural Fields”

About the paper

ProblemStatement

ApproachOutline

ApproachDetails

ExperimentResults

Strength andWeakness of thepaper

FutureDirections

References

Shape From X series

Pioneering work: Shape From Shading[3] (also known asphotoclinometry)

Downside: strong assumptions on imaging conditions:

Lambertian Surface (glossy coatings void this)Orthographic projection (assuming projection into imageplane is affine)a single infinitely distant and known light source

other similar Shape from X approaches: shape fromtexture, shape from

Page 36: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?

A BriefIntroduction

to 3DComputerVision

Presented byKaran

Daei-Mojdehi

3D ComputerVision

What is 3DComputerVision?

3D ComputerVisionApplications

A short historyof Approaches to3D VisionProblems

A newChallenge:Single ImageDepthEstimation ofGeneral Scenes

”LearningDepth fromSingleMonocularImages UsingDeepConvolutionalNeural Fields”

About the paper

ProblemStatement

ApproachOutline

ApproachDetails

ExperimentResults

Strength andWeakness of thepaper

FutureDirections

References

Structure from Motion

can infer the shape of an object by tracking its features inframes of a video

an example software which is exploiting this technique isVideo Trac(click)

Page 37: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?

A BriefIntroduction

to 3DComputerVision

Presented byKaran

Daei-Mojdehi

3D ComputerVision

What is 3DComputerVision?

3D ComputerVisionApplications

A short historyof Approaches to3D VisionProblems

A newChallenge:Single ImageDepthEstimation ofGeneral Scenes

”LearningDepth fromSingleMonocularImages UsingDeepConvolutionalNeural Fields”

About the paper

ProblemStatement

ApproachOutline

ApproachDetails

ExperimentResults

Strength andWeakness of thepaper

FutureDirections

References

Depth from Defocus

Take two images with different focus blur

exercise precise blur estimation and magnification variationresulted by changing focus

image credits: [6]

Page 38: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?

A BriefIntroduction

to 3DComputerVision

Presented byKaran

Daei-Mojdehi

3D ComputerVision

What is 3DComputerVision?

3D ComputerVisionApplications

A short historyof Approaches to3D VisionProblems

A newChallenge:Single ImageDepthEstimation ofGeneral Scenes

”LearningDepth fromSingleMonocularImages UsingDeepConvolutionalNeural Fields”

About the paper

ProblemStatement

ApproachOutline

ApproachDetails

ExperimentResults

Strength andWeakness of thepaper

FutureDirections

References

Model-Based Techniques

Derive a morphable model by scanning huge number ofexamples

New objects of same type can be modeled by forminglinear combination of prototypes

A Morphable Model for the Synthesis of 3D Faces [1]:

image credits: [1]

Page 39: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?

A BriefIntroduction

to 3DComputerVision

Presented byKaran

Daei-Mojdehi

3D ComputerVision

What is 3DComputerVision?

3D ComputerVisionApplications

A short historyof Approaches to3D VisionProblems

A newChallenge:Single ImageDepthEstimation ofGeneral Scenes

”LearningDepth fromSingleMonocularImages UsingDeepConvolutionalNeural Fields”

About the paper

ProblemStatement

ApproachOutline

ApproachDetails

ExperimentResults

Strength andWeakness of thepaper

FutureDirections

References

Stereopsis

Is based on visual disparity (i.e. parallax)

We can say that we have reached a point where stereo =laser scan

boils down to a problem of matching robust featuredescriptors in two views ( in case of uncallibrated camerawhich is the general case)

Page 40: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?

A BriefIntroduction

to 3DComputerVision

Presented byKaran

Daei-Mojdehi

3D ComputerVision

What is 3DComputerVision?

3D ComputerVisionApplications

A short historyof Approaches to3D VisionProblems

A newChallenge:Single ImageDepthEstimation ofGeneral Scenes

”LearningDepth fromSingleMonocularImages UsingDeepConvolutionalNeural Fields”

About the paper

ProblemStatement

ApproachOutline

ApproachDetails

ExperimentResults

Strength andWeakness of thepaper

FutureDirections

References

Outline

1 3D Computer VisionWhat is 3D Computer Vision?3D Computer Vision ApplicationsA short history of Approaches to 3D Vision ProblemsA new Challenge: Single Image Depth Estimation ofGeneral Scenes

2 ”Learning Depth from Single Monocular Images Using DeepConvolutional Neural Fields”

About the paperProblem StatementApproach OutlineApproach DetailsExperiment ResultsStrength and Weakness of the paperFuture Directions

Page 41: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?

A BriefIntroduction

to 3DComputerVision

Presented byKaran

Daei-Mojdehi

3D ComputerVision

What is 3DComputerVision?

3D ComputerVisionApplications

A short historyof Approaches to3D VisionProblems

A newChallenge:Single ImageDepthEstimation ofGeneral Scenes

”LearningDepth fromSingleMonocularImages UsingDeepConvolutionalNeural Fields”

About the paper

ProblemStatement

ApproachOutline

ApproachDetails

ExperimentResults

Strength andWeakness of thepaper

FutureDirections

References

A new Challenge: Single Image Depth Estimationof General Scenes

Motivation:

previous approaches short comingsthere exists cues for depth even in single images

ill-posed problem → non-deterministic approaches

probabilistic and learning based methods are applied

Page 42: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?

A BriefIntroduction

to 3DComputerVision

Presented byKaran

Daei-Mojdehi

3D ComputerVision

What is 3DComputerVision?

3D ComputerVisionApplications

A short historyof Approaches to3D VisionProblems

A newChallenge:Single ImageDepthEstimation ofGeneral Scenes

”LearningDepth fromSingleMonocularImages UsingDeepConvolutionalNeural Fields”

About the paper

ProblemStatement

ApproachOutline

ApproachDetails

ExperimentResults

Strength andWeakness of thepaper

FutureDirections

References

A new Challenge: Single Image Depth Estimationof General Scenes

Motivation:

previous approaches short comingsthere exists cues for depth even in single images

ill-posed problem → non-deterministic approaches

probabilistic and learning based methods are applied

Page 43: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?

A BriefIntroduction

to 3DComputerVision

Presented byKaran

Daei-Mojdehi

3D ComputerVision

What is 3DComputerVision?

3D ComputerVisionApplications

A short historyof Approaches to3D VisionProblems

A newChallenge:Single ImageDepthEstimation ofGeneral Scenes

”LearningDepth fromSingleMonocularImages UsingDeepConvolutionalNeural Fields”

About the paper

ProblemStatement

ApproachOutline

ApproachDetails

ExperimentResults

Strength andWeakness of thepaper

FutureDirections

References

Outline

1 3D Computer VisionWhat is 3D Computer Vision?3D Computer Vision ApplicationsA short history of Approaches to 3D Vision ProblemsA new Challenge: Single Image Depth Estimation ofGeneral Scenes

2 ”Learning Depth from Single Monocular Images Using DeepConvolutional Neural Fields”

About the paperProblem StatementApproach OutlineApproach DetailsExperiment ResultsStrength and Weakness of the paperFuture Directions

Page 44: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?

A BriefIntroduction

to 3DComputerVision

Presented byKaran

Daei-Mojdehi

3D ComputerVision

What is 3DComputerVision?

3D ComputerVisionApplications

A short historyof Approaches to3D VisionProblems

A newChallenge:Single ImageDepthEstimation ofGeneral Scenes

”LearningDepth fromSingleMonocularImages UsingDeepConvolutionalNeural Fields”

About the paper

ProblemStatement

ApproachOutline

ApproachDetails

ExperimentResults

Strength andWeakness of thepaper

FutureDirections

References

Learning Depth from Single Monocular ImagesUsing Deep Convolutional Neural Fields[4]

By F. Liu, et al.

University of Adelaide, Australlia

currently holds state of the art results

A Deep Learning Approach

Main Contributions:introduces a framework for joint training of a CNN andcontinuous CRF

Page 45: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?

A BriefIntroduction

to 3DComputerVision

Presented byKaran

Daei-Mojdehi

3D ComputerVision

What is 3DComputerVision?

3D ComputerVisionApplications

A short historyof Approaches to3D VisionProblems

A newChallenge:Single ImageDepthEstimation ofGeneral Scenes

”LearningDepth fromSingleMonocularImages UsingDeepConvolutionalNeural Fields”

About the paper

ProblemStatement

ApproachOutline

ApproachDetails

ExperimentResults

Strength andWeakness of thepaper

FutureDirections

References

Outline

1 3D Computer VisionWhat is 3D Computer Vision?3D Computer Vision ApplicationsA short history of Approaches to 3D Vision ProblemsA new Challenge: Single Image Depth Estimation ofGeneral Scenes

2 ”Learning Depth from Single Monocular Images Using DeepConvolutional Neural Fields”

About the paperProblem StatementApproach OutlineApproach DetailsExperiment ResultsStrength and Weakness of the paperFuture Directions

Page 46: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?

A BriefIntroduction

to 3DComputerVision

Presented byKaran

Daei-Mojdehi

3D ComputerVision

What is 3DComputerVision?

3D ComputerVisionApplications

A short historyof Approaches to3D VisionProblems

A newChallenge:Single ImageDepthEstimation ofGeneral Scenes

”LearningDepth fromSingleMonocularImages UsingDeepConvolutionalNeural Fields”

About the paper

ProblemStatement

ApproachOutline

ApproachDetails

ExperimentResults

Strength andWeakness of thepaper

FutureDirections

References

Problem StatementMonocular Depth Estimation

Infer the depth of each pixel given a single RGB Image ofa scene

Is an essential step for 3D reconstruction of a scene

ill-posed problem as there are only subtle cues for depth ina single image (parallax, occlusion, perspective, etc)

image source: NYU v2 Depth Dataset

Page 47: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?

A BriefIntroduction

to 3DComputerVision

Presented byKaran

Daei-Mojdehi

3D ComputerVision

What is 3DComputerVision?

3D ComputerVisionApplications

A short historyof Approaches to3D VisionProblems

A newChallenge:Single ImageDepthEstimation ofGeneral Scenes

”LearningDepth fromSingleMonocularImages UsingDeepConvolutionalNeural Fields”

About the paper

ProblemStatement

ApproachOutline

ApproachDetails

ExperimentResults

Strength andWeakness of thepaper

FutureDirections

References

Outline

1 3D Computer VisionWhat is 3D Computer Vision?3D Computer Vision ApplicationsA short history of Approaches to 3D Vision ProblemsA new Challenge: Single Image Depth Estimation ofGeneral Scenes

2 ”Learning Depth from Single Monocular Images Using DeepConvolutional Neural Fields”

About the paperProblem StatementApproach OutlineApproach DetailsExperiment ResultsStrength and Weakness of the paperFuture Directions

Page 48: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?

A BriefIntroduction

to 3DComputerVision

Presented byKaran

Daei-Mojdehi

3D ComputerVision

What is 3DComputerVision?

3D ComputerVisionApplications

A short historyof Approaches to3D VisionProblems

A newChallenge:Single ImageDepthEstimation ofGeneral Scenes

”LearningDepth fromSingleMonocularImages UsingDeepConvolutionalNeural Fields”

About the paper

ProblemStatement

ApproachOutline

ApproachDetails

ExperimentResults

Strength andWeakness of thepaper

FutureDirections

References

Approach Outline

Joint Training of Deep Convolutional Network andConditional Random Field (CRF)Continuous CRF since depth is continuousunary and pairwise potentials of CRF are learnt byseparate networks

Page 49: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?

A BriefIntroduction

to 3DComputerVision

Presented byKaran

Daei-Mojdehi

3D ComputerVision

What is 3DComputerVision?

3D ComputerVisionApplications

A short historyof Approaches to3D VisionProblems

A newChallenge:Single ImageDepthEstimation ofGeneral Scenes

”LearningDepth fromSingleMonocularImages UsingDeepConvolutionalNeural Fields”

About the paper

ProblemStatement

ApproachOutline

ApproachDetails

ExperimentResults

Strength andWeakness of thepaper

FutureDirections

References

Approach Outline

Joint Training of Deep Convolutional Network andConditional Random Field (CRF)

Continuous CRF since depth is continuous

unary and pairwise potentials of CRF are learnt byseparate networks

Page 50: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?

A BriefIntroduction

to 3DComputerVision

Presented byKaran

Daei-Mojdehi

3D ComputerVision

What is 3DComputerVision?

3D ComputerVisionApplications

A short historyof Approaches to3D VisionProblems

A newChallenge:Single ImageDepthEstimation ofGeneral Scenes

”LearningDepth fromSingleMonocularImages UsingDeepConvolutionalNeural Fields”

About the paper

ProblemStatement

ApproachOutline

ApproachDetails

ExperimentResults

Strength andWeakness of thepaper

FutureDirections

References

Approach Outline

Joint Training of Deep Convolutional Network andConditional Random Field (CRF)

Continuous CRF since depth is continuous

unary and pairwise potentials of CRF are learnt byseparate networks

Page 51: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?

A BriefIntroduction

to 3DComputerVision

Presented byKaran

Daei-Mojdehi

3D ComputerVision

What is 3DComputerVision?

3D ComputerVisionApplications

A short historyof Approaches to3D VisionProblems

A newChallenge:Single ImageDepthEstimation ofGeneral Scenes

”LearningDepth fromSingleMonocularImages UsingDeepConvolutionalNeural Fields”

About the paper

ProblemStatement

ApproachOutline

ApproachDetails

ExperimentResults

Strength andWeakness of thepaper

FutureDirections

References

Approach Outline

Joint Training of Deep Convolutional Network andConditional Random Field (CRF)

Continuous CRF since depth is continuous

unary and pairwise potentials of CRF are learnt byseparate networks

Page 52: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?

A BriefIntroduction

to 3DComputerVision

Presented byKaran

Daei-Mojdehi

3D ComputerVision

What is 3DComputerVision?

3D ComputerVisionApplications

A short historyof Approaches to3D VisionProblems

A newChallenge:Single ImageDepthEstimation ofGeneral Scenes

”LearningDepth fromSingleMonocularImages UsingDeepConvolutionalNeural Fields”

About the paper

ProblemStatement

ApproachOutline

ApproachDetails

ExperimentResults

Strength andWeakness of thepaper

FutureDirections

References

Outline

1 3D Computer VisionWhat is 3D Computer Vision?3D Computer Vision ApplicationsA short history of Approaches to 3D Vision ProblemsA new Challenge: Single Image Depth Estimation ofGeneral Scenes

2 ”Learning Depth from Single Monocular Images Using DeepConvolutional Neural Fields”

About the paperProblem StatementApproach OutlineApproach DetailsExperiment ResultsStrength and Weakness of the paperFuture Directions

Page 53: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?

A BriefIntroduction

to 3DComputerVision

Presented byKaran

Daei-Mojdehi

3D ComputerVision

What is 3DComputerVision?

3D ComputerVisionApplications

A short historyof Approaches to3D VisionProblems

A newChallenge:Single ImageDepthEstimation ofGeneral Scenes

”LearningDepth fromSingleMonocularImages UsingDeepConvolutionalNeural Fields”

About the paper

ProblemStatement

ApproachOutline

ApproachDetails

ExperimentResults

Strength andWeakness of thepaper

FutureDirections

References

Conditional Random Field

first, super pixels are defined on small homogeneoussections of image

y is a vector of continuous depth for super pixels:

graphical model defined on y :

Z(x) is the partition function and E (y , x) is the energyfunction:

Unary Potential + Pairwise Potential

Page 54: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?

A BriefIntroduction

to 3DComputerVision

Presented byKaran

Daei-Mojdehi

3D ComputerVision

What is 3DComputerVision?

3D ComputerVisionApplications

A short historyof Approaches to3D VisionProblems

A newChallenge:Single ImageDepthEstimation ofGeneral Scenes

”LearningDepth fromSingleMonocularImages UsingDeepConvolutionalNeural Fields”

About the paper

ProblemStatement

ApproachOutline

ApproachDetails

ExperimentResults

Strength andWeakness of thepaper

FutureDirections

References

Conditional Random Field

first, super pixels are defined on small homogeneoussections of image

y is a vector of continuous depth for super pixels:

graphical model defined on y :

Z(x) is the partition function and E (y , x) is the energyfunction:

Unary Potential + Pairwise Potential

Page 55: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?

A BriefIntroduction

to 3DComputerVision

Presented byKaran

Daei-Mojdehi

3D ComputerVision

What is 3DComputerVision?

3D ComputerVisionApplications

A short historyof Approaches to3D VisionProblems

A newChallenge:Single ImageDepthEstimation ofGeneral Scenes

”LearningDepth fromSingleMonocularImages UsingDeepConvolutionalNeural Fields”

About the paper

ProblemStatement

ApproachOutline

ApproachDetails

ExperimentResults

Strength andWeakness of thepaper

FutureDirections

References

Conditional Random Field

first, super pixels are defined on small homogeneoussections of image

y is a vector of continuous depth for super pixels:

graphical model defined on y :

Z(x) is the partition function and E (y , x) is the energyfunction:

Unary Potential + Pairwise Potential

Page 56: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?

A BriefIntroduction

to 3DComputerVision

Presented byKaran

Daei-Mojdehi

3D ComputerVision

What is 3DComputerVision?

3D ComputerVisionApplications

A short historyof Approaches to3D VisionProblems

A newChallenge:Single ImageDepthEstimation ofGeneral Scenes

”LearningDepth fromSingleMonocularImages UsingDeepConvolutionalNeural Fields”

About the paper

ProblemStatement

ApproachOutline

ApproachDetails

ExperimentResults

Strength andWeakness of thepaper

FutureDirections

References

CRF Unary Potential

where zp(θ) is the scalar output of last fully connectedlayer of CNN similar to VGG net

each super pixel is resized to 224x224 pixels and fed tothis network

Figure: Network architecture of Deep CNN for unary term

Page 57: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?

A BriefIntroduction

to 3DComputerVision

Presented byKaran

Daei-Mojdehi

3D ComputerVision

What is 3DComputerVision?

3D ComputerVisionApplications

A short historyof Approaches to3D VisionProblems

A newChallenge:Single ImageDepthEstimation ofGeneral Scenes

”LearningDepth fromSingleMonocularImages UsingDeepConvolutionalNeural Fields”

About the paper

ProblemStatement

ApproachOutline

ApproachDetails

ExperimentResults

Strength andWeakness of thepaper

FutureDirections

References

CRF Pairwise Potential

Adjacent similar pixels will have close depth

Pairwise potential enforces smoothness amongneighbouring similar pixels

Rpq is the output of network in pairwise part:

Skpqs are different similarity matrices between adjacent

superpixels

Page 58: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?

A BriefIntroduction

to 3DComputerVision

Presented byKaran

Daei-Mojdehi

3D ComputerVision

What is 3DComputerVision?

3D ComputerVisionApplications

A short historyof Approaches to3D VisionProblems

A newChallenge:Single ImageDepthEstimation ofGeneral Scenes

”LearningDepth fromSingleMonocularImages UsingDeepConvolutionalNeural Fields”

About the paper

ProblemStatement

ApproachOutline

ApproachDetails

ExperimentResults

Strength andWeakness of thepaper

FutureDirections

References

CRF Pairwise Potential

Adjacent similar pixels will have close depth

Pairwise potential enforces smoothness amongneighbouring similar pixels

Rpq is the output of network in pairwise part:

Skpqs are different similarity matrices between adjacent

superpixels

Page 59: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?

A BriefIntroduction

to 3DComputerVision

Presented byKaran

Daei-Mojdehi

3D ComputerVision

What is 3DComputerVision?

3D ComputerVisionApplications

A short historyof Approaches to3D VisionProblems

A newChallenge:Single ImageDepthEstimation ofGeneral Scenes

”LearningDepth fromSingleMonocularImages UsingDeepConvolutionalNeural Fields”

About the paper

ProblemStatement

ApproachOutline

ApproachDetails

ExperimentResults

Strength andWeakness of thepaper

FutureDirections

References

CRF Pairwise Potential

Adjacent similar pixels will have close depth

Pairwise potential enforces smoothness amongneighbouring similar pixels

Rpq is the output of network in pairwise part:

Skpqs are different similarity matrices between adjacent

superpixels

Page 60: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?

A BriefIntroduction

to 3DComputerVision

Presented byKaran

Daei-Mojdehi

3D ComputerVision

What is 3DComputerVision?

3D ComputerVisionApplications

A short historyof Approaches to3D VisionProblems

A newChallenge:Single ImageDepthEstimation ofGeneral Scenes

”LearningDepth fromSingleMonocularImages UsingDeepConvolutionalNeural Fields”

About the paper

ProblemStatement

ApproachOutline

ApproachDetails

ExperimentResults

Strength andWeakness of thepaper

FutureDirections

References

Training the whole pipeline

We train with objective of solving MAP inference of depth

in CRF:

Closed-form solution derived in the paper

Stochastic Gradient Descent used to back propagate theerror and train the networks parameters

Page 61: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?

A BriefIntroduction

to 3DComputerVision

Presented byKaran

Daei-Mojdehi

3D ComputerVision

What is 3DComputerVision?

3D ComputerVisionApplications

A short historyof Approaches to3D VisionProblems

A newChallenge:Single ImageDepthEstimation ofGeneral Scenes

”LearningDepth fromSingleMonocularImages UsingDeepConvolutionalNeural Fields”

About the paper

ProblemStatement

ApproachOutline

ApproachDetails

ExperimentResults

Strength andWeakness of thepaper

FutureDirections

References

Training the whole pipeline

We train with objective of solving MAP inference of depth

in CRF:

Closed-form solution derived in the paper

Stochastic Gradient Descent used to back propagate theerror and train the networks parameters

Page 62: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?

A BriefIntroduction

to 3DComputerVision

Presented byKaran

Daei-Mojdehi

3D ComputerVision

What is 3DComputerVision?

3D ComputerVisionApplications

A short historyof Approaches to3D VisionProblems

A newChallenge:Single ImageDepthEstimation ofGeneral Scenes

”LearningDepth fromSingleMonocularImages UsingDeepConvolutionalNeural Fields”

About the paper

ProblemStatement

ApproachOutline

ApproachDetails

ExperimentResults

Strength andWeakness of thepaper

FutureDirections

References

Outline

1 3D Computer VisionWhat is 3D Computer Vision?3D Computer Vision ApplicationsA short history of Approaches to 3D Vision ProblemsA new Challenge: Single Image Depth Estimation ofGeneral Scenes

2 ”Learning Depth from Single Monocular Images Using DeepConvolutional Neural Fields”

About the paperProblem StatementApproach OutlineApproach DetailsExperiment ResultsStrength and Weakness of the paperFuture Directions

Page 63: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?

A BriefIntroduction

to 3DComputerVision

Presented byKaran

Daei-Mojdehi

3D ComputerVision

What is 3DComputerVision?

3D ComputerVisionApplications

A short historyof Approaches to3D VisionProblems

A newChallenge:Single ImageDepthEstimation ofGeneral Scenes

”LearningDepth fromSingleMonocularImages UsingDeepConvolutionalNeural Fields”

About the paper

ProblemStatement

ApproachOutline

ApproachDetails

ExperimentResults

Strength andWeakness of thepaper

FutureDirections

References

Available Datasets for Evaluating Depth Estimation

KITI Data set

videos taken form a driving vehicledepths captured by a LiDar sensor700 train and 697 test images from 28 scenes (extractedby Eigen[2] from videos

Make3D Data set

Aligned depthmaps from laser range sensors400 train and 134 test images

NYU Depth v2 Data set

comprised of video sequences from indoor scenes byrecorded by kinect1449 aligned RGB-D images (with densely labeledsegments)407,024 raw unlabeled RGB-D frames

Page 64: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?

A BriefIntroduction

to 3DComputerVision

Presented byKaran

Daei-Mojdehi

3D ComputerVision

What is 3DComputerVision?

3D ComputerVisionApplications

A short historyof Approaches to3D VisionProblems

A newChallenge:Single ImageDepthEstimation ofGeneral Scenes

”LearningDepth fromSingleMonocularImages UsingDeepConvolutionalNeural Fields”

About the paper

ProblemStatement

ApproachOutline

ApproachDetails

ExperimentResults

Strength andWeakness of thepaper

FutureDirections

References

Evaluation Metrics

These metrics are currently common for evaluating depthestimation results:

Page 65: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?

A BriefIntroduction

to 3DComputerVision

Presented byKaran

Daei-Mojdehi

3D ComputerVision

What is 3DComputerVision?

3D ComputerVisionApplications

A short historyof Approaches to3D VisionProblems

A newChallenge:Single ImageDepthEstimation ofGeneral Scenes

”LearningDepth fromSingleMonocularImages UsingDeepConvolutionalNeural Fields”

About the paper

ProblemStatement

ApproachOutline

ApproachDetails

ExperimentResults

Strength andWeakness of thepaper

FutureDirections

References

Experiment Results

Results on NYU v2 Depth dataset:

Results on KITTI dataset:

Page 66: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?

A BriefIntroduction

to 3DComputerVision

Presented byKaran

Daei-Mojdehi

3D ComputerVision

What is 3DComputerVision?

3D ComputerVisionApplications

A short historyof Approaches to3D VisionProblems

A newChallenge:Single ImageDepthEstimation ofGeneral Scenes

”LearningDepth fromSingleMonocularImages UsingDeepConvolutionalNeural Fields”

About the paper

ProblemStatement

ApproachOutline

ApproachDetails

ExperimentResults

Strength andWeakness of thepaper

FutureDirections

References

Experiment Results

Results on Make3d dataset:

Page 67: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?

A BriefIntroduction

to 3DComputerVision

Presented byKaran

Daei-Mojdehi

3D ComputerVision

What is 3DComputerVision?

3D ComputerVisionApplications

A short historyof Approaches to3D VisionProblems

A newChallenge:Single ImageDepthEstimation ofGeneral Scenes

”LearningDepth fromSingleMonocularImages UsingDeepConvolutionalNeural Fields”

About the paper

ProblemStatement

ApproachOutline

ApproachDetails

ExperimentResults

Strength andWeakness of thepaper

FutureDirections

References

Sample Results from Model

Figure: Orignial image Predicted Depth Map

Page 68: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?

A BriefIntroduction

to 3DComputerVision

Presented byKaran

Daei-Mojdehi

3D ComputerVision

What is 3DComputerVision?

3D ComputerVisionApplications

A short historyof Approaches to3D VisionProblems

A newChallenge:Single ImageDepthEstimation ofGeneral Scenes

”LearningDepth fromSingleMonocularImages UsingDeepConvolutionalNeural Fields”

About the paper

ProblemStatement

ApproachOutline

ApproachDetails

ExperimentResults

Strength andWeakness of thepaper

FutureDirections

References

Sample Results from Model

Figure: Orignial image Predicted Depth Map

Page 69: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?

A BriefIntroduction

to 3DComputerVision

Presented byKaran

Daei-Mojdehi

3D ComputerVision

What is 3DComputerVision?

3D ComputerVisionApplications

A short historyof Approaches to3D VisionProblems

A newChallenge:Single ImageDepthEstimation ofGeneral Scenes

”LearningDepth fromSingleMonocularImages UsingDeepConvolutionalNeural Fields”

About the paper

ProblemStatement

ApproachOutline

ApproachDetails

ExperimentResults

Strength andWeakness of thepaper

FutureDirections

References

Outline

1 3D Computer VisionWhat is 3D Computer Vision?3D Computer Vision ApplicationsA short history of Approaches to 3D Vision ProblemsA new Challenge: Single Image Depth Estimation ofGeneral Scenes

2 ”Learning Depth from Single Monocular Images Using DeepConvolutional Neural Fields”

About the paperProblem StatementApproach OutlineApproach DetailsExperiment ResultsStrength and Weakness of the paperFuture Directions

Page 70: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?

A BriefIntroduction

to 3DComputerVision

Presented byKaran

Daei-Mojdehi

3D ComputerVision

What is 3DComputerVision?

3D ComputerVisionApplications

A short historyof Approaches to3D VisionProblems

A newChallenge:Single ImageDepthEstimation ofGeneral Scenes

”LearningDepth fromSingleMonocularImages UsingDeepConvolutionalNeural Fields”

About the paper

ProblemStatement

ApproachOutline

ApproachDetails

ExperimentResults

Strength andWeakness of thepaper

FutureDirections

References

Strength and Weakness of the paper

Main Strength of the paper in the joint training frameworkthat is made possible by finding a closed form solution toCRF MAP inference of depth

weakness of paper is that it is not using any geometriccues of depth (e.g. vanishing point)

Page 71: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?

A BriefIntroduction

to 3DComputerVision

Presented byKaran

Daei-Mojdehi

3D ComputerVision

What is 3DComputerVision?

3D ComputerVisionApplications

A short historyof Approaches to3D VisionProblems

A newChallenge:Single ImageDepthEstimation ofGeneral Scenes

”LearningDepth fromSingleMonocularImages UsingDeepConvolutionalNeural Fields”

About the paper

ProblemStatement

ApproachOutline

ApproachDetails

ExperimentResults

Strength andWeakness of thepaper

FutureDirections

References

Outline

1 3D Computer VisionWhat is 3D Computer Vision?3D Computer Vision ApplicationsA short history of Approaches to 3D Vision ProblemsA new Challenge: Single Image Depth Estimation ofGeneral Scenes

2 ”Learning Depth from Single Monocular Images Using DeepConvolutional Neural Fields”

About the paperProblem StatementApproach OutlineApproach DetailsExperiment ResultsStrength and Weakness of the paperFuture Directions

Page 72: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?

A BriefIntroduction

to 3DComputerVision

Presented byKaran

Daei-Mojdehi

3D ComputerVision

What is 3DComputerVision?

3D ComputerVisionApplications

A short historyof Approaches to3D VisionProblems

A newChallenge:Single ImageDepthEstimation ofGeneral Scenes

”LearningDepth fromSingleMonocularImages UsingDeepConvolutionalNeural Fields”

About the paper

ProblemStatement

ApproachOutline

ApproachDetails

ExperimentResults

Strength andWeakness of thepaper

FutureDirections

References

Future Directions

Creating a hybrid model that can handle partly labeleddata

Applying structure learning of the graphical model to thismodel

Enhance resolution of output from Deep CNN (like the[FlowNet] paper we saw earlier

Page 73: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?

A BriefIntroduction

to 3DComputerVision

Presented byKaran

Daei-Mojdehi

3D ComputerVision

What is 3DComputerVision?

3D ComputerVisionApplications

A short historyof Approaches to3D VisionProblems

A newChallenge:Single ImageDepthEstimation ofGeneral Scenes

”LearningDepth fromSingleMonocularImages UsingDeepConvolutionalNeural Fields”

About the paper

ProblemStatement

ApproachOutline

ApproachDetails

ExperimentResults

Strength andWeakness of thepaper

FutureDirections

References

Questions

Questions?

Thank you for your time

Page 74: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?

A BriefIntroduction

to 3DComputerVision

Presented byKaran

Daei-Mojdehi

3D ComputerVision

What is 3DComputerVision?

3D ComputerVisionApplications

A short historyof Approaches to3D VisionProblems

A newChallenge:Single ImageDepthEstimation ofGeneral Scenes

”LearningDepth fromSingleMonocularImages UsingDeepConvolutionalNeural Fields”

About the paper

ProblemStatement

ApproachOutline

ApproachDetails

ExperimentResults

Strength andWeakness of thepaper

FutureDirections

References

Questions

Questions?

Thank you for your time

Page 75: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?

A BriefIntroduction

to 3DComputerVision

Presented byKaran

Daei-Mojdehi

3D ComputerVision

What is 3DComputerVision?

3D ComputerVisionApplications

A short historyof Approaches to3D VisionProblems

A newChallenge:Single ImageDepthEstimation ofGeneral Scenes

”LearningDepth fromSingleMonocularImages UsingDeepConvolutionalNeural Fields”

About the paper

ProblemStatement

ApproachOutline

ApproachDetails

ExperimentResults

Strength andWeakness of thepaper

FutureDirections

References

Volker Blanz and Thomas Vetter. “A morphable modelfor the synthesis of 3D faces”. In: Proceedings of the26th annual conference on Computer graphics andinteractive techniques. ACM Press/Addison-WesleyPublishing Co. 1999, pp. 187–194.

David Eigen, Christian Puhrsch, and Rob Fergus. “Depthmap prediction from a single image using a multi-scaledeep network”. In: Advances in neural informationprocessing systems. 2014, pp. 2366–2374.

Berthold KP Horn. “Shape from shading: A method forobtaining the shape of a smooth opaque object from oneview”. In: (1970).

Fayao Liu, Chunhua Shen, and Guosheng Lin. “Deepconvolutional neural fields for depth estimation from asingle image”. In: Proceedings of the IEEE Conference on

Page 76: CAP 6412 Advanced Computer Vision - …bgong/CAP6412/lec22.pdfA Brief Introduction to 3D Computer Vision Presented by Karan Daei-Mojdehi 3D Computer Vision What is 3D Computer Vision?

A BriefIntroduction

to 3DComputerVision

Presented byKaran

Daei-Mojdehi

3D ComputerVision

What is 3DComputerVision?

3D ComputerVisionApplications

A short historyof Approaches to3D VisionProblems

A newChallenge:Single ImageDepthEstimation ofGeneral Scenes

”LearningDepth fromSingleMonocularImages UsingDeepConvolutionalNeural Fields”

About the paper

ProblemStatement

ApproachOutline

ApproachDetails

ExperimentResults

Strength andWeakness of thepaper

FutureDirections

References

Computer Vision and Pattern Recognition. 2015,pp. 5162–5170.

Noah Snavely, Steven M Seitz, and Richard Szeliski.“Photo tourism: exploring photo collections in 3D”. In:ACM transactions on graphics (TOG). Vol. 25. 3. ACM.2006, pp. 835–846.

M. Watanabe, S.K. Nayar, and M. Noguchi. “Real-TimeComputation of Depth from Defocus”. In: Proceedings ofThe International Society for Optical Engineering (SPIE).Vol. 2599. Jan. 1996, pp. 14–25.