vision in man and machine. stats 19 sem 2. 263057202. talk 2. alan l. yuille. ucla. dept. statistics...

38
Machine Machine . . STATS 19 SEM 2. 263057202. Talk STATS 19 SEM 2. 263057202. Talk 2. 2. Alan L. Yuille. UCLA. Dept. Statistics and Psychology. www.stat.ucla/~yuille

Post on 15-Jan-2016

217 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Vision in Man and Machine. STATS 19 SEM 2. 263057202. Talk 2. Alan L. Yuille. UCLA. Dept. Statistics and Psychology. yuille

Vision in Man and MachineVision in Man and Machine..STATS 19 SEM 2. 263057202. Talk 2.STATS 19 SEM 2. 263057202. Talk 2.

Alan L. Yuille.

UCLA. Dept. Statistics and Psychology.

www.stat.ucla/~yuille

Page 2: Vision in Man and Machine. STATS 19 SEM 2. 263057202. Talk 2. Alan L. Yuille. UCLA. Dept. Statistics and Psychology. yuille

The Purpose of Vision.The Purpose of Vision.

“To Know What is Where by Looking”. Aristotle. (384-322 BC).

Information Processing: receive a signal by light rays and decode its information.

Vision appears deceptively simple, but there is more to Vision than meets the Eye.

Page 3: Vision in Man and Machine. STATS 19 SEM 2. 263057202. Talk 2. Alan L. Yuille. UCLA. Dept. Statistics and Psychology. yuille

Ames RoomAmes Room

Page 4: Vision in Man and Machine. STATS 19 SEM 2. 263057202. Talk 2. Alan L. Yuille. UCLA. Dept. Statistics and Psychology. yuille

Perspective.Perspective.

Page 5: Vision in Man and Machine. STATS 19 SEM 2. 263057202. Talk 2. Alan L. Yuille. UCLA. Dept. Statistics and Psychology. yuille

Curved Lines?Curved Lines?

Page 6: Vision in Man and Machine. STATS 19 SEM 2. 263057202. Talk 2. Alan L. Yuille. UCLA. Dept. Statistics and Psychology. yuille

Brightness of Patterns: Adelson (MIT)Brightness of Patterns: Adelson (MIT)

Page 7: Vision in Man and Machine. STATS 19 SEM 2. 263057202. Talk 2. Alan L. Yuille. UCLA. Dept. Statistics and Psychology. yuille

Visual IllusionsVisual Illusions

The perception of brightness of a surface, or the length of a line, depends on context. Not on basic measurements like:the no. of photons that reach the eyeor the length of line in the image..

Page 8: Vision in Man and Machine. STATS 19 SEM 2. 263057202. Talk 2. Alan L. Yuille. UCLA. Dept. Statistics and Psychology. yuille

Perception as InferencePerception as Inference

Helmholtz. 1821-1894.“Perception as Unconscious Inference”.

Page 9: Vision in Man and Machine. STATS 19 SEM 2. 263057202. Talk 2. Alan L. Yuille. UCLA. Dept. Statistics and Psychology. yuille

Ball in a Box. (D. Kersten)Ball in a Box. (D. Kersten)

Page 10: Vision in Man and Machine. STATS 19 SEM 2. 263057202. Talk 2. Alan L. Yuille. UCLA. Dept. Statistics and Psychology. yuille

How Hard is Vision?How Hard is Vision?

The Human Brain devotes an enormous amount of resources to vision.

(I) Optic nerve is the biggest nerve in the body. (II) Roughly half of the neurons in the cortex are

involved in vision (van Essen). If intelligence is proportional to neural activity,

then vision requires more intelligence than mathematics or chess.

Page 11: Vision in Man and Machine. STATS 19 SEM 2. 263057202. Talk 2. Alan L. Yuille. UCLA. Dept. Statistics and Psychology. yuille

Vision and the BrainVision and the Brain

Page 12: Vision in Man and Machine. STATS 19 SEM 2. 263057202. Talk 2. Alan L. Yuille. UCLA. Dept. Statistics and Psychology. yuille

Half the Cortex does VisionHalf the Cortex does Vision

Page 13: Vision in Man and Machine. STATS 19 SEM 2. 263057202. Talk 2. Alan L. Yuille. UCLA. Dept. Statistics and Psychology. yuille

Vision and Artificial IntelligenceVision and Artificial Intelligence

The hardness of vision became clearer when

the Artificial Intelligence community tried to

design computer programs to do vision. ’60s.AI workers thought that vision was “low-

level” and easy. Prof. Marvin Minsky (pioneer of AI) asked

a student to solve vision as a summer project.

Page 14: Vision in Man and Machine. STATS 19 SEM 2. 263057202. Talk 2. Alan L. Yuille. UCLA. Dept. Statistics and Psychology. yuille

Chess and Face DetectionChess and Face Detection

Artificial Intelligence Community preferred Chess to Vision.

By the mid-90’s Chess programs could beat the world champion Kasparov.

But computers could not find faces in images.

Page 15: Vision in Man and Machine. STATS 19 SEM 2. 263057202. Talk 2. Alan L. Yuille. UCLA. Dept. Statistics and Psychology. yuille

Man and Machine.Man and Machine.

David Marr (1945-1980) Three Levels of explanation:

1. Computation Level/Information Processing

2. Algorithmic Level

3. Hardware: Neurons versus silicon chips.

Claim: Man and Machine are similar at Level 1.

Page 16: Vision in Man and Machine. STATS 19 SEM 2. 263057202. Talk 2. Alan L. Yuille. UCLA. Dept. Statistics and Psychology. yuille

Vision: Decoding ImagesVision: Decoding Images

Page 17: Vision in Man and Machine. STATS 19 SEM 2. 263057202. Talk 2. Alan L. Yuille. UCLA. Dept. Statistics and Psychology. yuille

Vision as Probabilistic Inference Vision as Probabilistic Inference

Represent the World by S.Represent the Image by I.Goal: decode I and infer S.Model image formation by likelihood

function, generative model, P(I|S)Model our knowledge of the world by a

prior P(S).

Page 18: Vision in Man and Machine. STATS 19 SEM 2. 263057202. Talk 2. Alan L. Yuille. UCLA. Dept. Statistics and Psychology. yuille

Bayes TheoremBayes Theorem

Then Bayes’ Theorem states we show infer the world S from I by

P(S|I) = P(I|S)P(S)/P(I).Rev. T. Bayes. 1702-1761

Page 19: Vision in Man and Machine. STATS 19 SEM 2. 263057202. Talk 2. Alan L. Yuille. UCLA. Dept. Statistics and Psychology. yuille

Bayes to Infer S from IBayes to Infer S from I

P(I|S) likelihood function . P(S) prior.

.

Page 20: Vision in Man and Machine. STATS 19 SEM 2. 263057202. Talk 2. Alan L. Yuille. UCLA. Dept. Statistics and Psychology. yuille

Technically very interdisciplinaryTechnically very interdisciplinary

But applying Bayes is not straightforward.A beautiful theory is being developed

adapting techniques from Computer Science, Engineering, Mathematics, Physics, and Statistics.

E.G. Probabilistic Reasoning (Pearl CS),

Level Sets (Osher Maths).

Page 21: Vision in Man and Machine. STATS 19 SEM 2. 263057202. Talk 2. Alan L. Yuille. UCLA. Dept. Statistics and Psychology. yuille

ExamplesExamples

Generative Models Visual Inference:

(1) Estimating Shape.

(2) Segmenting Images.

(3) Detecting Faces.

(4) Detecting and Reading Text.

Page 22: Vision in Man and Machine. STATS 19 SEM 2. 263057202. Talk 2. Alan L. Yuille. UCLA. Dept. Statistics and Psychology. yuille

Generative ModelsGenerative Models

Learn Generative Models from a fewimages and then generate new images.

Page 23: Vision in Man and Machine. STATS 19 SEM 2. 263057202. Talk 2. Alan L. Yuille. UCLA. Dept. Statistics and Psychology. yuille

Uses of Generative ModelsUses of Generative Models

Univ. Oxford

Page 24: Vision in Man and Machine. STATS 19 SEM 2. 263057202. Talk 2. Alan L. Yuille. UCLA. Dept. Statistics and Psychology. yuille

Shape Inference: (Zhu Lab)Shape Inference: (Zhu Lab)

Page 25: Vision in Man and Machine. STATS 19 SEM 2. 263057202. Talk 2. Alan L. Yuille. UCLA. Dept. Statistics and Psychology. yuille

Shape and Photometry ( Soatto Lab)Shape and Photometry ( Soatto Lab)

– Estimate geometry (shape) and photometry from multiple images.

Jin-Soatto-Yezzi

Page 26: Vision in Man and Machine. STATS 19 SEM 2. 263057202. Talk 2. Alan L. Yuille. UCLA. Dept. Statistics and Psychology. yuille

Compare ground truth (Soatto Lab)Compare ground truth (Soatto Lab)

Jin-Soatto-Yezzi 11/1/02

Estimated shapeEstimated shape

Alternative algorithmAlternative algorithm

Ground truthGround truth

Page 27: Vision in Man and Machine. STATS 19 SEM 2. 263057202. Talk 2. Alan L. Yuille. UCLA. Dept. Statistics and Psychology. yuille

Generated Image:synthesized from novelviewpoint and illumination.

Jin-Soatto-Yezzi 11/1/02

Ground Truth:

same lighting and viewpoint

Compare w. ground truth (Soatto Lab)Compare w. ground truth (Soatto Lab)

Page 28: Vision in Man and Machine. STATS 19 SEM 2. 263057202. Talk 2. Alan L. Yuille. UCLA. Dept. Statistics and Psychology. yuille

Segmentation (Level Sets)Segmentation (Level Sets)

Page 29: Vision in Man and Machine. STATS 19 SEM 2. 263057202. Talk 2. Alan L. Yuille. UCLA. Dept. Statistics and Psychology. yuille

Segmentation (Level Sets)Segmentation (Level Sets)

Page 30: Vision in Man and Machine. STATS 19 SEM 2. 263057202. Talk 2. Alan L. Yuille. UCLA. Dept. Statistics and Psychology. yuille

Segmenting Images (Zhu Lab)Segmenting Images (Zhu Lab)

Characterize the set of image patterns that

occur in natural images. Provide mathematical models. P(I|S) and P(S).

Page 31: Vision in Man and Machine. STATS 19 SEM 2. 263057202. Talk 2. Alan L. Yuille. UCLA. Dept. Statistics and Psychology. yuille
Page 32: Vision in Man and Machine. STATS 19 SEM 2. 263057202. Talk 2. Alan L. Yuille. UCLA. Dept. Statistics and Psychology. yuille
Page 33: Vision in Man and Machine. STATS 19 SEM 2. 263057202. Talk 2. Alan L. Yuille. UCLA. Dept. Statistics and Psychology. yuille

Face and Text Detection.Face and Text Detection.

Page 34: Vision in Man and Machine. STATS 19 SEM 2. 263057202. Talk 2. Alan L. Yuille. UCLA. Dept. Statistics and Psychology. yuille

Back to the BrainBack to the Brain

Top-Level; compare human performance to

Ideal Observers.

Explain human perceptual biases (visual

illusions) as strategies that are “statistical

effective”.

Page 35: Vision in Man and Machine. STATS 19 SEM 2. 263057202. Talk 2. Alan L. Yuille. UCLA. Dept. Statistics and Psychology. yuille

Brain Architecture Brain Architecture

The Bayesian models have interesting

analogies to the brain. Generative Models require top-down

processing

Page 36: Vision in Man and Machine. STATS 19 SEM 2. 263057202. Talk 2. Alan L. Yuille. UCLA. Dept. Statistics and Psychology. yuille

High-Level Tells Low-Level to High-Level Tells Low-Level to Shut Up (Kersten Lab)Shut Up (Kersten Lab)

Page 37: Vision in Man and Machine. STATS 19 SEM 2. 263057202. Talk 2. Alan L. Yuille. UCLA. Dept. Statistics and Psychology. yuille

High-Level Tells Low-Level to High-Level Tells Low-Level to Shut up (Kersten Lab)Shut up (Kersten Lab)

Page 38: Vision in Man and Machine. STATS 19 SEM 2. 263057202. Talk 2. Alan L. Yuille. UCLA. Dept. Statistics and Psychology. yuille

ConclusionConclusion

Vision is unconscious inference. Theory of Vision for Man and Machine.

See more about Vision at UCLA in the Vision and Image Science Collective

http://visciences.ucla.edu