perception - carnegie mellon school of computer...

62
Perception Introduction to HRI Simmons & Nourbakhsh Spring 2015

Upload: trinhdan

Post on 29-Mar-2018

215 views

Category:

Documents


2 download

TRANSCRIPT

Page 1: Perception - Carnegie Mellon School of Computer Sciencereids/16-467/handouts/PerceptionSlideSet2015.pdf · Social Perception •What features do ... •Cultural modeling, expert interview,

Perception

Introduction to HRI

Simmons & Nourbakhsh

Spring 2015

Page 2: Perception - Carnegie Mellon School of Computer Sciencereids/16-467/handouts/PerceptionSlideSet2015.pdf · Social Perception •What features do ... •Cultural modeling, expert interview,

Perception – my goals

• What is the state of the art boundary?

• Where might we be in 5-10 years?

Page 3: Perception - Carnegie Mellon School of Computer Sciencereids/16-467/handouts/PerceptionSlideSet2015.pdf · Social Perception •What features do ... •Cultural modeling, expert interview,

The Perceptual Pipeline

• The classical approach: a serial pipeline

• Weak link analysis: each step depends on predecessors

Page 4: Perception - Carnegie Mellon School of Computer Sciencereids/16-467/handouts/PerceptionSlideSet2015.pdf · Social Perception •What features do ... •Cultural modeling, expert interview,

Social Perception

• What features do we perceive for sociality?

• Is social perception a serial pipeline?

Page 5: Perception - Carnegie Mellon School of Computer Sciencereids/16-467/handouts/PerceptionSlideSet2015.pdf · Social Perception •What features do ... •Cultural modeling, expert interview,

1. HRI for Human Perceptual Shifting

Page 6: Perception - Carnegie Mellon School of Computer Sciencereids/16-467/handouts/PerceptionSlideSet2015.pdf · Social Perception •What features do ... •Cultural modeling, expert interview,

Insect Telepresence

Educational telepresence designed using formal HCI inquiry tools.

Page 7: Perception - Carnegie Mellon School of Computer Sciencereids/16-467/handouts/PerceptionSlideSet2015.pdf · Social Perception •What features do ... •Cultural modeling, expert interview,

Insect Telepresence Robot

• Problem • Increase visitors’ engagement with and appreciation of

insects in a museum terrarium at CMNH.

• Approach • Provide a scalar telepresence experience with

insect-safe visual browsing • Apply HCI techniques to design and evaluate the

input device and system • Cultural modeling, expert interview, baseline observation

• Measure engagement indirectly by ‘time on task’ • Partner with HCII, CMNH

Page 8: Perception - Carnegie Mellon School of Computer Sciencereids/16-467/handouts/PerceptionSlideSet2015.pdf · Social Perception •What features do ... •Cultural modeling, expert interview,

Insect Telepresence Robot

• Innovations

• Asymmetric exhibit layout

• Mechanical transparency

• Clutched gantry lever arm

• FOV-relative 3 DOF joystick

Page 9: Perception - Carnegie Mellon School of Computer Sciencereids/16-467/handouts/PerceptionSlideSet2015.pdf · Social Perception •What features do ... •Cultural modeling, expert interview,

Insect Telepresence Robot

Page 10: Perception - Carnegie Mellon School of Computer Sciencereids/16-467/handouts/PerceptionSlideSet2015.pdf · Social Perception •What features do ... •Cultural modeling, expert interview,

Illah Nourbakhsh | CMU Robotics Institute | HRI Summer Course

Insect Telepresence Robot

• Evaluation Results: • Average group size: 3

• Average age of users: 19.5 years

• Three age modes: 8 years, 10 years, and 35 years

• Average time on task of all users: 60 seconds

• Average time on task of a single user: 27 seconds

• Average time on task for user groups: 93 seconds

Page 11: Perception - Carnegie Mellon School of Computer Sciencereids/16-467/handouts/PerceptionSlideSet2015.pdf · Social Perception •What features do ... •Cultural modeling, expert interview,

2. Vision Sensors

Page 12: Perception - Carnegie Mellon School of Computer Sciencereids/16-467/handouts/PerceptionSlideSet2015.pdf · Social Perception •What features do ... •Cultural modeling, expert interview,

The CCD (Charged Couple Device) • - Exotic timing circuitry required

• - Uneven frequency response in electron wells

• - Color separation: filters versus splitting

• - Lossy data formats: NTSC and “digital video” • > Credit: http://www.shortcourses.com/how/sensors/ccd_readout.gif

Page 13: Perception - Carnegie Mellon School of Computer Sciencereids/16-467/handouts/PerceptionSlideSet2015.pdf · Social Perception •What features do ... •Cultural modeling, expert interview,

The CMOS (Complementary Metal Oxide Semiconductor)

• - Standard chip fabrication techniques

• - Far lower power consumption overall (1:100)

• - Pixel/well measurement circuitry at along pixel

• - Real estate problems ; efficiency of photon usage

Page 14: Perception - Carnegie Mellon School of Computer Sciencereids/16-467/handouts/PerceptionSlideSet2015.pdf · Social Perception •What features do ... •Cultural modeling, expert interview,

Human Vision

High quality sensors • color depth, dynamic range, light sensitivity, etc.

Massive information fusion • parallelism

• context-based reasoning • active foveation and selective attention • selective sensor fusion over space, capability and time • tuned feedback from interpretation to first computation • elegant and gradual failure characteristics

Page 15: Perception - Carnegie Mellon School of Computer Sciencereids/16-467/handouts/PerceptionSlideSet2015.pdf · Social Perception •What features do ... •Cultural modeling, expert interview,

3. Machine Vision

Poor-performance sensors • 8/24 bits of color, little dynamic range, inaccuracy and warp,

inconstant properties

Narrow, shallow, fragile • serial information processing

• information context typically as assumptions that violate • little sensor fusion across type • little sensor feedback loops across levels of interpretation • very little temporal filtering and interpretation

Page 17: Perception - Carnegie Mellon School of Computer Sciencereids/16-467/handouts/PerceptionSlideSet2015.pdf · Social Perception •What features do ... •Cultural modeling, expert interview,

Origins: The Stanford Cart

Page 19: Perception - Carnegie Mellon School of Computer Sciencereids/16-467/handouts/PerceptionSlideSet2015.pdf · Social Perception •What features do ... •Cultural modeling, expert interview,

Passive versus Active Tradeoff

The Passive/Active Design Question

• Sufficiency of natural contrast

• Interference between multiple robots

• System works in the dark

• System works in bright sunlight

Page 20: Perception - Carnegie Mellon School of Computer Sciencereids/16-467/handouts/PerceptionSlideSet2015.pdf · Social Perception •What features do ... •Cultural modeling, expert interview,

Visual Ranging for Social Interaction

• Totally safe obstacle detection

• Human-body spatial interaction

• Arms and gesture recognition

• Human-designed environment engagement

Page 21: Perception - Carnegie Mellon School of Computer Sciencereids/16-467/handouts/PerceptionSlideSet2015.pdf · Social Perception •What features do ... •Cultural modeling, expert interview,

Vision-based Rangefinding

• Imaging chips collapse a 3D world onto a 2D plane

Range inference from world knowledge / logical reasoning

Range inference from camera parameters

Range inference from disparity / matching

Page 22: Perception - Carnegie Mellon School of Computer Sciencereids/16-467/handouts/PerceptionSlideSet2015.pdf · Social Perception •What features do ... •Cultural modeling, expert interview,

Depth from Defocus

• 1/f = 1/d + 1/e

Page 23: Perception - Carnegie Mellon School of Computer Sciencereids/16-467/handouts/PerceptionSlideSet2015.pdf · Social Perception •What features do ... •Cultural modeling, expert interview,

Depth from Defocus

Pinhole camera – no blurring

Blur circle sensitivity inversely

proportional to distance

To calculate distance we must

know focused image

Page 24: Perception - Carnegie Mellon School of Computer Sciencereids/16-467/handouts/PerceptionSlideSet2015.pdf · Social Perception •What features do ... •Cultural modeling, expert interview,

Depth from Defocus

Page 25: Perception - Carnegie Mellon School of Computer Sciencereids/16-467/handouts/PerceptionSlideSet2015.pdf · Social Perception •What features do ... •Cultural modeling, expert interview,

Depth from Disparity

Page 26: Perception - Carnegie Mellon School of Computer Sciencereids/16-467/handouts/PerceptionSlideSet2015.pdf · Social Perception •What features do ... •Cultural modeling, expert interview,

Depth from Disparity

• Distance is inversely proportional to disparity

• Disparity is proportional to baseline

• Large baselines offer a tradeoff across range

Page 27: Perception - Carnegie Mellon School of Computer Sciencereids/16-467/handouts/PerceptionSlideSet2015.pdf · Social Perception •What features do ... •Cultural modeling, expert interview,

The Feature Challenge

Features must: • provide sufficient density

• match across small viewpoint changes

• match across partial occlusions

• identify confidence

Features must not: • trigger false positive matches

• prove too sparse for the robot’s task

• require on-line human tuning

Page 28: Perception - Carnegie Mellon School of Computer Sciencereids/16-467/handouts/PerceptionSlideSet2015.pdf · Social Perception •What features do ... •Cultural modeling, expert interview,

Example: ZLoG • Zero crossings of Laplacian of Gaussian

• Laplacian: “second derivative convolution”

• Gaussian: “smoothing convolution”

• Zero crossings: a sharp feature for interpolation

Page 29: Perception - Carnegie Mellon School of Computer Sciencereids/16-467/handouts/PerceptionSlideSet2015.pdf · Social Perception •What features do ... •Cultural modeling, expert interview,

Stereo: Pictorial Example

Page 30: Perception - Carnegie Mellon School of Computer Sciencereids/16-467/handouts/PerceptionSlideSet2015.pdf · Social Perception •What features do ... •Cultural modeling, expert interview,

Active Rangefinding

Page 31: Perception - Carnegie Mellon School of Computer Sciencereids/16-467/handouts/PerceptionSlideSet2015.pdf · Social Perception •What features do ... •Cultural modeling, expert interview,
Page 32: Perception - Carnegie Mellon School of Computer Sciencereids/16-467/handouts/PerceptionSlideSet2015.pdf · Social Perception •What features do ... •Cultural modeling, expert interview,
Page 33: Perception - Carnegie Mellon School of Computer Sciencereids/16-467/handouts/PerceptionSlideSet2015.pdf · Social Perception •What features do ... •Cultural modeling, expert interview,

HRI Vision: the special-case approach

Page 34: Perception - Carnegie Mellon School of Computer Sciencereids/16-467/handouts/PerceptionSlideSet2015.pdf · Social Perception •What features do ... •Cultural modeling, expert interview,

Example: Cueing in Kismet

• Color-based human-robot interaction

• Cueing, orthogonal events, child-based interaction

• Challenges: constancy, illumination, human expectation

Page 35: Perception - Carnegie Mellon School of Computer Sciencereids/16-467/handouts/PerceptionSlideSet2015.pdf · Social Perception •What features do ... •Cultural modeling, expert interview,

Motivational example: RALPH

Page 36: Perception - Carnegie Mellon School of Computer Sciencereids/16-467/handouts/PerceptionSlideSet2015.pdf · Social Perception •What features do ... •Cultural modeling, expert interview,

Navlab on Streets

Page 37: Perception - Carnegie Mellon School of Computer Sciencereids/16-467/handouts/PerceptionSlideSet2015.pdf · Social Perception •What features do ... •Cultural modeling, expert interview,

• Carnegie Museum of Natural History

• Autonomy – 5 years, > 500 km navigated, auto-docking

– MTBF convergence at 1 week

– Proactive health state identification

Chips Museum Edubot - Chips

Page 38: Perception - Carnegie Mellon School of Computer Sciencereids/16-467/handouts/PerceptionSlideSet2015.pdf · Social Perception •What features do ... •Cultural modeling, expert interview,

Museum Edubot - Chips

Page 39: Perception - Carnegie Mellon School of Computer Sciencereids/16-467/handouts/PerceptionSlideSet2015.pdf · Social Perception •What features do ... •Cultural modeling, expert interview,

Landmarks: Visual Fiducials

Page 40: Perception - Carnegie Mellon School of Computer Sciencereids/16-467/handouts/PerceptionSlideSet2015.pdf · Social Perception •What features do ... •Cultural modeling, expert interview,

Minerva: an example of “focused vision”

Page 41: Perception - Carnegie Mellon School of Computer Sciencereids/16-467/handouts/PerceptionSlideSet2015.pdf · Social Perception •What features do ... •Cultural modeling, expert interview,

Minerva: an example of “focused vision”

Page 42: Perception - Carnegie Mellon School of Computer Sciencereids/16-467/handouts/PerceptionSlideSet2015.pdf · Social Perception •What features do ... •Cultural modeling, expert interview,

When special-case fails…

Page 43: Perception - Carnegie Mellon School of Computer Sciencereids/16-467/handouts/PerceptionSlideSet2015.pdf · Social Perception •What features do ... •Cultural modeling, expert interview,

SLAM

Page 44: Perception - Carnegie Mellon School of Computer Sciencereids/16-467/handouts/PerceptionSlideSet2015.pdf · Social Perception •What features do ... •Cultural modeling, expert interview,

Visual SLAM Considerations

• Repeatable “landmark” recognition

• Feature locale

• Map-making

• Tracking robot position

Page 45: Perception - Carnegie Mellon School of Computer Sciencereids/16-467/handouts/PerceptionSlideSet2015.pdf · Social Perception •What features do ... •Cultural modeling, expert interview,

The Future of Visual Navigation • Hans Moravec’s stereo-based voxel grid

Page 46: Perception - Carnegie Mellon School of Computer Sciencereids/16-467/handouts/PerceptionSlideSet2015.pdf · Social Perception •What features do ... •Cultural modeling, expert interview,

Invariant features

• Features: image contents coded so they can be found again on other images of same scene, …

• Invariant: …despite many changes: • rotation, translation • camera viewpoint: scale, perspective • illumination • noise • occlusion

Image matching by comparing invariant features

Notion of Interesting points and Keypoints

SIFT

Page 47: Perception - Carnegie Mellon School of Computer Sciencereids/16-467/handouts/PerceptionSlideSet2015.pdf · Social Perception •What features do ... •Cultural modeling, expert interview,

Gaussian pyramid

Increasing sigma Gaussian pyramid

Increase -> no need to retain all pixels Scale smoothing parameter

Stored image can be reduced in size

Page 48: Perception - Carnegie Mellon School of Computer Sciencereids/16-467/handouts/PerceptionSlideSet2015.pdf · Social Perception •What features do ... •Cultural modeling, expert interview,

1. Scale-space extrema detection

Gaussian Pyramid processed one octave at a time

Blurs

DoGs

Page 49: Perception - Carnegie Mellon School of Computer Sciencereids/16-467/handouts/PerceptionSlideSet2015.pdf · Social Perception •What features do ... •Cultural modeling, expert interview,

2. Keypoint localization

• Detect maxima and minima of difference-of-Gaussian in scale space

• Reject points lying on edges

• Fit a quadratic to surrounding values for sub-pixel and sub-scale interpolation

Page 50: Perception - Carnegie Mellon School of Computer Sciencereids/16-467/handouts/PerceptionSlideSet2015.pdf · Social Perception •What features do ... •Cultural modeling, expert interview,

4. SIFT vector formation

Thresholded image gradients are sampled over 16x16 array of

locations in scale space

Create array of orientation histograms

8 orientations x 4x4 histogram array = 128 dimensions

Page 51: Perception - Carnegie Mellon School of Computer Sciencereids/16-467/handouts/PerceptionSlideSet2015.pdf · Social Perception •What features do ... •Cultural modeling, expert interview,

Keypoints

( )( )local descriptor

Sampled regions located at interest points

Local invariant descriptors to scale and rotation

Local: robust to occlusion/clutter + no segmentation Invariant: to image transformations + illumination changes

Page 52: Perception - Carnegie Mellon School of Computer Sciencereids/16-467/handouts/PerceptionSlideSet2015.pdf · Social Perception •What features do ... •Cultural modeling, expert interview,

SIFT Features

• Very powerful method developed by David Lowe, Vancouver

• Image content is transformed into local feature coordinates that are invariant to translation, rotation, scale, and other imaging parameters

SIFT Features

Page 53: Perception - Carnegie Mellon School of Computer Sciencereids/16-467/handouts/PerceptionSlideSet2015.pdf · Social Perception •What features do ... •Cultural modeling, expert interview,

SIFT

Page 54: Perception - Carnegie Mellon School of Computer Sciencereids/16-467/handouts/PerceptionSlideSet2015.pdf · Social Perception •What features do ... •Cultural modeling, expert interview,

Example: K9 Science Rover

Page 55: Perception - Carnegie Mellon School of Computer Sciencereids/16-467/handouts/PerceptionSlideSet2015.pdf · Social Perception •What features do ... •Cultural modeling, expert interview,

Example: K9 Science Rover’s SIFT

Page 56: Perception - Carnegie Mellon School of Computer Sciencereids/16-467/handouts/PerceptionSlideSet2015.pdf · Social Perception •What features do ... •Cultural modeling, expert interview,

4. Social Vision State of Art

• Face detection, recognition

• Speech understanding

• Gesture understanding

Page 57: Perception - Carnegie Mellon School of Computer Sciencereids/16-467/handouts/PerceptionSlideSet2015.pdf · Social Perception •What features do ... •Cultural modeling, expert interview,

Face Detection

• How would you detect faces in images?

Page 58: Perception - Carnegie Mellon School of Computer Sciencereids/16-467/handouts/PerceptionSlideSet2015.pdf · Social Perception •What features do ... •Cultural modeling, expert interview,

Face Detection

• How would you detect faces in images?

Page 59: Perception - Carnegie Mellon School of Computer Sciencereids/16-467/handouts/PerceptionSlideSet2015.pdf · Social Perception •What features do ... •Cultural modeling, expert interview,

Face Detection

• How would you detect faces in images?

Page 60: Perception - Carnegie Mellon School of Computer Sciencereids/16-467/handouts/PerceptionSlideSet2015.pdf · Social Perception •What features do ... •Cultural modeling, expert interview,

Expression Detection

Page 61: Perception - Carnegie Mellon School of Computer Sciencereids/16-467/handouts/PerceptionSlideSet2015.pdf · Social Perception •What features do ... •Cultural modeling, expert interview,

First Person Vision

Page 62: Perception - Carnegie Mellon School of Computer Sciencereids/16-467/handouts/PerceptionSlideSet2015.pdf · Social Perception •What features do ... •Cultural modeling, expert interview,

Speech and Gesture Understanding

• Time for some fun:

– http://www.youtube.com/watch?v=1s-PiIbzbhw