eye-based interaction in graphical systems: theory & practice

66
Eye-Based Interaction in Graphical Eye-Based Interaction in Graphical Systems: Theory & Practice Systems: Theory & Practice Part I Part I Introduction to the Human Visual System Introduction to the Human Visual System

Upload: griffith-santana

Post on 30-Dec-2015

15 views

Category:

Documents


1 download

DESCRIPTION

Eye-Based Interaction in Graphical Systems: Theory & Practice. Part I Introduction to the Human Visual System. A: Visual Attention. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Eye-Based Interaction in Graphical Systems: Theory & Practice

Eye-Based Interaction in Graphical Systems: Eye-Based Interaction in Graphical Systems: Theory & PracticeTheory & Practice

Eye-Based Interaction in Graphical Systems: Eye-Based Interaction in Graphical Systems: Theory & PracticeTheory & Practice

Part IPart I

Introduction to the Human Visual SystemIntroduction to the Human Visual System

Page 2: Eye-Based Interaction in Graphical Systems: Theory & Practice

A: Visual AttentionA: Visual AttentionA: Visual AttentionA: Visual Attention

• Latin translation: Latin translation: “Many filtered into few “Many filtered into few for perception”for perception”

• Visual scene inspection is performed Visual scene inspection is performed minutatimminutatim (piecemeal), not (piecemeal), not in totoin toto

““When the things are apprehended by the When the things are apprehended by the sensessenses, , the number of them that can be attended to at the number of them that can be attended to at once is small, once is small, `Pluribus intentus, minor est ad `Pluribus intentus, minor est ad singula sensus' singula sensus' ””

— — William JamesWilliam James

Page 3: Eye-Based Interaction in Graphical Systems: Theory & Practice

A.1: Visual Attention—chronological A.1: Visual Attention—chronological reviewreviewA.1: Visual Attention—chronological A.1: Visual Attention—chronological reviewreview

• Qualitative historical background: a Qualitative historical background: a dichotomous theory of attention—the dichotomous theory of attention—the “what” and “where” of (visual) attention“what” and “where” of (visual) attention• Von Helmholtz (ca. 1900): mainly concerned with eye Von Helmholtz (ca. 1900): mainly concerned with eye

movements to spatial locations, the “where”, I.e., movements to spatial locations, the “where”, I.e., attention as overt mechanism (eye movements)attention as overt mechanism (eye movements)

• James (ca. 1900): defined attention mainly in terms of James (ca. 1900): defined attention mainly in terms of the “what”, i.e., attention as a more internally covert the “what”, i.e., attention as a more internally covert mechanismmechanism

Page 4: Eye-Based Interaction in Graphical Systems: Theory & Practice

A.1: Visual Attention—chronological A.1: Visual Attention—chronological review review (cont’d)(cont’d)

A.1: Visual Attention—chronological A.1: Visual Attention—chronological review review (cont’d)(cont’d)

• Broadbent (ca. 1950): defined attention as “selective Broadbent (ca. 1950): defined attention as “selective filter” from auditory experiments; generally agreeing filter” from auditory experiments; generally agreeing with Von Helmholtz’s “where”with Von Helmholtz’s “where”

• Deutsch and Deutsch (ca. 1960): rejected “selective Deutsch and Deutsch (ca. 1960): rejected “selective filter” in favor of “importance weightings”; generally filter” in favor of “importance weightings”; generally corresponding to James’ “what”corresponding to James’ “what”

• Treisman (ca. 1960): proposed unified theory of Treisman (ca. 1960): proposed unified theory of attention—attenuation filter (the “where”) followed by attention—attenuation filter (the “where”) followed by “dictionary units” (the “what”)“dictionary units” (the “what”)

Page 5: Eye-Based Interaction in Graphical Systems: Theory & Practice

A.1: Visual Attention—chronological A.1: Visual Attention—chronological review review (cont’d)(cont’d)

A.1: Visual Attention—chronological A.1: Visual Attention—chronological review review (cont’d)(cont’d)

• Main debate at this point: is attention Main debate at this point: is attention parallel (the “where”) or serial (the “what”) parallel (the “where”) or serial (the “what”) in nature?in nature?

• Gestalt view: recognition is a wholistic Gestalt view: recognition is a wholistic process (e.g., Kanizsa figure)process (e.g., Kanizsa figure)

• Theories advanced through early Theories advanced through early recordings of eye movementsrecordings of eye movements

Page 6: Eye-Based Interaction in Graphical Systems: Theory & Practice

A.1: Visual Attention—chronological A.1: Visual Attention—chronological review review (cont’d)(cont’d)

A.1: Visual Attention—chronological A.1: Visual Attention—chronological review review (cont’d)(cont’d)

• Yarbus (ca. 1967): demonstrated sequential, but Yarbus (ca. 1967): demonstrated sequential, but variable, viewing patterns over particular image variable, viewing patterns over particular image regions (akin to the “what”)regions (akin to the “what”)

• Noton and Stark (ca. 1970): showed that subjects Noton and Stark (ca. 1970): showed that subjects tend to fixate identifiable regions of interest, tend to fixate identifiable regions of interest, containing “informative details”; coined term containing “informative details”; coined term “scanpath” describing eye movement patterns“scanpath” describing eye movement patterns

• Scanpaths helped cast doubt on the Gestalt Scanpaths helped cast doubt on the Gestalt hypothesishypothesis

Page 7: Eye-Based Interaction in Graphical Systems: Theory & Practice

A.1: Visual Attention—chronological A.1: Visual Attention—chronological review review (cont’d)(cont’d)

A.1: Visual Attention—chronological A.1: Visual Attention—chronological review review (cont’d)(cont’d)

Fig.2: Yarbus’ early scanpath Fig.2: Yarbus’ early scanpath recording:recording:

• trace 1: examine at willtrace 1: examine at will

• trace 2: estimate wealthtrace 2: estimate wealth

• trace 3: estimate agestrace 3: estimate ages

• trace 4: guess previous activitytrace 4: guess previous activity

• trace 5: remember clothingtrace 5: remember clothing

• trace 6: remember positiontrace 6: remember position

• trace 7: time since last visittrace 7: time since last visit

Page 8: Eye-Based Interaction in Graphical Systems: Theory & Practice

A.1: Visual Attention—chronological A.1: Visual Attention—chronological review review (cont’d)(cont’d)

A.1: Visual Attention—chronological A.1: Visual Attention—chronological review review (cont’d)(cont’d)

• Posner (ca. 1980): proposed attentional “spotlight”, Posner (ca. 1980): proposed attentional “spotlight”, an overt mechanism independent from eye an overt mechanism independent from eye movements (akin to the “where”)movements (akin to the “where”)

• Treisman (ca. 1986): once again unified “what” and Treisman (ca. 1986): once again unified “what” and “where” dichotomy by proposing the Feature “where” dichotomy by proposing the Feature Integration Theory (FIT), describing attention as a Integration Theory (FIT), describing attention as a “glue” which integrates features at particular “glue” which integrates features at particular locations to allow wholistic perceptionlocations to allow wholistic perception

Page 9: Eye-Based Interaction in Graphical Systems: Theory & Practice

A.1: Visual Attention—chronological A.1: Visual Attention—chronological review review (cont’d)(cont’d)

A.1: Visual Attention—chronological A.1: Visual Attention—chronological review review (cont’d)(cont’d)

• Summary: the “what” and “where” Summary: the “what” and “where” dichotomy provides an intuitive sense of dichotomy provides an intuitive sense of attentional, foveo-peripheral visual attentional, foveo-peripheral visual mechanismmechanism

• Caution: the “what/where” account is Caution: the “what/where” account is probably overly simplistic and is but one probably overly simplistic and is but one theory of visual attentiontheory of visual attention

Page 10: Eye-Based Interaction in Graphical Systems: Theory & Practice

B: Neurological Substrate of the B: Neurological Substrate of the Human Visual System (HVS)Human Visual System (HVS)B: Neurological Substrate of the B: Neurological Substrate of the Human Visual System (HVS)Human Visual System (HVS)

• Any theory of visual attention must Any theory of visual attention must address the fundamental properties of address the fundamental properties of early visual mechanismsearly visual mechanisms

• Examination of the neurological substrate Examination of the neurological substrate provides evidence of limited information provides evidence of limited information capacity of the visual system—a capacity of the visual system—a physiological reason for an attentional physiological reason for an attentional mechanismmechanism

Page 11: Eye-Based Interaction in Graphical Systems: Theory & Practice

B.1: The EyeB.1: The EyeB.1: The EyeB.1: The Eye

Fig. 3: The eye—“the world’s Fig. 3: The eye—“the world’s worst camera”worst camera”

• suffers from numerous suffers from numerous optical imperfections...optical imperfections...

• ...endowed with several ...endowed with several compensatory compensatory mechanismsmechanisms

Page 12: Eye-Based Interaction in Graphical Systems: Theory & Practice

B.1: The Eye B.1: The Eye (cont’d)(cont’d)B.1: The Eye B.1: The Eye (cont’d)(cont’d)

Fig. 4: Ocular opticsFig. 4: Ocular optics

Page 13: Eye-Based Interaction in Graphical Systems: Theory & Practice

B.1: The Eye B.1: The Eye (cont’d)(cont’d)B.1: The Eye B.1: The Eye (cont’d)(cont’d)

• Imperfections:Imperfections:• spherical abberationsspherical abberations

• chromatic abberationschromatic abberations

• curvature of fieldcurvature of field

• Compensations:Compensations:• iris—acts as a stopiris—acts as a stop

• focal lens—sharp focusfocal lens—sharp focus

• curved retina—matches curved retina—matches curvature of fieldcurvature of field

Page 14: Eye-Based Interaction in Graphical Systems: Theory & Practice

B.2: The RetinaB.2: The RetinaB.2: The RetinaB.2: The Retina

• Retinal photoreceptors constitute first Retinal photoreceptors constitute first stage of visual perceptionstage of visual perception

• Photoreceptors Photoreceptors transducers converting transducers converting light energy to electrical impulses (neural light energy to electrical impulses (neural signals)signals)

• Photoreceptors are functionally classified Photoreceptors are functionally classified into two types: into two types: rodsrods and and conescones

Page 15: Eye-Based Interaction in Graphical Systems: Theory & Practice

B.2: The Retina—rods and conesB.2: The Retina—rods and conesB.2: The Retina—rods and conesB.2: The Retina—rods and cones

• Rods: sensitive to dim and achromatic Rods: sensitive to dim and achromatic light (night vision)light (night vision)

• Cones: respond to brighter, chromatic Cones: respond to brighter, chromatic light (day vision)light (day vision)

• Retinal construction: 120M rods, 7M cones Retinal construction: 120M rods, 7M cones arranged concentricallyarranged concentrically

Page 16: Eye-Based Interaction in Graphical Systems: Theory & Practice

B.2: The Retina—cellular makeupB.2: The Retina—cellular makeupB.2: The Retina—cellular makeupB.2: The Retina—cellular makeup

• The retina is composed of 3 main layers of The retina is composed of 3 main layers of different cell types (a 3-layer “sandwich”)different cell types (a 3-layer “sandwich”)

• Surprising fact: the retina is “inverted”— Surprising fact: the retina is “inverted”— photoreceptors are found in the bottom photoreceptors are found in the bottom layer (furthest away from incoming light)layer (furthest away from incoming light)

• Connection bundles between layers are Connection bundles between layers are called called plexiformplexiform or or synaptic layerssynaptic layers

Page 17: Eye-Based Interaction in Graphical Systems: Theory & Practice

B.2: The Retina—cellular makeup B.2: The Retina—cellular makeup (cont’d)(cont’d)B.2: The Retina—cellular makeup B.2: The Retina—cellular makeup (cont’d)(cont’d)

Fig.5: The retinocellular Fig.5: The retinocellular layers (w.r.t. incoming layers (w.r.t. incoming light):light):

• ganglion layerganglion layer

• inner synaptic inner synaptic plexiform layerplexiform layer

• inner nuclear layerinner nuclear layer

• outer synaptic outer synaptic plexiform layerplexiform layer

• outer layerouter layer

Page 18: Eye-Based Interaction in Graphical Systems: Theory & Practice

B.2: The Retina—cellular makeup B.2: The Retina—cellular makeup (cont’d)(cont’d)B.2: The Retina—cellular makeup B.2: The Retina—cellular makeup (cont’d)(cont’d)

Fig.5 (cont’d): The neuron:Fig.5 (cont’d): The neuron:

• all retinal cells are types all retinal cells are types of neuronsof neurons

• certain neurons mimic a certain neurons mimic a “digital gate”, firing when “digital gate”, firing when activation level exceeds a activation level exceeds a thresholdthreshold

• rods and cones are rods and cones are specific types of specific types of dendritesdendrites

Page 19: Eye-Based Interaction in Graphical Systems: Theory & Practice

B.2: The Retina—retinogeniculate B.2: The Retina—retinogeniculate organization organization (from outside in, w.r.t. cortex)(from outside in, w.r.t. cortex)

B.2: The Retina—retinogeniculate B.2: The Retina—retinogeniculate organization organization (from outside in, w.r.t. cortex)(from outside in, w.r.t. cortex)

• Outer layerOuter layer: rods and cones: rods and cones

• Inner layerInner layer: horizontal cells, laterally : horizontal cells, laterally connected to photoreceptorsconnected to photoreceptors

• Ganglion layerGanglion layer: ganglion cells, connected : ganglion cells, connected (indirectly) to horizontal cells, project via (indirectly) to horizontal cells, project via the myelinated pathways, to the Lateral the myelinated pathways, to the Lateral Geniculate Nuclei (LGN) in the cortexGeniculate Nuclei (LGN) in the cortex

Page 20: Eye-Based Interaction in Graphical Systems: Theory & Practice

B.2: The Retina—receptive fieldsB.2: The Retina—receptive fieldsB.2: The Retina—receptive fieldsB.2: The Retina—receptive fields

• Receptive fields: collections of Receptive fields: collections of interconnected cells within the inner and interconnected cells within the inner and ganglion layersganglion layers

• Field organization determines impulse Field organization determines impulse signature of cells, based on cell typessignature of cells, based on cell types

• Cells may depolarize due to light Cells may depolarize due to light increments (+) or decrements (-)increments (+) or decrements (-)

Page 21: Eye-Based Interaction in Graphical Systems: Theory & Practice

B.2: The Retina—receptive fields B.2: The Retina—receptive fields (cont’d)(cont’d)B.2: The Retina—receptive fields B.2: The Retina—receptive fields (cont’d)(cont’d)

Fig.6: Receptive fields:Fig.6: Receptive fields:

• signal profile signal profile resembles a resembles a “Mexican hat”“Mexican hat”

• receptive field receptive field sizes vary sizes vary concentricallyconcentrically

• color-opposing color-opposing fields also existfields also exist

Page 22: Eye-Based Interaction in Graphical Systems: Theory & Practice

B.3: Visual PathwaysB.3: Visual PathwaysB.3: Visual PathwaysB.3: Visual Pathways

• Retinal ganglion cells project to the LGN Retinal ganglion cells project to the LGN along two major pathways, distinguished along two major pathways, distinguished by morphological cell types: by morphological cell types: and and cells cells cells project to the cells project to the magnocellularmagnocellular (M-) layers (M-) layers

cells project to the cells project to the parvocellularparvocellular (P-) layers (P-) layers

• Ganglion cells are functionally classified Ganglion cells are functionally classified by three types: X, Y, and W cellsby three types: X, Y, and W cells

Page 23: Eye-Based Interaction in Graphical Systems: Theory & Practice

B.3: Visual Pathways—functional B.3: Visual Pathways—functional response of ganglion cellsresponse of ganglion cellsB.3: Visual Pathways—functional B.3: Visual Pathways—functional response of ganglion cellsresponse of ganglion cells

• X cells: sustained stimulus, location, and X cells: sustained stimulus, location, and fine detailfine detail• nervate along both M- and P- projectionsnervate along both M- and P- projections

• Y cells: transient stimulus, coarse features, Y cells: transient stimulus, coarse features, and motionand motion• nervate along only the M-projectionnervate along only the M-projection

• W cells: coarse features and motionW cells: coarse features and motion• project to the Superior Colliculus (SC)project to the Superior Colliculus (SC)

Page 24: Eye-Based Interaction in Graphical Systems: Theory & Practice

B.3: Visual Pathways B.3: Visual Pathways (cont’d)(cont’d)B.3: Visual Pathways B.3: Visual Pathways (cont’d)(cont’d)

Fig.7: Optic tract and radiations Fig.7: Optic tract and radiations (visual pathways):(visual pathways):

• The LGN is of particular The LGN is of particular clinical importanceclinical importance

• M- and P-cellular M- and P-cellular projections are clearly projections are clearly visible under microscopevisible under microscope

• Axons from M- and P-layers Axons from M- and P-layers of the LGN terminate in of the LGN terminate in area V1area V1

Page 25: Eye-Based Interaction in Graphical Systems: Theory & Practice

B.3: Visual Pathways B.3: Visual Pathways (cont’d)(cont’d)B.3: Visual Pathways B.3: Visual Pathways (cont’d)(cont’d)

Table.1: Functional characteristics of ganglionic Table.1: Functional characteristics of ganglionic projectionsprojections

Characteristics Magno Parvoganglion size large smalltransmission time fast slowreceptive fields large smallsensitivity to small objects poor goodsensitivity to change in light levels large smallsensitivity to contrast low highsensitivity to motion high lowcolor discrimination no yes

Page 26: Eye-Based Interaction in Graphical Systems: Theory & Practice

B.4: The Occipital Cortex and B.4: The Occipital Cortex and BeyondBeyondB.4: The Occipital Cortex and B.4: The Occipital Cortex and BeyondBeyond

Fig.8: The brain Fig.8: The brain and visual and visual pathways:pathways:

• the cerebral the cerebral cortex is cortex is composed of composed of numerous numerous regions regions classified by classified by their functiontheir function

Page 27: Eye-Based Interaction in Graphical Systems: Theory & Practice

B.4: The Occipital Cortex and B.4: The Occipital Cortex and Beyond Beyond (cont’d)(cont’d)

B.4: The Occipital Cortex and B.4: The Occipital Cortex and Beyond Beyond (cont’d)(cont’d)

• M- and P- pathways terminate in distinct M- and P- pathways terminate in distinct layers of cortical area V1layers of cortical area V1

• Cortical cells (unlike center-surround Cortical cells (unlike center-surround ganglion receptive fields) respond to ganglion receptive fields) respond to orientation-specific stimulusorientation-specific stimulus

• Pathways emanating from V1 joining Pathways emanating from V1 joining multiple cortical areas involved in vision multiple cortical areas involved in vision are called are called streamsstreams

Page 28: Eye-Based Interaction in Graphical Systems: Theory & Practice

B.4: The Occipital Cortex and B.4: The Occipital Cortex and BeyondBeyond——directional selectivitydirectional selectivityB.4: The Occipital Cortex and B.4: The Occipital Cortex and BeyondBeyond——directional selectivitydirectional selectivity

• Cortical Directional Selectivity (CDS) of cells Cortical Directional Selectivity (CDS) of cells in V1 contributes to motion perception and in V1 contributes to motion perception and control of eye movementscontrol of eye movements

• CDS cells establish a motion pathway from CDS cells establish a motion pathway from V1 projecting to areas V2 and MT (V5)V1 projecting to areas V2 and MT (V5)

• In contrast, Retinal Directional Selectivity In contrast, Retinal Directional Selectivity (RDS) may not contribute to motion (RDS) may not contribute to motion perception, but is involved in eye movementsperception, but is involved in eye movements

Page 29: Eye-Based Interaction in Graphical Systems: Theory & Practice

B.4: The Occipital Cortex and B.4: The Occipital Cortex and BeyondBeyond——cortical cellscortical cellsB.4: The Occipital Cortex and B.4: The Occipital Cortex and BeyondBeyond——cortical cellscortical cells

• Two consequences of visual system’s Two consequences of visual system’s motion-sensitive, single-cell organization:motion-sensitive, single-cell organization:• due to motion sensitivity, eye movements are never due to motion sensitivity, eye movements are never

perfectly still (instead tiny jitter is observed, termed perfectly still (instead tiny jitter is observed, termed microsaccademicrosaccade)—if eyes were stabilized, image would )—if eyes were stabilized, image would fade!fade!

• due to single-cell organization, representation of due to single-cell organization, representation of natural images is quite abstract: there is no “retinal natural images is quite abstract: there is no “retinal buffer”buffer”

Page 30: Eye-Based Interaction in Graphical Systems: Theory & Practice

B.4: The Occipital Cortex and B.4: The Occipital Cortex and BeyondBeyond——2 attentional streams2 attentional streamsB.4: The Occipital Cortex and B.4: The Occipital Cortex and BeyondBeyond——2 attentional streams2 attentional streams

• Dorsal stream:Dorsal stream:• V1, V2, MT (V5), MST, Posterior Parietal CortexV1, V2, MT (V5), MST, Posterior Parietal Cortex

• sensorimotor (motion, location) processingsensorimotor (motion, location) processing

• the attentional “where”?the attentional “where”?

• Ventral (temporal) stream:Ventral (temporal) stream:• V1, V2, V4, Inferotemporal CortexV1, V2, V4, Inferotemporal Cortex

• cognitive processingcognitive processing

• the attentional “what”?the attentional “what”?

Page 31: Eye-Based Interaction in Graphical Systems: Theory & Practice

B.4: The Occipital Cortex and B.4: The Occipital Cortex and BeyondBeyond——3 attentional regions3 attentional regionsB.4: The Occipital Cortex and B.4: The Occipital Cortex and BeyondBeyond——3 attentional regions3 attentional regions

• Posterior Parietal Cortex (dorsal stream):Posterior Parietal Cortex (dorsal stream):• disengages attentiondisengages attention

• Superior Colliculus (midbrain):Superior Colliculus (midbrain):• relocates attentionrelocates attention

• Pulvinar (thalamus; colocated with LGN):Pulvinar (thalamus; colocated with LGN):• engages, or enhances, attentionengages, or enhances, attention

Page 32: Eye-Based Interaction in Graphical Systems: Theory & Practice

C: Visual Perception C: Visual Perception (with emphasis on foveo-(with emphasis on foveo-

peripheral distinction)peripheral distinction)C: Visual Perception C: Visual Perception (with emphasis on foveo-(with emphasis on foveo-

peripheral distinction)peripheral distinction)

• Measurable performance parameters may Measurable performance parameters may often (but not always!) fall within ranges often (but not always!) fall within ranges predicted by known limitations of the predicted by known limitations of the neurological substrateneurological substrate

• Example: visual acuity may be estimated by Example: visual acuity may be estimated by knowledge of density and distribution of the knowledge of density and distribution of the retinal photoreceptorsretinal photoreceptors

• In general, performance parameters are In general, performance parameters are obtained empiricallyobtained empirically

Page 33: Eye-Based Interaction in Graphical Systems: Theory & Practice

• Main parameters sought: visual acuity, Main parameters sought: visual acuity, contrast sensitivitycontrast sensitivity

• Dimensions of retinal features are measured Dimensions of retinal features are measured in terms of projected scene onto retina in in terms of projected scene onto retina in units of degrees visual angle,units of degrees visual angle,

where where SS is the object size and is the object size and DD is distance is distance

C.1: Spatial VisionC.1: Spatial VisionC.1: Spatial VisionC.1: Spatial Vision

D

SA

22arctan

Page 34: Eye-Based Interaction in Graphical Systems: Theory & Practice

C.1: Spatial Vision—visual angleC.1: Spatial Vision—visual angleC.1: Spatial Vision—visual angleC.1: Spatial Vision—visual angle

Fig.9: Visual angleFig.9: Visual angle

Page 35: Eye-Based Interaction in Graphical Systems: Theory & Practice

C.1: Spatial Vision—common visual C.1: Spatial Vision—common visual anglesanglesC.1: Spatial Vision—common visual C.1: Spatial Vision—common visual anglesangles

Table 2: Common visual anglesTable 2: Common visual angles

Object Distance Angle subtended

thumbnail arm’s length 1.5-2 degsun or moon - .5 degUS quarter coin arm’s length 2 degUS quarter coin 85 m 1 minUS quarter coin 5 km 1 sec

Page 36: Eye-Based Interaction in Graphical Systems: Theory & Practice

C.1: Spatial Vision—retinal regionsC.1: Spatial Vision—retinal regionsC.1: Spatial Vision—retinal regionsC.1: Spatial Vision—retinal regions

• Visual field: 180° horiz. Visual field: 180° horiz. 130° vert. 130° vert.• Fovea Centralis (foveola): highest acuityFovea Centralis (foveola): highest acuity

• 1.3° visual angle; 25,000 cones1.3° visual angle; 25,000 cones

• Fovea: high acuity (at 5°, acuity drops to 50%)Fovea: high acuity (at 5°, acuity drops to 50%)

• 5° visual angle; 100,000 cones5° visual angle; 100,000 cones

• Macula: within “useful” acuity region (to about 30°)Macula: within “useful” acuity region (to about 30°)

• 16.7° visual angle; 650,000 cones16.7° visual angle; 650,000 cones

• Hardly any rods in the foveal regionHardly any rods in the foveal region

Page 37: Eye-Based Interaction in Graphical Systems: Theory & Practice

C.1: Spatial Vision—visual angle and C.1: Spatial Vision—visual angle and receptor distributionreceptor distributionC.1: Spatial Vision—visual angle and C.1: Spatial Vision—visual angle and receptor distributionreceptor distribution

Fig.10: Retinotopic receptor distributionFig.10: Retinotopic receptor distribution

Page 38: Eye-Based Interaction in Graphical Systems: Theory & Practice

C.1: Spatial Vision—visual acuityC.1: Spatial Vision—visual acuityC.1: Spatial Vision—visual acuityC.1: Spatial Vision—visual acuity

Fig.11: Visual acuity at Fig.11: Visual acuity at eccentricities and light levels:eccentricities and light levels:

• at photopic (day) light levels, at photopic (day) light levels, acuity is fairly constant acuity is fairly constant within central 2°within central 2°

• acuity drops of linearly to 5°; acuity drops of linearly to 5°; drops sharply (exp.) beyonddrops sharply (exp.) beyond

• at scotopic (night) light at scotopic (night) light levels, acuity is poor at all levels, acuity is poor at all eccentricitieseccentricities

Page 39: Eye-Based Interaction in Graphical Systems: Theory & Practice

C.1: Spatial Vision—measuring C.1: Spatial Vision—measuring visual acuityvisual acuityC.1: Spatial Vision—measuring C.1: Spatial Vision—measuring visual acuityvisual acuity

• Acuity roughly corresponds to foveal Acuity roughly corresponds to foveal receptor distribution in the fovea, but not receptor distribution in the fovea, but not necessarily in the peripherynecessarily in the periphery

• Due to various contributing factors Due to various contributing factors (synaptic organization and later-stage (synaptic organization and later-stage neural elements), effective relative visual neural elements), effective relative visual acuity is generally measured by acuity is generally measured by psychophysical experimentationpsychophysical experimentation

Page 40: Eye-Based Interaction in Graphical Systems: Theory & Practice

C.2: Temporal VisionC.2: Temporal VisionC.2: Temporal VisionC.2: Temporal Vision

• Visual response to motion is characterized by Visual response to motion is characterized by two distinct facts: two distinct facts: persistence of visionpersistence of vision (POV) (POV) and the and the phi phenomenonphi phenomenon

• POV: essentially describes human temporal POV: essentially describes human temporal sampling ratesampling rate

• Phi: describes threshold above which humans Phi: describes threshold above which humans detect detect apparent movementapparent movement

• Both facts exploited in media to elicit motion Both facts exploited in media to elicit motion perceptionperception

Page 41: Eye-Based Interaction in Graphical Systems: Theory & Practice

C.2: Temporal Vision—persistence of C.2: Temporal Vision—persistence of visionvisionC.2: Temporal Vision—persistence of C.2: Temporal Vision—persistence of visionvision

Fig.12: Critical Fusion Frequency:Fig.12: Critical Fusion Frequency:

• stimulus flashing at about stimulus flashing at about 50-60Hz appears steady50-60Hz appears steady

• CFF explains why flicker is CFF explains why flicker is not seen when viewing not seen when viewing sequence of still imagessequence of still images

• cinema: 24 fps cinema: 24 fps 3 = 72Hz 3 = 72Hz due to 3-bladed shutterdue to 3-bladed shutter

• TV: 60 fields/sec, interlacedTV: 60 fields/sec, interlaced

Page 42: Eye-Based Interaction in Graphical Systems: Theory & Practice

C.2: Temporal Vision—phi C.2: Temporal Vision—phi phenomenonphenomenonC.2: Temporal Vision—phi C.2: Temporal Vision—phi phenomenonphenomenon

• Phi phenomenon explains why motion is Phi phenomenon explains why motion is perceived in cinema, TV, graphicsperceived in cinema, TV, graphics

• Besides necessary flicker rate (60Hz), Besides necessary flicker rate (60Hz), illusion of apparent, or stroboscopic, illusion of apparent, or stroboscopic, motion must be maintainedmotion must be maintained

• Similar to old-fashioned neon signs with Similar to old-fashioned neon signs with stationary bulbsstationary bulbs

• Minimum rate: 16 frames per secondMinimum rate: 16 frames per second

Page 43: Eye-Based Interaction in Graphical Systems: Theory & Practice

C.2: Temporal Vision—peripheral C.2: Temporal Vision—peripheral motion perceptionmotion perceptionC.2: Temporal Vision—peripheral C.2: Temporal Vision—peripheral motion perceptionmotion perception

• Motion perception is not homogeneous across Motion perception is not homogeneous across visual fieldvisual field

• Sensitivity to target motion decreases with Sensitivity to target motion decreases with retinal eccentricity for slow motion...retinal eccentricity for slow motion...• higher rate of target motion (e.g., spinning disk) is higher rate of target motion (e.g., spinning disk) is

needed to match apparent velocity in foveaneeded to match apparent velocity in fovea

• ……but, motion is more salient in periphery than but, motion is more salient in periphery than in fovea (easier to detect moving targets than in fovea (easier to detect moving targets than stationary ones)stationary ones)

Page 44: Eye-Based Interaction in Graphical Systems: Theory & Practice

C.2: Temporal Vision—peripheral C.2: Temporal Vision—peripheral sensitivity to direction of motionsensitivity to direction of motionC.2: Temporal Vision—peripheral C.2: Temporal Vision—peripheral sensitivity to direction of motionsensitivity to direction of motion

Fig.13: Threshold isograms for Fig.13: Threshold isograms for peripheral rotary movement:peripheral rotary movement:

• periphery is twice as periphery is twice as sensitive to horizontal-sensitive to horizontal-axis movement as to axis movement as to vertical-axis movementvertical-axis movement

• (numbers in diagram (numbers in diagram are rates of pointer are rates of pointer movement in rev./min.)movement in rev./min.)

Page 45: Eye-Based Interaction in Graphical Systems: Theory & Practice

C.3: Color Vision—cone typesC.3: Color Vision—cone typesC.3: Color Vision—cone typesC.3: Color Vision—cone types

Fig.14: Spectral sensitivity curves Fig.14: Spectral sensitivity curves of cone photoreceptorsof cone photoreceptors

• foveal color vision is foveal color vision is facilitated by three types of facilitated by three types of cone photorecptorscone photorecptors

• a good deal is known about a good deal is known about foveal color vision, relatively foveal color vision, relatively little is known about little is known about peripheral color visionperipheral color vision

• of the 7,000,000 cones, of the 7,000,000 cones, most are packed tightly into most are packed tightly into the central 30° foveal regionthe central 30° foveal region

Page 46: Eye-Based Interaction in Graphical Systems: Theory & Practice

C.3: Color Vision—peripheral color C.3: Color Vision—peripheral color perception fieldsperception fieldsC.3: Color Vision—peripheral color C.3: Color Vision—peripheral color perception fieldsperception fields

Fig.15: Visual fields for monocular Fig.15: Visual fields for monocular color vision (right eye)color vision (right eye)

• blueblue and and yellowyellow fields are fields are larger than larger than redred and and greengreen fieldsfields

• most sensitive to most sensitive to blueblue, up , up to 83°; to 83°; redred up to 76°; up to 76°; greengreen up to 74° up to 74°

• chromatic fields do not chromatic fields do not have definite borders, have definite borders, sensitivity gradually and sensitivity gradually and irregularly drops off over irregularly drops off over 15-30° range15-30° range

Page 47: Eye-Based Interaction in Graphical Systems: Theory & Practice

C.4: Implications for Design of C.4: Implications for Design of Attentional DisplaysAttentional DisplaysC.4: Implications for Design of C.4: Implications for Design of Attentional DisplaysAttentional Displays

• Need to consider distinct characteristics of Need to consider distinct characteristics of foveal and peripheral vision, in particular:foveal and peripheral vision, in particular:• spatial resolutionspatial resolution

• temporal resolutiontemporal resolution

• luminance / chrominanceluminance / chrominance

• Furthermore, Furthermore, gaze-contingentgaze-contingent systems must systems must match dynamics of human eye movementmatch dynamics of human eye movement

Page 48: Eye-Based Interaction in Graphical Systems: Theory & Practice

D: Taxonomy and Models of Eye D: Taxonomy and Models of Eye MovementsMovementsD: Taxonomy and Models of Eye D: Taxonomy and Models of Eye MovementsMovements

• Eye movements are mainly used to Eye movements are mainly used to reposition the foveareposition the fovea

• Five main classes of eye movements:Five main classes of eye movements:

• saccadicsaccadic

• smooth pursuitsmooth pursuit

• vergencevergence

• vestibularvestibular

• physiological nystagmusphysiological nystagmus

• (fixations)(fixations)

• Other types of movements are non-Other types of movements are non-positional positional (adaptation, accommodation)(adaptation, accommodation)

Page 49: Eye-Based Interaction in Graphical Systems: Theory & Practice

D.1: Extra-Ocular MusclesD.1: Extra-Ocular MusclesD.1: Extra-Ocular MusclesD.1: Extra-Ocular Muscles

Fig.16: Extrinsic muscles of the eyes:Fig.16: Extrinsic muscles of the eyes:

• in general, eyes move within 6 degrees of freedom (6 in general, eyes move within 6 degrees of freedom (6 muscles)muscles)

Page 50: Eye-Based Interaction in Graphical Systems: Theory & Practice

D.1: Oculomotor PlantD.1: Oculomotor PlantD.1: Oculomotor PlantD.1: Oculomotor Plant

Fig.17: Oculomotor system:Fig.17: Oculomotor system:

• eye movement signals eye movement signals emanate from three emanate from three main distinct regions:main distinct regions:

• occipital cortex (areas occipital cortex (areas 17, 18, 19, 22)17, 18, 19, 22)

• superior colliculus (SC)superior colliculus (SC)

• semicircular canals semicircular canals (SCC)(SCC)

Page 51: Eye-Based Interaction in Graphical Systems: Theory & Practice

D.1: Oculomotor Plant D.1: Oculomotor Plant (cont’d)(cont’d)D.1: Oculomotor Plant D.1: Oculomotor Plant (cont’d)(cont’d)

• Two pertinent observations:Two pertinent observations:1 eye movement system is, to a large extent, a eye movement system is, to a large extent, a

feedback circuitfeedback circuit

2 controlling cortical regions can be functionally controlling cortical regions can be functionally characterized as:characterized as:

• voluntary (occipital cortex—areas 17, 18, 19, 22)voluntary (occipital cortex—areas 17, 18, 19, 22)

• involuntary (superior colliculus, SC)involuntary (superior colliculus, SC)

• reflexive (semicircular canals, SCC)reflexive (semicircular canals, SCC)

Page 52: Eye-Based Interaction in Graphical Systems: Theory & Practice

D.2: SaccadesD.2: SaccadesD.2: SaccadesD.2: Saccades

• Rapid eye movements used to reposition Rapid eye movements used to reposition foveafovea

• Voluntary and reflexiveVoluntary and reflexive

• Range in duration from 10ms - 100msRange in duration from 10ms - 100ms

• Effectively blind during transitionEffectively blind during transition

• Deemed Deemed ballisticballistic (pre-programmed) and (pre-programmed) and stereotyped stereotyped (reproducible)(reproducible)

Page 53: Eye-Based Interaction in Graphical Systems: Theory & Practice

D.2: Saccades—modelingD.2: Saccades—modelingD.2: Saccades—modelingD.2: Saccades—modeling

Fig.18: Linear moving average filter model:Fig.18: Linear moving average filter model:

• sstt = input (pulse), = input (pulse), xxtt = output (step), = output (step), ggkk = filter coefficients = filter coefficients

• e.g., Haar filter {1,-1}e.g., Haar filter {1,-1}

0

110

kktk

ttt

sg

sgsgx

Page 54: Eye-Based Interaction in Graphical Systems: Theory & Practice

D.3: Smooth PursuitsD.3: Smooth PursuitsD.3: Smooth PursuitsD.3: Smooth Pursuits

• Involved when visually tracking a moving Involved when visually tracking a moving targettarget

• Depending on range of target motion, eyes Depending on range of target motion, eyes are capable of matching target velocityare capable of matching target velocity

• Pursuit movements are an example of a Pursuit movements are an example of a control system with built-in negative control system with built-in negative feedbackfeedback

Page 55: Eye-Based Interaction in Graphical Systems: Theory & Practice

D.3: Smooth Pursuits—modelingD.3: Smooth Pursuits—modelingD.3: Smooth Pursuits—modelingD.3: Smooth Pursuits—modeling

Fig.19: Linear, time-invariant filter model:Fig.19: Linear, time-invariant filter model:

• sstt = target position, = target position, xxtt = (desired) eye position, = (desired) eye position, hh = filter = filter

• retinal receptors give additive velocity errorretinal receptors give additive velocity error

1 ttt xxsh )(

Page 56: Eye-Based Interaction in Graphical Systems: Theory & Practice

D.4: NystagmusD.4: NystagmusD.4: NystagmusD.4: Nystagmus

• Conjugate eye movements characterized by Conjugate eye movements characterized by sawtooth-like time course pattern (pursuits sawtooth-like time course pattern (pursuits interspersed with saccades)interspersed with saccades)

• Two types (virtually indistinguishable):Two types (virtually indistinguishable):• Optokinetic: compensation for retinal movement of targetOptokinetic: compensation for retinal movement of target

• Vestibular: compensation for head movementVestibular: compensation for head movement

• May be possible to model with combination May be possible to model with combination of saccade/pursuit filtersof saccade/pursuit filters

Page 57: Eye-Based Interaction in Graphical Systems: Theory & Practice

D.5: FixationsD.5: FixationsD.5: FixationsD.5: Fixations

• Possibly the most important type of eye Possibly the most important type of eye movement for attentional applicationsmovement for attentional applications• 90% viewing time is devoted to fixations90% viewing time is devoted to fixations

• duration: 150ms - 600msduration: 150ms - 600ms

• Not technically eye movements in their Not technically eye movements in their own right, rather characterized by own right, rather characterized by miniature eye movements:miniature eye movements:• tremor, drift, microsaccadestremor, drift, microsaccades

Page 58: Eye-Based Interaction in Graphical Systems: Theory & Practice

D.6: Eye Movement AnalysisD.6: Eye Movement AnalysisD.6: Eye Movement AnalysisD.6: Eye Movement Analysis

• Two significant observations:Two significant observations:1 only three types of eye movements are mainly only three types of eye movements are mainly

needed to gain insight into overt localization of visual needed to gain insight into overt localization of visual attention:attention:

• fixationsfixations

• saccadessaccades

• smooth pursuits (to a lesser extent)smooth pursuits (to a lesser extent)

2 all three signals may be approximated by linear, all three signals may be approximated by linear, time-invariant (LTI) filter systemstime-invariant (LTI) filter systems

Page 59: Eye-Based Interaction in Graphical Systems: Theory & Practice

D.6: Eye Movement Analysis—D.6: Eye Movement Analysis—assumptionsassumptionsD.6: Eye Movement Analysis—D.6: Eye Movement Analysis—assumptionsassumptions

• Important point: it is assumed observed eye Important point: it is assumed observed eye movements disclose evidence of movements disclose evidence of overtovert visual visual attentionattention• it is possible to attend to objects it is possible to attend to objects covertlycovertly (without (without

moving eyes)moving eyes)

• Linearity: although practical, this assumption Linearity: although practical, this assumption is an operational oversimplification of is an operational oversimplification of neuronal (non-linear) systemsneuronal (non-linear) systems

Page 60: Eye-Based Interaction in Graphical Systems: Theory & Practice

D.6: Eye Movement Analysis—goalsD.6: Eye Movement Analysis—goalsD.6: Eye Movement Analysis—goalsD.6: Eye Movement Analysis—goals

• goal of analysis is to locate goal of analysis is to locate regions where signal regions where signal average changes abruptlyaverage changes abruptly• fixation end, saccade startfixation end, saccade start• saccade end, fixation startsaccade end, fixation start

• two main approaches:two main approaches:• summation-basedsummation-based• differentiation-baseddifferentiation-based

• both approaches rely on both approaches rely on empirical thresholdsempirical thresholds

Fig.20: Hypothetical eye movement signalFig.20: Hypothetical eye movement signal

Page 61: Eye-Based Interaction in Graphical Systems: Theory & Practice

D.6: Eye Movement Analysis—D.6: Eye Movement Analysis—denoisingdenoisingD.6: Eye Movement Analysis—D.6: Eye Movement Analysis—denoisingdenoising

Fig.21: Signal denoising—reduce noise due to:Fig.21: Signal denoising—reduce noise due to:• eye instability (jitter), or worse, blinkseye instability (jitter), or worse, blinks• removal possible based on device characteristics (e.g., blink = removal possible based on device characteristics (e.g., blink =

[0,0])[0,0])

Page 62: Eye-Based Interaction in Graphical Systems: Theory & Practice

D.6: Eye Movement Analysis—D.6: Eye Movement Analysis—summation basedsummation basedD.6: Eye Movement Analysis—D.6: Eye Movement Analysis—summation basedsummation based

• Dwell-time fixation detection depends on:Dwell-time fixation detection depends on:• identification of a stationary signal (fixation), andidentification of a stationary signal (fixation), and

• size of time window specifying range of duration size of time window specifying range of duration (and hence temporal threshold)(and hence temporal threshold)

• Example: Example: position-varianceposition-variance method: method:• determine whether determine whether MM of of NN points lie within a certain points lie within a certain

distance distance DD of the mean ( of the mean () of the signal) of the signal

• values values MM, , NN, and , and DD are determined empirically are determined empirically

Page 63: Eye-Based Interaction in Graphical Systems: Theory & Practice

D.6: Eye Movement Analysis—D.6: Eye Movement Analysis—differentiation baseddifferentiation basedD.6: Eye Movement Analysis—D.6: Eye Movement Analysis—differentiation baseddifferentiation based

• Velocity-based saccade/fixation detection:Velocity-based saccade/fixation detection:• calculated velocity (over signal window) is compared calculated velocity (over signal window) is compared

to thresholdto threshold

• if velocity > threshold then saccade, else fixationif velocity > threshold then saccade, else fixation

• Example: Example: velocity detectionvelocity detection method: method:• use short Finite Impulse Response (FIR) filters to use short Finite Impulse Response (FIR) filters to

detect saccade (may be possible in real-time)detect saccade (may be possible in real-time)

• assuming symmetrical velocity profile, can extend to assuming symmetrical velocity profile, can extend to velocity-based predictionvelocity-based prediction

Page 64: Eye-Based Interaction in Graphical Systems: Theory & Practice

D.6: Eye Movement Analysis D.6: Eye Movement Analysis (cont’d)(cont’d)D.6: Eye Movement Analysis D.6: Eye Movement Analysis (cont’d)(cont’d)

Fig.22: Saccade/fixation detectionFig.22: Saccade/fixation detection

(a) position-variance(a) position-variance (b) velocity-detection(b) velocity-detection

Page 65: Eye-Based Interaction in Graphical Systems: Theory & Practice

D.6: Eye Movement Analysis—D.6: Eye Movement Analysis—exampleexampleD.6: Eye Movement Analysis—D.6: Eye Movement Analysis—exampleexample

Fig.23: FIR filter velocity-detection Fig.23: FIR filter velocity-detection method based on idealized method based on idealized saccade detection:saccade detection:

• 4 conditions on measured 4 conditions on measured acceleration:acceleration:

maxmin

)()(

||

||

TIIT

ISgnISgn

BI

AI

12

12

2

1 • acc. > thresh. Aacc. > thresh. A

• acc. > thresh. Bacc. > thresh. B

• sign changesign change

• duration thresh.duration thresh.

• thresholds derived from thresholds derived from empirical valuesempirical values

Page 66: Eye-Based Interaction in Graphical Systems: Theory & Practice

D.6: Eye Movement Analysis—D.6: Eye Movement Analysis—example example (cont’d)(cont’d)

D.6: Eye Movement Analysis—D.6: Eye Movement Analysis—example example (cont’d)(cont’d)

• Amplitude thresholds Amplitude thresholds AA, , BB: derived from expected peak : derived from expected peak saccade velocities: 600°/ssaccade velocities: 600°/s

• Duration thresholds Duration thresholds TTminmin, , TTmaxmax: derived from expected : derived from expected

saccade duration: 120ms - 300mssaccade duration: 120ms - 300ms

Fig.24: FIR filters for saccade detectionFig.24: FIR filters for saccade detection