why does this suck?

74
why does this suck?

Upload: michi

Post on 22-Feb-2016

35 views

Category:

Documents


0 download

DESCRIPTION

why does this suck?. Information Visualization. Jeffrey Heer UC Berkeley | PARC, Inc. CS160 – 2004.11.22. (includes numerous slides from Marti Hearst, Ed Chi, Stuart Card, and Peter Pirolli). Basic Problem. We live in a new ecology. Scientific Journals. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: why does this suck?

why does this suck?

Page 2: why does this suck?

Information Visualization

Jeffrey HeerUC Berkeley | PARC, Inc.

CS160 – 2004.11.22

(includes numerous slides from Marti Hearst, Ed Chi, Stuart Card, and Peter Pirolli)

Page 3: why does this suck?

Basic Problem

We live in a new ecology.

Page 4: why does this suck?

Scientific JournalsJournals/person increases 10X every 50 yearsJournals/person increases 10X every 50 years

YearYear

0.01

0.1

1

10

100

1000

10000

100000

1000000

1750 1800 1850 1900 1950 2000

JournalsJournals

Journals/People x10Journals/People x1066

DarwinDarwin V. BushV. Bush YouYou

Page 5: why does this suck?

Web Ecologies

1

10

100

1000

10000

100000

1000000

10000000

Aug-92 Feb-93 Aug-93 Feb-94 Aug-94 Feb-95 Aug-95 Feb-96 Aug-96 Feb-97 Aug-97 Feb-98 Aug-98

Serv

ers

Source: World Wide Web Consortium, Mark Gray, Netcraft Server Survey

1 new server every 2 seconds7.5 new pages per second

Page 6: why does this suck?

Human Capacity

0.01

0.1

1

10

100

1000

10000

100000

1000000

1750 1800 1850 1900 1950 2000DarwinDarwin V. BushV. Bush YouYou

Page 7: why does this suck?

Attentional Processes“What information consumes is rather obvious: it consumes the attention of its recipients. Hence a wealth of information creates a poverty of attention, and a need to allocate that attention efficiently among the overabundance of information sources that might consume it.”

~Herb Simonas quoted by Hal Varian

Scientific AmericanSeptember 1995

Page 8: why does this suck?

Human-Information Interaction The real design problem is not increased

access to information, but greater efficiency in finding useful information.

Increasing the rate at which people can find and use relevant information improves human intelligence.

Amount ofAccessibleKnowledge

Amount ofAmount ofAccessibleAccessibleKnowledgeKnowledge

Cost [Time]Cost [Time]Cost [Time]

Amount ofAccessibleKnowledge

Amount ofAmount ofAccessibleAccessibleKnowledgeKnowledge

Cost [Time]Cost [Time]Cost [Time]

Page 9: why does this suck?

Information Visualization Leverage highly-developed human visual

system to achieve rapid understanding of abstract information.

1.2 b/s (Reading)2.3 b/s (Pictures)

Page 10: why does this suck?

Information Visualization “Transformation of the symbolic into the geometric”

(McCormick et al., 1987)

“... finding the artificial memory that best supports our natural means of perception.'‘ (Bertin, 1983)

The depiction of information using spatial or graphical representations, to facilitate comparison, pattern recognition, change detection, and other cognitive skills by making use of the visual system. (Hearst, 2003)

Page 11: why does this suck?

Why Visualization? Use the eye for pattern recognition; people good at

scanning recognizing remembering images

Graphical elements facilitate comparisons via length shape orientation texture

Animation shows changes across time Color helps make distinctions Aesthetics make the process appealing

Page 12: why does this suck?

Visualization Success Stories

Page 13: why does this suck?

Visualization Success Story

Mystery: what is causing a cholera epidemic in London in 1854?

Page 14: why does this suck?

Visualization Success Story

From Visual Explanations by Edward Tufte, Graphics Press, 1997

Illustration of John Snow’sdeduction that a cholera epidemic was caused by a bad water pump, circa 1854.

Horizontal lines indicate location of deaths.

Page 15: why does this suck?

Visualization Success Story

From Visual Explanations by Edward Tufte, Graphics Press, 1997

Illustration of John Snow’s deduction that a cholera epidemic was caused by a bad water pump, circa 1854.

Horizontal lines indicate location of deaths.

Page 16: why does this suck?

A Visualization Expedition

(a tour through past and present)

Page 17: why does this suck?

Perspective Wall

Page 18: why does this suck?

Slide adapted from Chris North 18

Starfield Displays

Page 19: why does this suck?

Film Finder

Page 20: why does this suck?

Table Lens

Page 21: why does this suck?

Distortion Techniques

Page 22: why does this suck?

Indented Hierarchy Layout

Places all items along vertically spaced rows

Uses indentation to show parent child relationships

Breadth and depth end up fighting for space resources

Page 23: why does this suck?

Top-down layout

Uses separate dimensions for breadth and depth

Reingold-Tilford Layout

tidier drawing of trees - reingold, tilford

Page 24: why does this suck?

TreeMaps

Space-filling technique that divides space recursively

Segments space according to ‘size’ of children nodes

map of the market – smartmoney.com

Page 25: why does this suck?

SpaceTree

Page 26: why does this suck?

Cone Trees

Tree layout in three dimensions

Shadows provide 2D structure

Can also make “Balloon Trees” – 2D version of ConeTree

cone tree – robertson, mackinlay, and card

Page 27: why does this suck?

Degree-of-Interest Trees

Page 28: why does this suck?

Hyperbolic Trees

Page 29: why does this suck?

Network visualization

Often uses physics models (e.g., edges as springs) to perform layout.

Can be animated and interacted with.

Page 30: why does this suck?

Network Visualization

Skitter, www.caida.org

Page 31: why does this suck?

WebBook

Page 32: why does this suck?

Web Forager

Page 33: why does this suck?

Document Lens

Page 34: why does this suck?

Data Mountain

Supports document organization in a 2.5 dimensional environment.

Page 35: why does this suck?

Designing Visualizations

(some tricks of the trade)

Page 36: why does this suck?

Graphical Excellence [Tufte]

the well-designed presentation of interesting data – a matter of substance, of statistics, and of design

consists of complex ideas communicated with clarity, precision and efficiency

is that which gives to the viewer the greatest number of ideas in the shortest time with the least ink in the smallest space

requires telling the truth about the data.

Page 37: why does this suck?

Interactive Tasks [Shneiderman]

1. Overview: Get an overview of the collection2. Zoom: Zoom in on items of interest3. Filter: Remove uninteresting items4. Details on demand: Select items and get

details5. Relate: View relationships between items6. History: Keep a history of actions for undo,

replay, refinement7. Extract: Make subcollections

Page 38: why does this suck?

Proposed Data Types1. 1D: timelines,…2. 2D: maps,…3. 3D: volumes,…4. Multi-dimensional: databases,…5. Hierarchies/Trees: directories,…6. Networks/Graphs: web,…7. Document collections: digital libraries,…

This is useful, but what’s wrong here?

Page 39: why does this suck?

Basic Types of Data Nominal (qualitative)

(no inherent order) city names, types of diseases, ...

Ordinal (qualitative) (ordered, but not at measurable intervals) first, second, third, … cold, warm, hot Mon, Tue, Wed, Thu …

Interval (quantitative) integers or reals

Page 40: why does this suck?

QUANT ORDINAL NOMINAL

Position Position PositionLength Density Color HueAngle Color Saturation TextureSlope Color Hue ConnectionArea Texture ContainmentVolume Connection DensityDensity Containment Color SaturationColor Saturation Length ShapeColor Hue Angle Length

Ranking of Applicability of Properties for Different Data Types(Mackinlay 88, Not Empirically Verified)

Page 41: why does this suck?

Visualization Design Patterns Pre-Attentive Patterns

Leverage things that automatically “pop-out” to human attention Stark contrast in color, shape, size, orientation

Gestalt Properties Use psychological theories of visual grouping proximity, similarity, continuity, connectedness, closure,

symmetry, common fate, figure/ground separation High Data Density

Maximize number of items/area of graphic This is controversial! Whitespace may contribute to good visual

design… so balance appropriately. Small Multiples

Show varying visualizations/patterns adjacent to one another Enable Comparisons

Page 42: why does this suck?

Visualization Design Patterns Focus+Context

Highlight regions of current interest, while de-emphasizing but keeping visible surrounding context.

Can visually distort space, or use degree-of-interest function to control what is and isn’t visualized.

Dynamic Queries Allow rapid refinement of visualization criteria Range sliders, Query sliders

Panning and Zooming Navigate large spaces using a camera metaphor

Semantic Zooming Change content presentation based on zooming level Hide/reveal additional data in accordance with available space

Page 43: why does this suck?

Software Architectures The Information Visualization Reference

Model [Chi, Card, Mackinlay, Shneiderman]

Page 44: why does this suck?

Evaluating Visualizations

Page 45: why does this suck?

Evaluating Visualizations Visualizations are user interfaces, too…established

methodologies can be used. Questions to ask

What tasks do you expect people to perform with the visualization?

What interfaces currently exist for this task? In what ways do you expect different visualizations to help

or hurt aspects of these tasks? Metrics: task time, success rate, information gained

(e.g., test the user, or exploit priming effects), eye tracking.

Page 46: why does this suck?

Evaluating Hyperbolic Trees The Great CHI’97 Browse-Off: Individual

browsers race against the clock to perform various retrieval and comparison tasks.

Hyperbolic Tree won against M$ File Explorer and others.

Can we conclude that it is the better browser?

vs.

Page 47: why does this suck?

Evaluating Hyperbolic Trees No!

Different people operating each browser. Tasks were not ecologically valid.

Can’t say what is better for what. PARC researchers did extensive eye-tracking studies uncovering

very nuanced visual psychology. Found Hyperbolic Tree is better when underlying information

design (e.g., tree structure and labeling) is better. In case of CHI Browse Off, the Hyperbolic Tree had a quicker

human user “behind the wheel”. Moral: Exercise judicious study design, but also don’t feel let

down if task times are not being radically improved… subtleties abound.

Page 48: why does this suck?

Questions?

Jeffrey Heer [email protected] http://prefuse.sourceforge.net

Page 49: why does this suck?

Accuracy Ranking of Quantitative Perceptual TasksEstimated; only pairwise comparisons have been validated

(Mackinlay 88 from Cleveland & McGill)

Page 50: why does this suck?

Interpretations of Visual PropertiesSome properties can be discriminated more accurately but don’t have intrinsic meaning

Density (Greyscale)Darker -> More

Size / Length / AreaLarger -> More

PositionLeftmost -> first, Topmost -> first

Hue??? no intrinsic meaning

Slope??? no intrinsic meaning

Page 51: why does this suck?

Micro-Aspects of Visualization Design(aka fun with visual psychology)

Page 52: why does this suck?

Preattentive Processing A limited set of visual properties are

processed preattentively (without need for focusing attention).

This is important for design of visualizationswhat can be perceived immediatelywhat properties are good discriminatorswhat can mislead viewers

All Preattentive Processing figures from Healey 97http://www.csc.ncsu.edu/faculty/healey/PP/PP.html

Page 53: why does this suck?

Example: Color Selection

Viewer can rapidly and accurately determinewhether the target (red circle) is present or absent.Difference detected in color.

Page 54: why does this suck?

Example: Shape Selection

Viewer can rapidly and accurately determinewhether the target (red circle) is present or absent.Difference detected in form (curvature)

Page 55: why does this suck?

Pre-attentive Processing < 200 - 250ms qualifies as pre-attentive

eye movements take at least 200ms yet certain processing can be done very quickly,

implying low-level processing in parallel If a decision takes a fixed amount of time

regardless of the number of distractors, it is considered to be preattentive.

Page 56: why does this suck?

Example: Conjunction of Features

Viewer cannot rapidly and accurately determinewhether the target (red circle) is present or absent when target has two or more features, each of which arepresent in the distractors. Viewer must search sequentially.

All Preattentive Processing figures from Healey 97http://www.csc.ncsu.edu/faculty/healey/PP/PP.html

Page 57: why does this suck?

Example: Emergent Features

Target has a unique feature with respect to distractors (open sides) and so the groupcan be detected preattentively.

Page 58: why does this suck?

Example: Emergent Features

Target does not have a unique feature with respect to distractors and so the group cannot be detected preattentively.

Page 59: why does this suck?

Asymmetric and Graded Preattentive Properties Some properties are asymmetric

a sloped line among vertical lines is preattentive a vertical line among sloped ones is not

Some properties have a gradation some more easily discriminated among than others

Page 60: why does this suck?

Use Grouping of Well-Chosen Shapes for Displaying Multivariate Data

Page 61: why does this suck?

SUBJECT PUNCHED QUICKLY OXIDIZED TCEJBUS DEHCNUP YLKCIUQ DEZIDIXOCERTAIN QUICKLY PUNCHED METHODS NIATREC YLKCIUQ DEHCNUP SDOHTEMSCIENCE ENGLISH RECORDS COLUMNS ECNEICS HSILGNE SDROCER SNMULOCGOVERNS PRECISE EXAMPLE MERCURY SNREVOG ESICERP ELPMAXE YRUCREMCERTAIN QUICKLY PUNCHED METHODS NIATREC YLKCIUQ DEHCNUP SDOHTEMGOVERNS PRECISE EXAMPLE MERCURY SNREVOG ESICERP ELPMAXE YRUCREMSCIENCE ENGLISH RECORDS COLUMNS ECNEICS HSILGNE SDROCER SNMULOCSUBJECT PUNCHED QUICKLY OXIDIZED TCEJBUS DEHCNUP YLKCIUQ DEZIDIXOCERTAIN QUICKLY PUNCHED METHODS NIATREC YLKCIUQ DEHCNUP SDOHTEMSCIENCE ENGLISH RECORDS COLUMNS ECNEICS HSILGNE SDROCER SNMULOC

Page 62: why does this suck?

SUBJECT PUNCHED QUICKLY OXIDIZED TCEJBUS DEHCNUP YLKCIUQ DEZIDIXOCERTAIN QUICKLY PUNCHED METHODS NIATREC YLKCIUQ DEHCNUP SDOHTEMSCIENCE ENGLISH RECORDS COLUMNS ECNEICS HSILGNE SDROCER SNMULOCGOVERNS PRECISE EXAMPLE MERCURY SNREVOG ESICERP ELPMAXE YRUCREMCERTAIN QUICKLY PUNCHED METHODS NIATREC YLKCIUQ DEHCNUP SDOHTEMGOVERNS PRECISE EXAMPLE MERCURY SNREVOG ESICERP ELPMAXE YRUCREMSCIENCE ENGLISH RECORDS COLUMNS ECNEICS HSILGNE SDROCER SNMULOCSUBJECT PUNCHED QUICKLY OXIDIZED TCEJBUS DEHCNUP YLKCIUQ DEZIDIXOCERTAIN QUICKLY PUNCHED METHODS NIATREC YLKCIUQ DEHCNUP SDOHTEMSCIENCE ENGLISH RECORDS COLUMNS ECNEICS HSILGNE SDROCER SNMULOC

Text NOT PreattentiveText NOT Preattentive

Page 63: why does this suck?

Preattentive Visual Properties(Healey 97)

length Triesman & Gormican [1988] width Julesz [1985] size Triesman & Gelade [1980] curvature Triesman & Gormican [1988] number Julesz [1985]; Trick & Pylyshyn [1994] terminators Julesz & Bergen [1983] intersection Julesz & Bergen [1983] closure Enns [1986]; Triesman & Souther [1985] colour (hue) Nagy & Sanchez [1990, 1992]; D'Zmura [1991]

Kawai et al. [1995]; Bauer et al. [1996] intensity Beck et al. [1983]; Triesman & Gormican [1988] flicker Julesz [1971] direction of motion Nakayama & Silverman [1986]; Driver & McLeod [1992] binocular lustre Wolfe & Franzel [1988] stereoscopic depth Nakayama & Silverman [1986] 3-D depth cues Enns [1990] lighting direction Enns [1990]

Page 64: why does this suck?

Gestalt Principles

Idea: forms or patterns transcend the stimuli used to create them. Why do patterns emerge? Under what circumstances?

Principles of Pattern Recognition “gestalt” German for “pattern” or “form, configuration” Original proposed mechanisms turned out to be wrong Rules themselves are still useful

Page 65: why does this suck?

Gestalt PropertiesProximity

Why perceive pairs vs. triplets?

Page 66: why does this suck?

Gestalt PropertiesSimilarity

Slide adapted from Tamara Munzner

Page 67: why does this suck?

Gestalt PropertiesContinuity

Slide adapted from Tamara Munzner

Page 68: why does this suck?

Gestalt PropertiesConnectedness

Slide adapted from Tamara Munzner

Page 69: why does this suck?

Gestalt PropertiesClosure

Slide adapted from Tamara Munzner

Page 70: why does this suck?

Gestalt PropertiesSymmetry

Slide adapted from Tamara Munzner

Page 71: why does this suck?

Gestalt Laws of Perceptual Organization (Kaufman 74)

Figure and Ground Escher illustrations are good examples Vase/Face contrast

Subjective Contour

Page 72: why does this suck?

More Gestalt Laws Law of Common Fate

like preattentive motion propertymove a subset of objects among similar ones and

they will be perceived as a group

Page 73: why does this suck?

Colors for Labeling Ware recommends to take into account:

Distinctness Unique hues

Component process model Contrast with background Color blindness Number

Only a small number of codes can be rapidly perceived Field Size

Small changes in color are difficult to perceive Conventions

Page 74: why does this suck?

Ware’s Recommended Colors for Labeling

Red, Green, Yellow, Blue, Black, White, Pink, Cyan, Gray, Orange, Brown, Purple.The top six colors are chosen because they are the unique colors that mark the ends of the opponent color axes. The entire set corresponds to the eleven color names found to be the most common in a cross-cultural study, plus cyan (Berlin and Kay)