visualization analysis &...

51
@tamaramunzner http://www.cs.ubc.ca/~tmm/talks.html#vad15d3 Visualization Analysis & Design Tamara Munzner Department of Computer Science University of British Columbia D3 Unconference Keynote November 21 2015, San Francisco CA

Upload: others

Post on 24-Mar-2020

0 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Visualization Analysis & Designii.uib.no/vis_old/publications/pdfs/2016-01-17--vad15d3unconf--edited-by-HH.pdf · Visualization is suitable when there is a need to augment human capabilities

tamaramunznerhttpwwwcsubcca~tmmtalkshtmlvad15d3

Visualization Analysis amp Design

Tamara MunznerDepartment of Computer ScienceUniversity of British Columbia

D3 Unconference KeynoteNovember 21 2015 San Francisco CA

Hauser
Typewritten Text
(minor edits Helwig Hauser)
Hauser
Typewritten Text

Defining visualization (vis)

2

Computer-based visualization systems provide visual representations of datasets designed to help people carry out tasks more effectively

Why have a human in the loop

bull donrsquot need vis when fully automatic solution exists and is trusted

bull many analysis problems ill-specifiedndash donrsquot know exactly what questions to ask in advance

bull possibilitiesndash long-term use for end users (eg exploratory analysis of scientific data)ndash presentation of known results ndash stepping stone to better understanding of requirements before developing modelsndash help developers of automatic solution refinedebug determine parametersndash help end users of automatic solutions verify build trust 3

Computer-based visualization systems provide visual representations of datasets designed to help people carry out tasks more effectively

Visualization is suitable when there is a need to augment human capabilities rather than replace people with computational decision-making methods

Why use an external representation

bull external representation replace cognition with perception

4

Computer-based visualization systems provide visual representations of datasets designed to help people carry out tasks more effectively

[Cerebral Visualizing Multiple Experimental Conditions on a Graph with Biological Context Barsky Munzner Gardy and Kincaid IEEE TVCG (Proc InfoVis) 14(6)1253-1260 2008]

Why represent all the data

bull summaries lose information details matter ndash confirm expected and find unexpected patternsndash assess validity of statistical model

5

Identical statisticsIdentical statisticsx mean 9x variance 10y mean 8y variance 4xy correlation 1

Anscombersquos Quartet

Computer-based visualization systems provide visual representations of datasets designed to help people carry out tasks more effectively

Why represent all the data

bull summaries lose information details matter ndash confirm expected and find unexpected patternsndash assess validity of statistical model

5

Identical statisticsIdentical statisticsx mean 9x variance 10y mean 8y variance 4xy correlation 1

Anscombersquos Quartet

Computer-based visualization systems provide visual representations of datasets designed to help people carry out tasks more effectively

Analysis framework Four levels three questions

bull domain situationndash who are the target users

bull abstractionndash translate from specifics of domain to vocabulary of visbull what is shown data abstraction

bull often donrsquot just draw what yoursquore given transform to new formbull why is the user looking at it task abstraction

bull idiombull how is it shown

bull visual encoding idiom how to draw

bull interaction idiom how to manipulate

bull algorithmndash efficient computation

6

algorithmidiom

abstraction

domain

[A Nested Model of Visualization Design and Validation

Munzner IEEE TVCG 15(6)921-928 2009 (Proc InfoVis 2009) ]

algorithm

idiom

abstraction

domain

[A Multi-Level Typology of Abstract Visualization Tasks

Brehmer and Munzner IEEE TVCG 19(12)2376-2385 2013 (Proc InfoVis 2013) ]

Why is validation difficult

bull different ways to get it wrong at each level

7

Domain situationYou misunderstood their needs

Yoursquore showing them the wrong thing

Visual encodinginteraction idiomThe way you show it doesnrsquot work

AlgorithmYour code is too slow

Datatask abstraction

8

Why is validation difficult

Domain situationObserve target users using existing tools

Visual encodinginteraction idiomJustify design with respect to alternatives

AlgorithmMeasure system timememoryAnalyze computational complexity

Observe target users after deployment ( )

Measure adoption

Analyze results qualitativelyMeasure human time with lab experiment (lab study)

Datatask abstraction

computer science

design

cognitive psychology

anthropologyethnography

anthropologyethnography

technique-driven work

[A Nested Model of Visualization Design and Validation Munzner IEEE TVCG 15(6)921-928 2009 (Proc InfoVis 2009) ]

bull solution use methods from different fields at each level

Hauser
Typewritten Text

8

Why is validation difficult

Domain situationObserve target users using existing tools

Visual encodinginteraction idiomJustify design with respect to alternatives

AlgorithmMeasure system timememoryAnalyze computational complexity

Observe target users after deployment ( )

Measure adoption

Analyze results qualitativelyMeasure human time with lab experiment (lab study)

Datatask abstraction

computer science

design

cognitive psychology

anthropologyethnography

anthropologyethnography

technique-driven work

[A Nested Model of Visualization Design and Validation Munzner IEEE TVCG 15(6)921-928 2009 (Proc InfoVis 2009) ]

bull solution use methods from different fields at each level

Hauser
Typewritten Text

8

Why is validation difficult

Domain situationObserve target users using existing tools

Visual encodinginteraction idiomJustify design with respect to alternatives

AlgorithmMeasure system timememoryAnalyze computational complexity

Observe target users after deployment ( )

Measure adoption

Analyze results qualitativelyMeasure human time with lab experiment (lab study)

Datatask abstraction

computer science

design

cognitive psychology

anthropologyethnography

anthropologyethnography

problem-driven work

technique-driven work

[A Nested Model of Visualization Design and Validation Munzner IEEE TVCG 15(6)921-928 2009 (Proc InfoVis 2009) ]

bull solution use methods from different fields at each level

Why analyze

bull imposes a structure on huge design spacendash scaffold to help you think

systematically about choicesndash analyzing existing as stepping stone

to designing new

Why analyze

bull imposes a structure on huge design spacendash scaffold to help you think

systematically about choicesndash analyzing existing as stepping stone

to designing new

9

[SpaceTree Supporting Exploration in Large Node Link Tree Design Evolution and Empirical Evaluation Grosjean Plaisant and Bederson Proc InfoVis 2002 p 57ndash64]

SpaceTree

[TreeJuxtaposer Scalable Tree Comparison Using Focus+Context With Guaranteed Visibility ACM Trans on Graphics (Proc SIGGRAPH) 22453ndash 462 2003]

TreeJuxtaposer

Present Locate Identify

Path between two nodes

Actions

Targets

SpaceTree

TreeJuxtaposer

Encode Navigate Select Filter AggregateTree

Arrange

Why What How

Encode Navigate Select

Datasets

WhatAttributes

Dataset Types

Data Types

Data and Dataset Types

Tables

Attributes (columns)

Items (rows)

Cell containing value

Networks

Link

Node (item)

Trees

Fields (Continuous)

Geometry (Spatial)

Attributes (columns)

Value in cell

Cell

Multidimensional Table

Value in cell

Items Attributes Links Positions Grids

Attribute Types

Ordering Direction

Categorical

OrderedOrdinal

Quantitative

Sequential

Diverging

Cyclic

Tables Networks amp Trees

Fields Geometry Clusters Sets Lists

Items

Attributes

Items (nodes)

Links

Attributes

Grids

Positions

Attributes

Items

Positions

Items

Grid of positions

Position10

Why

How

What

Dataset Availability

Static Dynamic

Types Datasets and data

11

Tables

Attributes (columns)

Items (rows)

Cell containing value

Dataset Types

Attribute TypesCategorical Ordered

Ordinal Quantitative

Networks

Link

Node (item)

Node (item)

Fields (Continuous)

Attributes (columns)

Value in cell

Cell

Grid of positions

Geometry (Spatial)

Position

SpatialNetworks

12

bull action target pairsndash discover distribution

ndash compare trends

ndash locate outliers

ndash browse topology

Trends

Actions

Analyze

Search

Query

Why

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Network Data

Spatial DataShape

Topology

Paths

Extremes

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

Identify Compare Summarize

tag

Target known Target unknown

Location knownLocation unknown

Lookup

Locate

Browse

Explore

Targets

Why

How

What

13

Actions 1 Analyzebull consume

ndashdiscover vs presentbull classic splitbull aka explore vs explain

ndashenjoybull newcomerbull aka casual social

bull producendashannotate recordndashderive

bull crucial design choice

Analyze

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

tag

Derive

bull donrsquot just draw what yoursquore givenndash decide what the right thing to show isndash create it with a series of transformations from the original datasetndash draw that

bull one of the four major strategies for handling complexity

14Original Data

exports

imports

Derived Data

trade balance = exports imports

trade balance

Analysis example Derive one attribute

15

[Using Strahler numbers for real time visual exploration of huge graphs Auber Proc Intl Conf Computer Vision and Graphics pp 56ndash69 2002]

bull Strahler numberndash centrality metric for treesnetworksndash derived quantitative attribute

ndash draw top 5K of 500K for good skeleton

Task 1

58

54

64

84

24

74

6484

84

94

74

OutQuantitative attribute on nodes

58

54

64

84

24

74

6484

84

94

74

InQuantitative attribute on nodes

Task 2

Derive

WhyWhat

In Tree ReduceSummarize

HowWhyWhat

In Quantitative attribute on nodes TopologyIn Tree

Filter

InTree

OutFiltered TreeRemoved unimportant parts

InTree +

Out Quantitative attribute on nodes Out Filtered Tree

16

Actions II Search

bull what does user knowndash target location

Search

Target known Target unknown

Location known

Location unknown

Lookup

Locate

Browse

Explore

17

Actions III Query

bull what does user knowndash target location

bull how much of the data mattersndash one some all

bull analyze search queryndash independent choices for each

Search

Query

Identify Compare Summarize

Target known Target unknown

Location known

Location unknown

Lookup

Locate

Browse

Explore

Targets

18

Trends

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Extremes

Network Data

Spatial DataShape

Topology

Paths

19

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

How to encode Arrange space map channels

20

Encode

ArrangeExpress Separate

Order Align

Use

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

Encoding visually

bull analyze idiom structure

21

22

Definitions Marks and channelsbull marks

ndash geometric primitives

bull channelsndash control appearance of marks

Horizontal

Position

Vertical Both

Color

Shape Tilt

Size

Length Area Volume

Points Lines Areas

Encoding visually with marks and channels

bull analyze idiom structurendash as combination of marks and channels

23

1 vertical position

mark line

2 vertical positionhorizontal position

mark point

3 vertical positionhorizontal positioncolor hue

mark point

4 vertical positionhorizontal positioncolor huesize (area)

mark point

24

Channels Expressiveness types and effectiveness rankingsMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

25

Channels Matching TypesMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

bull expressiveness principlendash match channel and data characteristics

26

Channels RankingsMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

bull expressiveness principlendash match channel and data characteristics

bull effectiveness principlendash encode most important attributes with

highest ranked channels

27

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

How to handle complexity 3 more strategies

28

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull change view over timebull facet across multiple

viewsbull reduce itemsattributes

within single viewbull derive new data to

show within view

How to handle complexity 3 more strategies

29

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull change over time- most obvious amp flexible

of the 4 strategies

Idiom Animated transitionsbull smooth transition from one state to another

ndash alternative to jump cutsndash support for item tracking when amount of change is limited

bull example multilevel matrix viewsndash scope of what is shown narrows down

bull middle block stretches to fill space additional structure appears withinbull other blocks squish down to increasingly aggregated representations

30[Using Multilevel Call Matrices in Large Software Projects van Ham Proc IEEE Symp Information Visualization (InfoVis) pp 227ndash232 2003]

How to handle complexity 3 more strategies

31

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull facet data across multiple views

Facet

32

Juxtapose

Partition

Superimpose

Coordinate Multiple Side By Side Views

Share Encoding SameDierent

Share Data AllSubsetNone

Share Navigation

Linked Highlighting

Idiom Linked highlighting

33

System EDVbull see how regions

contiguous in one view are distributed within anotherndash powerful and pervasive

interaction idiom

bull encoding differentndashmultiform

bull data all shared

[Visual Exploration of Large Structured Datasets Wills Proc New Techniques and Trends in Statistics (NTTS) pp 237ndash246 IOS Press 1995]

Idiom birdrsquos-eye maps

34

bull encoding samebull data subset sharedbull navigation shared

ndash bidirectional linking

bull differencesndash viewpointndash (size)

bull overview-detail

System Google Maps

[A Review of Overview+Detail Zooming and Focus+Context Interfaces Cockburn Karlson and Bederson ACM Computing Surveys 411 (2008) 1ndash31]

Idiom Small multiplesbull encoding samebull data none shared

ndash different attributes for node colors

ndash (same network layout)

bull navigation shared

35

System Cerebral

[Cerebral Visualizing Multiple Experimental Conditions on a Graph with Biological Context Barsky Munzner Gardy and Kincaid IEEE Trans Visualization and Computer Graphics (Proc InfoVis 2008) 146 (2008) 1253ndash1260]

Coordinate views Design choice interaction

36

All Subset

Same

Multiform

Multiform Overview

Detail

None

Redundant

No Linkage

Small Multiples

OverviewDetail

bull why juxtapose viewsndash benefits eyes vs memory

bull lower cognitive load to move eyes between 2 views than remembering previous state with single changing view

ndash costs display area 2 views side by side each have only half the area of one view

Partition into views

37

bull how to divide data between viewsndash encodes association between items

using spatial proximity ndash major implications for what patterns

are visiblendash split according to attributes

bull design choicesndash how many splits

bull all the way down one mark per regionbull stop earlier for more complex structure

within region

ndash order in which attribs used to splitndash how many views

Partition into Side-by-Side Views

Partitioning List alignmentbull single bar chart with grouped bars

ndash split by state into regionsbull complex glyph within each region showing all ages

ndash compare easy within state hard across ages

bull small-multiple bar chartsndash split by age into regions

bull one chart per region

ndash compare easy within age harder across states

38

110

100

90

80

70

60

50

40

30

20

10

00 CA TK NY FL IL PA

65 Years and Over45 to 64 Years25 to 44 Years18 to 24 Years14 to 17 Years5 to 13 YearsUnder 5 Years

CA TK NY FL IL PA

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

httpblocksorgmbostock3887051 httpblocksorgmbostock4679202

Partitioning Recursive subdivision

bull split by neighborhoodbull then by type bull then time

ndash years as rowsndash months as columns

bull color by price

bull neighborhood patternsndash where itrsquos expensivendash where you pay much more

for detached type

39[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull switch order of splitsndash type then neighborhood

bull switch colorndash by price variation

bull type patternsndash within specific type which

neighborhoods inconsistent

40[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull different encoding for second-level regionsndash choropleth maps

41[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

How to handle complexity 3 more strategies

42

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull reduce what is shown within single view

Reduce items and attributes

43

bull reduceincrease inversesbull filter

ndash pro straightforward and intuitivebull to understand and compute

ndash con out of sight out of mind

bull aggregationndash pro inform about whole setndash con difficult to avoid losing signal

bull not mutually exclusivendash combine filter aggregatendash combine reduce facet change derive

Reduce

Filter

Aggregate

Embed

Reducing Items and Attributes

FilterItems

Attributes

Aggregate

Items

Attributes

Idiom boxplotbull static item aggregationbull task find distributionbull data tablebull derived data

ndash 5 quant attribsbull median central linebull lower and upper quartile boxesbull lower upper fences whiskers

ndash values beyond which items are outliers

ndash outliers beyond fence cutoffs explicitly shown

44

pod and the rug plot looks like the seeds within Kampstra (2008) also suggests a way of comparing two

groups more easily use the left and right sides of the bean to display different distributions A related idea

is the raindrop plot (Barrowman and Myers 2003) but its focus is on the display of error distributions from

complex models

Figure 4 demonstrates these density boxplots applied to 100 numbers drawn from each of four distribu-

tions with mean 0 and standard deviation 1 a standard normal a skew-right distribution (Johnson distri-

bution with skewness 22 and kurtosis 13) a leptikurtic distribution (Johnson distribution with skewness 0

and kurtosis 20) and a bimodal distribution (two normals with mean -095 and 095 and standard devia-

tion 031) Richer displays of density make it much easier to see important variations in the distribution

multi-modality is particularly important and yet completely invisible with the boxplot

n s k mm

2

02

4

n s k mm

2

02

4

n s k mm

4

2

02

4

4

2

02

4

n s k mm

Figure 4 From left to right box plot vase plot violin plot and bean plot Within each plot the distributions from left to

right are standard normal (n) right-skewed (s) leptikurtic (k) and bimodal (mm) A normal kernel and bandwidth of

02 are used in all plots for all groups

A more sophisticated display is the sectioned density plot (Cohen and Cohen 2006) which uses both

colour and space to stack a density estimate into a smaller area hopefully without losing any information

(not formally verified with a perceptual study) The sectioned density plot is similar in spirit to horizon

graphs for time series (Reijner 2008) which have been found to be just as readable as regular line graphs

despite taking up much less space (Heer et al 2009) The density strips of Jackson (2008) provide a similar

compact display that uses colour instead of width to display density These methods are shown in Figure 5

6

[40 years of boxplots Wickham and Stryjewski 2012 hadconz]

Idiom Dimensionality reduction for documents

45

Task 1

InHD data

Out2D data

ProduceIn High- dimensional data

WhyWhat

Derive

In2D data

Task 2

Out 2D data

HowWhyWhat

EncodeNavigateSelect

DiscoverExploreIdentify

In 2D dataOut ScatterplotOut Clusters amp points

OutScatterplotClusters amp points

Task 3

InScatterplotClusters amp points

OutLabels for clusters

WhyWhat

ProduceAnnotate

In ScatterplotIn Clusters amp pointsOut Labels for clusters

wombat

bull attribute aggregationndash derive low-dimensional target space from high-dimensional measured space

46

Datasets

WhatAttributes

Dataset Types

Data Types

Data and Dataset Types

Tables

Attributes (columns)

Items (rows)

Cell containing value

Networks

Link

Node (item)

Trees

Fields (Continuous)

Geometry (Spatial)

Attributes (columns)

Value in cell

Cell

Multidimensional Table

Value in cell

Items Attributes Links Positions Grids

Attribute Types

Ordering Direction

Categorical

OrderedOrdinal

Quantitative

Sequential

Diverging

Cyclic

Tables Networks amp Trees

Fields Geometry Clusters Sets Lists

Items

Attributes

Items (nodes)

Links

Attributes

Grids

Positions

Attributes

Items

Positions

Items

Grid of positions

Position

Trends

Actions

Analyze

Search

Query

Why

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Network Data

Spatial Data

Topology

Paths

Extremes

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

Identify Compare Summarize

tag

Target known Target unknown

Location knownLocation unknown

Lookup

Locate

Browse

Explore

Targets

Why

What

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

algorithm

idiom

abstraction

domain

More Informationbull this talk

httpwwwcsubcca~tmmtalkshtmlvad15d3

bull book page (including tutorial lecture slides)httpwwwcsubcca~tmmvadbook

ndash 20 promo code for book+ebook combo HVN17

ndash httpwwwcrcpresscomproductisbn9781466508910

ndash illustrations Eamonn Maguire

bull papers videos software talks full courses httpwwwcsubccagroupinfovis httpwwwcsubcca~tmm

47Munzner A K Peters Visualization Series CRC Press Visualization Series 2014

Visualization Analysis and Design

tamaramunzner

Page 2: Visualization Analysis & Designii.uib.no/vis_old/publications/pdfs/2016-01-17--vad15d3unconf--edited-by-HH.pdf · Visualization is suitable when there is a need to augment human capabilities

Defining visualization (vis)

2

Computer-based visualization systems provide visual representations of datasets designed to help people carry out tasks more effectively

Why have a human in the loop

bull donrsquot need vis when fully automatic solution exists and is trusted

bull many analysis problems ill-specifiedndash donrsquot know exactly what questions to ask in advance

bull possibilitiesndash long-term use for end users (eg exploratory analysis of scientific data)ndash presentation of known results ndash stepping stone to better understanding of requirements before developing modelsndash help developers of automatic solution refinedebug determine parametersndash help end users of automatic solutions verify build trust 3

Computer-based visualization systems provide visual representations of datasets designed to help people carry out tasks more effectively

Visualization is suitable when there is a need to augment human capabilities rather than replace people with computational decision-making methods

Why use an external representation

bull external representation replace cognition with perception

4

Computer-based visualization systems provide visual representations of datasets designed to help people carry out tasks more effectively

[Cerebral Visualizing Multiple Experimental Conditions on a Graph with Biological Context Barsky Munzner Gardy and Kincaid IEEE TVCG (Proc InfoVis) 14(6)1253-1260 2008]

Why represent all the data

bull summaries lose information details matter ndash confirm expected and find unexpected patternsndash assess validity of statistical model

5

Identical statisticsIdentical statisticsx mean 9x variance 10y mean 8y variance 4xy correlation 1

Anscombersquos Quartet

Computer-based visualization systems provide visual representations of datasets designed to help people carry out tasks more effectively

Why represent all the data

bull summaries lose information details matter ndash confirm expected and find unexpected patternsndash assess validity of statistical model

5

Identical statisticsIdentical statisticsx mean 9x variance 10y mean 8y variance 4xy correlation 1

Anscombersquos Quartet

Computer-based visualization systems provide visual representations of datasets designed to help people carry out tasks more effectively

Analysis framework Four levels three questions

bull domain situationndash who are the target users

bull abstractionndash translate from specifics of domain to vocabulary of visbull what is shown data abstraction

bull often donrsquot just draw what yoursquore given transform to new formbull why is the user looking at it task abstraction

bull idiombull how is it shown

bull visual encoding idiom how to draw

bull interaction idiom how to manipulate

bull algorithmndash efficient computation

6

algorithmidiom

abstraction

domain

[A Nested Model of Visualization Design and Validation

Munzner IEEE TVCG 15(6)921-928 2009 (Proc InfoVis 2009) ]

algorithm

idiom

abstraction

domain

[A Multi-Level Typology of Abstract Visualization Tasks

Brehmer and Munzner IEEE TVCG 19(12)2376-2385 2013 (Proc InfoVis 2013) ]

Why is validation difficult

bull different ways to get it wrong at each level

7

Domain situationYou misunderstood their needs

Yoursquore showing them the wrong thing

Visual encodinginteraction idiomThe way you show it doesnrsquot work

AlgorithmYour code is too slow

Datatask abstraction

8

Why is validation difficult

Domain situationObserve target users using existing tools

Visual encodinginteraction idiomJustify design with respect to alternatives

AlgorithmMeasure system timememoryAnalyze computational complexity

Observe target users after deployment ( )

Measure adoption

Analyze results qualitativelyMeasure human time with lab experiment (lab study)

Datatask abstraction

computer science

design

cognitive psychology

anthropologyethnography

anthropologyethnography

technique-driven work

[A Nested Model of Visualization Design and Validation Munzner IEEE TVCG 15(6)921-928 2009 (Proc InfoVis 2009) ]

bull solution use methods from different fields at each level

Hauser
Typewritten Text

8

Why is validation difficult

Domain situationObserve target users using existing tools

Visual encodinginteraction idiomJustify design with respect to alternatives

AlgorithmMeasure system timememoryAnalyze computational complexity

Observe target users after deployment ( )

Measure adoption

Analyze results qualitativelyMeasure human time with lab experiment (lab study)

Datatask abstraction

computer science

design

cognitive psychology

anthropologyethnography

anthropologyethnography

technique-driven work

[A Nested Model of Visualization Design and Validation Munzner IEEE TVCG 15(6)921-928 2009 (Proc InfoVis 2009) ]

bull solution use methods from different fields at each level

Hauser
Typewritten Text

8

Why is validation difficult

Domain situationObserve target users using existing tools

Visual encodinginteraction idiomJustify design with respect to alternatives

AlgorithmMeasure system timememoryAnalyze computational complexity

Observe target users after deployment ( )

Measure adoption

Analyze results qualitativelyMeasure human time with lab experiment (lab study)

Datatask abstraction

computer science

design

cognitive psychology

anthropologyethnography

anthropologyethnography

problem-driven work

technique-driven work

[A Nested Model of Visualization Design and Validation Munzner IEEE TVCG 15(6)921-928 2009 (Proc InfoVis 2009) ]

bull solution use methods from different fields at each level

Why analyze

bull imposes a structure on huge design spacendash scaffold to help you think

systematically about choicesndash analyzing existing as stepping stone

to designing new

Why analyze

bull imposes a structure on huge design spacendash scaffold to help you think

systematically about choicesndash analyzing existing as stepping stone

to designing new

9

[SpaceTree Supporting Exploration in Large Node Link Tree Design Evolution and Empirical Evaluation Grosjean Plaisant and Bederson Proc InfoVis 2002 p 57ndash64]

SpaceTree

[TreeJuxtaposer Scalable Tree Comparison Using Focus+Context With Guaranteed Visibility ACM Trans on Graphics (Proc SIGGRAPH) 22453ndash 462 2003]

TreeJuxtaposer

Present Locate Identify

Path between two nodes

Actions

Targets

SpaceTree

TreeJuxtaposer

Encode Navigate Select Filter AggregateTree

Arrange

Why What How

Encode Navigate Select

Datasets

WhatAttributes

Dataset Types

Data Types

Data and Dataset Types

Tables

Attributes (columns)

Items (rows)

Cell containing value

Networks

Link

Node (item)

Trees

Fields (Continuous)

Geometry (Spatial)

Attributes (columns)

Value in cell

Cell

Multidimensional Table

Value in cell

Items Attributes Links Positions Grids

Attribute Types

Ordering Direction

Categorical

OrderedOrdinal

Quantitative

Sequential

Diverging

Cyclic

Tables Networks amp Trees

Fields Geometry Clusters Sets Lists

Items

Attributes

Items (nodes)

Links

Attributes

Grids

Positions

Attributes

Items

Positions

Items

Grid of positions

Position10

Why

How

What

Dataset Availability

Static Dynamic

Types Datasets and data

11

Tables

Attributes (columns)

Items (rows)

Cell containing value

Dataset Types

Attribute TypesCategorical Ordered

Ordinal Quantitative

Networks

Link

Node (item)

Node (item)

Fields (Continuous)

Attributes (columns)

Value in cell

Cell

Grid of positions

Geometry (Spatial)

Position

SpatialNetworks

12

bull action target pairsndash discover distribution

ndash compare trends

ndash locate outliers

ndash browse topology

Trends

Actions

Analyze

Search

Query

Why

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Network Data

Spatial DataShape

Topology

Paths

Extremes

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

Identify Compare Summarize

tag

Target known Target unknown

Location knownLocation unknown

Lookup

Locate

Browse

Explore

Targets

Why

How

What

13

Actions 1 Analyzebull consume

ndashdiscover vs presentbull classic splitbull aka explore vs explain

ndashenjoybull newcomerbull aka casual social

bull producendashannotate recordndashderive

bull crucial design choice

Analyze

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

tag

Derive

bull donrsquot just draw what yoursquore givenndash decide what the right thing to show isndash create it with a series of transformations from the original datasetndash draw that

bull one of the four major strategies for handling complexity

14Original Data

exports

imports

Derived Data

trade balance = exports imports

trade balance

Analysis example Derive one attribute

15

[Using Strahler numbers for real time visual exploration of huge graphs Auber Proc Intl Conf Computer Vision and Graphics pp 56ndash69 2002]

bull Strahler numberndash centrality metric for treesnetworksndash derived quantitative attribute

ndash draw top 5K of 500K for good skeleton

Task 1

58

54

64

84

24

74

6484

84

94

74

OutQuantitative attribute on nodes

58

54

64

84

24

74

6484

84

94

74

InQuantitative attribute on nodes

Task 2

Derive

WhyWhat

In Tree ReduceSummarize

HowWhyWhat

In Quantitative attribute on nodes TopologyIn Tree

Filter

InTree

OutFiltered TreeRemoved unimportant parts

InTree +

Out Quantitative attribute on nodes Out Filtered Tree

16

Actions II Search

bull what does user knowndash target location

Search

Target known Target unknown

Location known

Location unknown

Lookup

Locate

Browse

Explore

17

Actions III Query

bull what does user knowndash target location

bull how much of the data mattersndash one some all

bull analyze search queryndash independent choices for each

Search

Query

Identify Compare Summarize

Target known Target unknown

Location known

Location unknown

Lookup

Locate

Browse

Explore

Targets

18

Trends

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Extremes

Network Data

Spatial DataShape

Topology

Paths

19

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

How to encode Arrange space map channels

20

Encode

ArrangeExpress Separate

Order Align

Use

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

Encoding visually

bull analyze idiom structure

21

22

Definitions Marks and channelsbull marks

ndash geometric primitives

bull channelsndash control appearance of marks

Horizontal

Position

Vertical Both

Color

Shape Tilt

Size

Length Area Volume

Points Lines Areas

Encoding visually with marks and channels

bull analyze idiom structurendash as combination of marks and channels

23

1 vertical position

mark line

2 vertical positionhorizontal position

mark point

3 vertical positionhorizontal positioncolor hue

mark point

4 vertical positionhorizontal positioncolor huesize (area)

mark point

24

Channels Expressiveness types and effectiveness rankingsMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

25

Channels Matching TypesMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

bull expressiveness principlendash match channel and data characteristics

26

Channels RankingsMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

bull expressiveness principlendash match channel and data characteristics

bull effectiveness principlendash encode most important attributes with

highest ranked channels

27

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

How to handle complexity 3 more strategies

28

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull change view over timebull facet across multiple

viewsbull reduce itemsattributes

within single viewbull derive new data to

show within view

How to handle complexity 3 more strategies

29

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull change over time- most obvious amp flexible

of the 4 strategies

Idiom Animated transitionsbull smooth transition from one state to another

ndash alternative to jump cutsndash support for item tracking when amount of change is limited

bull example multilevel matrix viewsndash scope of what is shown narrows down

bull middle block stretches to fill space additional structure appears withinbull other blocks squish down to increasingly aggregated representations

30[Using Multilevel Call Matrices in Large Software Projects van Ham Proc IEEE Symp Information Visualization (InfoVis) pp 227ndash232 2003]

How to handle complexity 3 more strategies

31

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull facet data across multiple views

Facet

32

Juxtapose

Partition

Superimpose

Coordinate Multiple Side By Side Views

Share Encoding SameDierent

Share Data AllSubsetNone

Share Navigation

Linked Highlighting

Idiom Linked highlighting

33

System EDVbull see how regions

contiguous in one view are distributed within anotherndash powerful and pervasive

interaction idiom

bull encoding differentndashmultiform

bull data all shared

[Visual Exploration of Large Structured Datasets Wills Proc New Techniques and Trends in Statistics (NTTS) pp 237ndash246 IOS Press 1995]

Idiom birdrsquos-eye maps

34

bull encoding samebull data subset sharedbull navigation shared

ndash bidirectional linking

bull differencesndash viewpointndash (size)

bull overview-detail

System Google Maps

[A Review of Overview+Detail Zooming and Focus+Context Interfaces Cockburn Karlson and Bederson ACM Computing Surveys 411 (2008) 1ndash31]

Idiom Small multiplesbull encoding samebull data none shared

ndash different attributes for node colors

ndash (same network layout)

bull navigation shared

35

System Cerebral

[Cerebral Visualizing Multiple Experimental Conditions on a Graph with Biological Context Barsky Munzner Gardy and Kincaid IEEE Trans Visualization and Computer Graphics (Proc InfoVis 2008) 146 (2008) 1253ndash1260]

Coordinate views Design choice interaction

36

All Subset

Same

Multiform

Multiform Overview

Detail

None

Redundant

No Linkage

Small Multiples

OverviewDetail

bull why juxtapose viewsndash benefits eyes vs memory

bull lower cognitive load to move eyes between 2 views than remembering previous state with single changing view

ndash costs display area 2 views side by side each have only half the area of one view

Partition into views

37

bull how to divide data between viewsndash encodes association between items

using spatial proximity ndash major implications for what patterns

are visiblendash split according to attributes

bull design choicesndash how many splits

bull all the way down one mark per regionbull stop earlier for more complex structure

within region

ndash order in which attribs used to splitndash how many views

Partition into Side-by-Side Views

Partitioning List alignmentbull single bar chart with grouped bars

ndash split by state into regionsbull complex glyph within each region showing all ages

ndash compare easy within state hard across ages

bull small-multiple bar chartsndash split by age into regions

bull one chart per region

ndash compare easy within age harder across states

38

110

100

90

80

70

60

50

40

30

20

10

00 CA TK NY FL IL PA

65 Years and Over45 to 64 Years25 to 44 Years18 to 24 Years14 to 17 Years5 to 13 YearsUnder 5 Years

CA TK NY FL IL PA

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

httpblocksorgmbostock3887051 httpblocksorgmbostock4679202

Partitioning Recursive subdivision

bull split by neighborhoodbull then by type bull then time

ndash years as rowsndash months as columns

bull color by price

bull neighborhood patternsndash where itrsquos expensivendash where you pay much more

for detached type

39[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull switch order of splitsndash type then neighborhood

bull switch colorndash by price variation

bull type patternsndash within specific type which

neighborhoods inconsistent

40[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull different encoding for second-level regionsndash choropleth maps

41[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

How to handle complexity 3 more strategies

42

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull reduce what is shown within single view

Reduce items and attributes

43

bull reduceincrease inversesbull filter

ndash pro straightforward and intuitivebull to understand and compute

ndash con out of sight out of mind

bull aggregationndash pro inform about whole setndash con difficult to avoid losing signal

bull not mutually exclusivendash combine filter aggregatendash combine reduce facet change derive

Reduce

Filter

Aggregate

Embed

Reducing Items and Attributes

FilterItems

Attributes

Aggregate

Items

Attributes

Idiom boxplotbull static item aggregationbull task find distributionbull data tablebull derived data

ndash 5 quant attribsbull median central linebull lower and upper quartile boxesbull lower upper fences whiskers

ndash values beyond which items are outliers

ndash outliers beyond fence cutoffs explicitly shown

44

pod and the rug plot looks like the seeds within Kampstra (2008) also suggests a way of comparing two

groups more easily use the left and right sides of the bean to display different distributions A related idea

is the raindrop plot (Barrowman and Myers 2003) but its focus is on the display of error distributions from

complex models

Figure 4 demonstrates these density boxplots applied to 100 numbers drawn from each of four distribu-

tions with mean 0 and standard deviation 1 a standard normal a skew-right distribution (Johnson distri-

bution with skewness 22 and kurtosis 13) a leptikurtic distribution (Johnson distribution with skewness 0

and kurtosis 20) and a bimodal distribution (two normals with mean -095 and 095 and standard devia-

tion 031) Richer displays of density make it much easier to see important variations in the distribution

multi-modality is particularly important and yet completely invisible with the boxplot

n s k mm

2

02

4

n s k mm

2

02

4

n s k mm

4

2

02

4

4

2

02

4

n s k mm

Figure 4 From left to right box plot vase plot violin plot and bean plot Within each plot the distributions from left to

right are standard normal (n) right-skewed (s) leptikurtic (k) and bimodal (mm) A normal kernel and bandwidth of

02 are used in all plots for all groups

A more sophisticated display is the sectioned density plot (Cohen and Cohen 2006) which uses both

colour and space to stack a density estimate into a smaller area hopefully without losing any information

(not formally verified with a perceptual study) The sectioned density plot is similar in spirit to horizon

graphs for time series (Reijner 2008) which have been found to be just as readable as regular line graphs

despite taking up much less space (Heer et al 2009) The density strips of Jackson (2008) provide a similar

compact display that uses colour instead of width to display density These methods are shown in Figure 5

6

[40 years of boxplots Wickham and Stryjewski 2012 hadconz]

Idiom Dimensionality reduction for documents

45

Task 1

InHD data

Out2D data

ProduceIn High- dimensional data

WhyWhat

Derive

In2D data

Task 2

Out 2D data

HowWhyWhat

EncodeNavigateSelect

DiscoverExploreIdentify

In 2D dataOut ScatterplotOut Clusters amp points

OutScatterplotClusters amp points

Task 3

InScatterplotClusters amp points

OutLabels for clusters

WhyWhat

ProduceAnnotate

In ScatterplotIn Clusters amp pointsOut Labels for clusters

wombat

bull attribute aggregationndash derive low-dimensional target space from high-dimensional measured space

46

Datasets

WhatAttributes

Dataset Types

Data Types

Data and Dataset Types

Tables

Attributes (columns)

Items (rows)

Cell containing value

Networks

Link

Node (item)

Trees

Fields (Continuous)

Geometry (Spatial)

Attributes (columns)

Value in cell

Cell

Multidimensional Table

Value in cell

Items Attributes Links Positions Grids

Attribute Types

Ordering Direction

Categorical

OrderedOrdinal

Quantitative

Sequential

Diverging

Cyclic

Tables Networks amp Trees

Fields Geometry Clusters Sets Lists

Items

Attributes

Items (nodes)

Links

Attributes

Grids

Positions

Attributes

Items

Positions

Items

Grid of positions

Position

Trends

Actions

Analyze

Search

Query

Why

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Network Data

Spatial Data

Topology

Paths

Extremes

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

Identify Compare Summarize

tag

Target known Target unknown

Location knownLocation unknown

Lookup

Locate

Browse

Explore

Targets

Why

What

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

algorithm

idiom

abstraction

domain

More Informationbull this talk

httpwwwcsubcca~tmmtalkshtmlvad15d3

bull book page (including tutorial lecture slides)httpwwwcsubcca~tmmvadbook

ndash 20 promo code for book+ebook combo HVN17

ndash httpwwwcrcpresscomproductisbn9781466508910

ndash illustrations Eamonn Maguire

bull papers videos software talks full courses httpwwwcsubccagroupinfovis httpwwwcsubcca~tmm

47Munzner A K Peters Visualization Series CRC Press Visualization Series 2014

Visualization Analysis and Design

tamaramunzner

Page 3: Visualization Analysis & Designii.uib.no/vis_old/publications/pdfs/2016-01-17--vad15d3unconf--edited-by-HH.pdf · Visualization is suitable when there is a need to augment human capabilities

Why have a human in the loop

bull donrsquot need vis when fully automatic solution exists and is trusted

bull many analysis problems ill-specifiedndash donrsquot know exactly what questions to ask in advance

bull possibilitiesndash long-term use for end users (eg exploratory analysis of scientific data)ndash presentation of known results ndash stepping stone to better understanding of requirements before developing modelsndash help developers of automatic solution refinedebug determine parametersndash help end users of automatic solutions verify build trust 3

Computer-based visualization systems provide visual representations of datasets designed to help people carry out tasks more effectively

Visualization is suitable when there is a need to augment human capabilities rather than replace people with computational decision-making methods

Why use an external representation

bull external representation replace cognition with perception

4

Computer-based visualization systems provide visual representations of datasets designed to help people carry out tasks more effectively

[Cerebral Visualizing Multiple Experimental Conditions on a Graph with Biological Context Barsky Munzner Gardy and Kincaid IEEE TVCG (Proc InfoVis) 14(6)1253-1260 2008]

Why represent all the data

bull summaries lose information details matter ndash confirm expected and find unexpected patternsndash assess validity of statistical model

5

Identical statisticsIdentical statisticsx mean 9x variance 10y mean 8y variance 4xy correlation 1

Anscombersquos Quartet

Computer-based visualization systems provide visual representations of datasets designed to help people carry out tasks more effectively

Why represent all the data

bull summaries lose information details matter ndash confirm expected and find unexpected patternsndash assess validity of statistical model

5

Identical statisticsIdentical statisticsx mean 9x variance 10y mean 8y variance 4xy correlation 1

Anscombersquos Quartet

Computer-based visualization systems provide visual representations of datasets designed to help people carry out tasks more effectively

Analysis framework Four levels three questions

bull domain situationndash who are the target users

bull abstractionndash translate from specifics of domain to vocabulary of visbull what is shown data abstraction

bull often donrsquot just draw what yoursquore given transform to new formbull why is the user looking at it task abstraction

bull idiombull how is it shown

bull visual encoding idiom how to draw

bull interaction idiom how to manipulate

bull algorithmndash efficient computation

6

algorithmidiom

abstraction

domain

[A Nested Model of Visualization Design and Validation

Munzner IEEE TVCG 15(6)921-928 2009 (Proc InfoVis 2009) ]

algorithm

idiom

abstraction

domain

[A Multi-Level Typology of Abstract Visualization Tasks

Brehmer and Munzner IEEE TVCG 19(12)2376-2385 2013 (Proc InfoVis 2013) ]

Why is validation difficult

bull different ways to get it wrong at each level

7

Domain situationYou misunderstood their needs

Yoursquore showing them the wrong thing

Visual encodinginteraction idiomThe way you show it doesnrsquot work

AlgorithmYour code is too slow

Datatask abstraction

8

Why is validation difficult

Domain situationObserve target users using existing tools

Visual encodinginteraction idiomJustify design with respect to alternatives

AlgorithmMeasure system timememoryAnalyze computational complexity

Observe target users after deployment ( )

Measure adoption

Analyze results qualitativelyMeasure human time with lab experiment (lab study)

Datatask abstraction

computer science

design

cognitive psychology

anthropologyethnography

anthropologyethnography

technique-driven work

[A Nested Model of Visualization Design and Validation Munzner IEEE TVCG 15(6)921-928 2009 (Proc InfoVis 2009) ]

bull solution use methods from different fields at each level

Hauser
Typewritten Text

8

Why is validation difficult

Domain situationObserve target users using existing tools

Visual encodinginteraction idiomJustify design with respect to alternatives

AlgorithmMeasure system timememoryAnalyze computational complexity

Observe target users after deployment ( )

Measure adoption

Analyze results qualitativelyMeasure human time with lab experiment (lab study)

Datatask abstraction

computer science

design

cognitive psychology

anthropologyethnography

anthropologyethnography

technique-driven work

[A Nested Model of Visualization Design and Validation Munzner IEEE TVCG 15(6)921-928 2009 (Proc InfoVis 2009) ]

bull solution use methods from different fields at each level

Hauser
Typewritten Text

8

Why is validation difficult

Domain situationObserve target users using existing tools

Visual encodinginteraction idiomJustify design with respect to alternatives

AlgorithmMeasure system timememoryAnalyze computational complexity

Observe target users after deployment ( )

Measure adoption

Analyze results qualitativelyMeasure human time with lab experiment (lab study)

Datatask abstraction

computer science

design

cognitive psychology

anthropologyethnography

anthropologyethnography

problem-driven work

technique-driven work

[A Nested Model of Visualization Design and Validation Munzner IEEE TVCG 15(6)921-928 2009 (Proc InfoVis 2009) ]

bull solution use methods from different fields at each level

Why analyze

bull imposes a structure on huge design spacendash scaffold to help you think

systematically about choicesndash analyzing existing as stepping stone

to designing new

Why analyze

bull imposes a structure on huge design spacendash scaffold to help you think

systematically about choicesndash analyzing existing as stepping stone

to designing new

9

[SpaceTree Supporting Exploration in Large Node Link Tree Design Evolution and Empirical Evaluation Grosjean Plaisant and Bederson Proc InfoVis 2002 p 57ndash64]

SpaceTree

[TreeJuxtaposer Scalable Tree Comparison Using Focus+Context With Guaranteed Visibility ACM Trans on Graphics (Proc SIGGRAPH) 22453ndash 462 2003]

TreeJuxtaposer

Present Locate Identify

Path between two nodes

Actions

Targets

SpaceTree

TreeJuxtaposer

Encode Navigate Select Filter AggregateTree

Arrange

Why What How

Encode Navigate Select

Datasets

WhatAttributes

Dataset Types

Data Types

Data and Dataset Types

Tables

Attributes (columns)

Items (rows)

Cell containing value

Networks

Link

Node (item)

Trees

Fields (Continuous)

Geometry (Spatial)

Attributes (columns)

Value in cell

Cell

Multidimensional Table

Value in cell

Items Attributes Links Positions Grids

Attribute Types

Ordering Direction

Categorical

OrderedOrdinal

Quantitative

Sequential

Diverging

Cyclic

Tables Networks amp Trees

Fields Geometry Clusters Sets Lists

Items

Attributes

Items (nodes)

Links

Attributes

Grids

Positions

Attributes

Items

Positions

Items

Grid of positions

Position10

Why

How

What

Dataset Availability

Static Dynamic

Types Datasets and data

11

Tables

Attributes (columns)

Items (rows)

Cell containing value

Dataset Types

Attribute TypesCategorical Ordered

Ordinal Quantitative

Networks

Link

Node (item)

Node (item)

Fields (Continuous)

Attributes (columns)

Value in cell

Cell

Grid of positions

Geometry (Spatial)

Position

SpatialNetworks

12

bull action target pairsndash discover distribution

ndash compare trends

ndash locate outliers

ndash browse topology

Trends

Actions

Analyze

Search

Query

Why

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Network Data

Spatial DataShape

Topology

Paths

Extremes

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

Identify Compare Summarize

tag

Target known Target unknown

Location knownLocation unknown

Lookup

Locate

Browse

Explore

Targets

Why

How

What

13

Actions 1 Analyzebull consume

ndashdiscover vs presentbull classic splitbull aka explore vs explain

ndashenjoybull newcomerbull aka casual social

bull producendashannotate recordndashderive

bull crucial design choice

Analyze

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

tag

Derive

bull donrsquot just draw what yoursquore givenndash decide what the right thing to show isndash create it with a series of transformations from the original datasetndash draw that

bull one of the four major strategies for handling complexity

14Original Data

exports

imports

Derived Data

trade balance = exports imports

trade balance

Analysis example Derive one attribute

15

[Using Strahler numbers for real time visual exploration of huge graphs Auber Proc Intl Conf Computer Vision and Graphics pp 56ndash69 2002]

bull Strahler numberndash centrality metric for treesnetworksndash derived quantitative attribute

ndash draw top 5K of 500K for good skeleton

Task 1

58

54

64

84

24

74

6484

84

94

74

OutQuantitative attribute on nodes

58

54

64

84

24

74

6484

84

94

74

InQuantitative attribute on nodes

Task 2

Derive

WhyWhat

In Tree ReduceSummarize

HowWhyWhat

In Quantitative attribute on nodes TopologyIn Tree

Filter

InTree

OutFiltered TreeRemoved unimportant parts

InTree +

Out Quantitative attribute on nodes Out Filtered Tree

16

Actions II Search

bull what does user knowndash target location

Search

Target known Target unknown

Location known

Location unknown

Lookup

Locate

Browse

Explore

17

Actions III Query

bull what does user knowndash target location

bull how much of the data mattersndash one some all

bull analyze search queryndash independent choices for each

Search

Query

Identify Compare Summarize

Target known Target unknown

Location known

Location unknown

Lookup

Locate

Browse

Explore

Targets

18

Trends

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Extremes

Network Data

Spatial DataShape

Topology

Paths

19

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

How to encode Arrange space map channels

20

Encode

ArrangeExpress Separate

Order Align

Use

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

Encoding visually

bull analyze idiom structure

21

22

Definitions Marks and channelsbull marks

ndash geometric primitives

bull channelsndash control appearance of marks

Horizontal

Position

Vertical Both

Color

Shape Tilt

Size

Length Area Volume

Points Lines Areas

Encoding visually with marks and channels

bull analyze idiom structurendash as combination of marks and channels

23

1 vertical position

mark line

2 vertical positionhorizontal position

mark point

3 vertical positionhorizontal positioncolor hue

mark point

4 vertical positionhorizontal positioncolor huesize (area)

mark point

24

Channels Expressiveness types and effectiveness rankingsMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

25

Channels Matching TypesMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

bull expressiveness principlendash match channel and data characteristics

26

Channels RankingsMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

bull expressiveness principlendash match channel and data characteristics

bull effectiveness principlendash encode most important attributes with

highest ranked channels

27

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

How to handle complexity 3 more strategies

28

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull change view over timebull facet across multiple

viewsbull reduce itemsattributes

within single viewbull derive new data to

show within view

How to handle complexity 3 more strategies

29

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull change over time- most obvious amp flexible

of the 4 strategies

Idiom Animated transitionsbull smooth transition from one state to another

ndash alternative to jump cutsndash support for item tracking when amount of change is limited

bull example multilevel matrix viewsndash scope of what is shown narrows down

bull middle block stretches to fill space additional structure appears withinbull other blocks squish down to increasingly aggregated representations

30[Using Multilevel Call Matrices in Large Software Projects van Ham Proc IEEE Symp Information Visualization (InfoVis) pp 227ndash232 2003]

How to handle complexity 3 more strategies

31

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull facet data across multiple views

Facet

32

Juxtapose

Partition

Superimpose

Coordinate Multiple Side By Side Views

Share Encoding SameDierent

Share Data AllSubsetNone

Share Navigation

Linked Highlighting

Idiom Linked highlighting

33

System EDVbull see how regions

contiguous in one view are distributed within anotherndash powerful and pervasive

interaction idiom

bull encoding differentndashmultiform

bull data all shared

[Visual Exploration of Large Structured Datasets Wills Proc New Techniques and Trends in Statistics (NTTS) pp 237ndash246 IOS Press 1995]

Idiom birdrsquos-eye maps

34

bull encoding samebull data subset sharedbull navigation shared

ndash bidirectional linking

bull differencesndash viewpointndash (size)

bull overview-detail

System Google Maps

[A Review of Overview+Detail Zooming and Focus+Context Interfaces Cockburn Karlson and Bederson ACM Computing Surveys 411 (2008) 1ndash31]

Idiom Small multiplesbull encoding samebull data none shared

ndash different attributes for node colors

ndash (same network layout)

bull navigation shared

35

System Cerebral

[Cerebral Visualizing Multiple Experimental Conditions on a Graph with Biological Context Barsky Munzner Gardy and Kincaid IEEE Trans Visualization and Computer Graphics (Proc InfoVis 2008) 146 (2008) 1253ndash1260]

Coordinate views Design choice interaction

36

All Subset

Same

Multiform

Multiform Overview

Detail

None

Redundant

No Linkage

Small Multiples

OverviewDetail

bull why juxtapose viewsndash benefits eyes vs memory

bull lower cognitive load to move eyes between 2 views than remembering previous state with single changing view

ndash costs display area 2 views side by side each have only half the area of one view

Partition into views

37

bull how to divide data between viewsndash encodes association between items

using spatial proximity ndash major implications for what patterns

are visiblendash split according to attributes

bull design choicesndash how many splits

bull all the way down one mark per regionbull stop earlier for more complex structure

within region

ndash order in which attribs used to splitndash how many views

Partition into Side-by-Side Views

Partitioning List alignmentbull single bar chart with grouped bars

ndash split by state into regionsbull complex glyph within each region showing all ages

ndash compare easy within state hard across ages

bull small-multiple bar chartsndash split by age into regions

bull one chart per region

ndash compare easy within age harder across states

38

110

100

90

80

70

60

50

40

30

20

10

00 CA TK NY FL IL PA

65 Years and Over45 to 64 Years25 to 44 Years18 to 24 Years14 to 17 Years5 to 13 YearsUnder 5 Years

CA TK NY FL IL PA

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

httpblocksorgmbostock3887051 httpblocksorgmbostock4679202

Partitioning Recursive subdivision

bull split by neighborhoodbull then by type bull then time

ndash years as rowsndash months as columns

bull color by price

bull neighborhood patternsndash where itrsquos expensivendash where you pay much more

for detached type

39[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull switch order of splitsndash type then neighborhood

bull switch colorndash by price variation

bull type patternsndash within specific type which

neighborhoods inconsistent

40[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull different encoding for second-level regionsndash choropleth maps

41[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

How to handle complexity 3 more strategies

42

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull reduce what is shown within single view

Reduce items and attributes

43

bull reduceincrease inversesbull filter

ndash pro straightforward and intuitivebull to understand and compute

ndash con out of sight out of mind

bull aggregationndash pro inform about whole setndash con difficult to avoid losing signal

bull not mutually exclusivendash combine filter aggregatendash combine reduce facet change derive

Reduce

Filter

Aggregate

Embed

Reducing Items and Attributes

FilterItems

Attributes

Aggregate

Items

Attributes

Idiom boxplotbull static item aggregationbull task find distributionbull data tablebull derived data

ndash 5 quant attribsbull median central linebull lower and upper quartile boxesbull lower upper fences whiskers

ndash values beyond which items are outliers

ndash outliers beyond fence cutoffs explicitly shown

44

pod and the rug plot looks like the seeds within Kampstra (2008) also suggests a way of comparing two

groups more easily use the left and right sides of the bean to display different distributions A related idea

is the raindrop plot (Barrowman and Myers 2003) but its focus is on the display of error distributions from

complex models

Figure 4 demonstrates these density boxplots applied to 100 numbers drawn from each of four distribu-

tions with mean 0 and standard deviation 1 a standard normal a skew-right distribution (Johnson distri-

bution with skewness 22 and kurtosis 13) a leptikurtic distribution (Johnson distribution with skewness 0

and kurtosis 20) and a bimodal distribution (two normals with mean -095 and 095 and standard devia-

tion 031) Richer displays of density make it much easier to see important variations in the distribution

multi-modality is particularly important and yet completely invisible with the boxplot

n s k mm

2

02

4

n s k mm

2

02

4

n s k mm

4

2

02

4

4

2

02

4

n s k mm

Figure 4 From left to right box plot vase plot violin plot and bean plot Within each plot the distributions from left to

right are standard normal (n) right-skewed (s) leptikurtic (k) and bimodal (mm) A normal kernel and bandwidth of

02 are used in all plots for all groups

A more sophisticated display is the sectioned density plot (Cohen and Cohen 2006) which uses both

colour and space to stack a density estimate into a smaller area hopefully without losing any information

(not formally verified with a perceptual study) The sectioned density plot is similar in spirit to horizon

graphs for time series (Reijner 2008) which have been found to be just as readable as regular line graphs

despite taking up much less space (Heer et al 2009) The density strips of Jackson (2008) provide a similar

compact display that uses colour instead of width to display density These methods are shown in Figure 5

6

[40 years of boxplots Wickham and Stryjewski 2012 hadconz]

Idiom Dimensionality reduction for documents

45

Task 1

InHD data

Out2D data

ProduceIn High- dimensional data

WhyWhat

Derive

In2D data

Task 2

Out 2D data

HowWhyWhat

EncodeNavigateSelect

DiscoverExploreIdentify

In 2D dataOut ScatterplotOut Clusters amp points

OutScatterplotClusters amp points

Task 3

InScatterplotClusters amp points

OutLabels for clusters

WhyWhat

ProduceAnnotate

In ScatterplotIn Clusters amp pointsOut Labels for clusters

wombat

bull attribute aggregationndash derive low-dimensional target space from high-dimensional measured space

46

Datasets

WhatAttributes

Dataset Types

Data Types

Data and Dataset Types

Tables

Attributes (columns)

Items (rows)

Cell containing value

Networks

Link

Node (item)

Trees

Fields (Continuous)

Geometry (Spatial)

Attributes (columns)

Value in cell

Cell

Multidimensional Table

Value in cell

Items Attributes Links Positions Grids

Attribute Types

Ordering Direction

Categorical

OrderedOrdinal

Quantitative

Sequential

Diverging

Cyclic

Tables Networks amp Trees

Fields Geometry Clusters Sets Lists

Items

Attributes

Items (nodes)

Links

Attributes

Grids

Positions

Attributes

Items

Positions

Items

Grid of positions

Position

Trends

Actions

Analyze

Search

Query

Why

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Network Data

Spatial Data

Topology

Paths

Extremes

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

Identify Compare Summarize

tag

Target known Target unknown

Location knownLocation unknown

Lookup

Locate

Browse

Explore

Targets

Why

What

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

algorithm

idiom

abstraction

domain

More Informationbull this talk

httpwwwcsubcca~tmmtalkshtmlvad15d3

bull book page (including tutorial lecture slides)httpwwwcsubcca~tmmvadbook

ndash 20 promo code for book+ebook combo HVN17

ndash httpwwwcrcpresscomproductisbn9781466508910

ndash illustrations Eamonn Maguire

bull papers videos software talks full courses httpwwwcsubccagroupinfovis httpwwwcsubcca~tmm

47Munzner A K Peters Visualization Series CRC Press Visualization Series 2014

Visualization Analysis and Design

tamaramunzner

Page 4: Visualization Analysis & Designii.uib.no/vis_old/publications/pdfs/2016-01-17--vad15d3unconf--edited-by-HH.pdf · Visualization is suitable when there is a need to augment human capabilities

Why use an external representation

bull external representation replace cognition with perception

4

Computer-based visualization systems provide visual representations of datasets designed to help people carry out tasks more effectively

[Cerebral Visualizing Multiple Experimental Conditions on a Graph with Biological Context Barsky Munzner Gardy and Kincaid IEEE TVCG (Proc InfoVis) 14(6)1253-1260 2008]

Why represent all the data

bull summaries lose information details matter ndash confirm expected and find unexpected patternsndash assess validity of statistical model

5

Identical statisticsIdentical statisticsx mean 9x variance 10y mean 8y variance 4xy correlation 1

Anscombersquos Quartet

Computer-based visualization systems provide visual representations of datasets designed to help people carry out tasks more effectively

Why represent all the data

bull summaries lose information details matter ndash confirm expected and find unexpected patternsndash assess validity of statistical model

5

Identical statisticsIdentical statisticsx mean 9x variance 10y mean 8y variance 4xy correlation 1

Anscombersquos Quartet

Computer-based visualization systems provide visual representations of datasets designed to help people carry out tasks more effectively

Analysis framework Four levels three questions

bull domain situationndash who are the target users

bull abstractionndash translate from specifics of domain to vocabulary of visbull what is shown data abstraction

bull often donrsquot just draw what yoursquore given transform to new formbull why is the user looking at it task abstraction

bull idiombull how is it shown

bull visual encoding idiom how to draw

bull interaction idiom how to manipulate

bull algorithmndash efficient computation

6

algorithmidiom

abstraction

domain

[A Nested Model of Visualization Design and Validation

Munzner IEEE TVCG 15(6)921-928 2009 (Proc InfoVis 2009) ]

algorithm

idiom

abstraction

domain

[A Multi-Level Typology of Abstract Visualization Tasks

Brehmer and Munzner IEEE TVCG 19(12)2376-2385 2013 (Proc InfoVis 2013) ]

Why is validation difficult

bull different ways to get it wrong at each level

7

Domain situationYou misunderstood their needs

Yoursquore showing them the wrong thing

Visual encodinginteraction idiomThe way you show it doesnrsquot work

AlgorithmYour code is too slow

Datatask abstraction

8

Why is validation difficult

Domain situationObserve target users using existing tools

Visual encodinginteraction idiomJustify design with respect to alternatives

AlgorithmMeasure system timememoryAnalyze computational complexity

Observe target users after deployment ( )

Measure adoption

Analyze results qualitativelyMeasure human time with lab experiment (lab study)

Datatask abstraction

computer science

design

cognitive psychology

anthropologyethnography

anthropologyethnography

technique-driven work

[A Nested Model of Visualization Design and Validation Munzner IEEE TVCG 15(6)921-928 2009 (Proc InfoVis 2009) ]

bull solution use methods from different fields at each level

Hauser
Typewritten Text

8

Why is validation difficult

Domain situationObserve target users using existing tools

Visual encodinginteraction idiomJustify design with respect to alternatives

AlgorithmMeasure system timememoryAnalyze computational complexity

Observe target users after deployment ( )

Measure adoption

Analyze results qualitativelyMeasure human time with lab experiment (lab study)

Datatask abstraction

computer science

design

cognitive psychology

anthropologyethnography

anthropologyethnography

technique-driven work

[A Nested Model of Visualization Design and Validation Munzner IEEE TVCG 15(6)921-928 2009 (Proc InfoVis 2009) ]

bull solution use methods from different fields at each level

Hauser
Typewritten Text

8

Why is validation difficult

Domain situationObserve target users using existing tools

Visual encodinginteraction idiomJustify design with respect to alternatives

AlgorithmMeasure system timememoryAnalyze computational complexity

Observe target users after deployment ( )

Measure adoption

Analyze results qualitativelyMeasure human time with lab experiment (lab study)

Datatask abstraction

computer science

design

cognitive psychology

anthropologyethnography

anthropologyethnography

problem-driven work

technique-driven work

[A Nested Model of Visualization Design and Validation Munzner IEEE TVCG 15(6)921-928 2009 (Proc InfoVis 2009) ]

bull solution use methods from different fields at each level

Why analyze

bull imposes a structure on huge design spacendash scaffold to help you think

systematically about choicesndash analyzing existing as stepping stone

to designing new

Why analyze

bull imposes a structure on huge design spacendash scaffold to help you think

systematically about choicesndash analyzing existing as stepping stone

to designing new

9

[SpaceTree Supporting Exploration in Large Node Link Tree Design Evolution and Empirical Evaluation Grosjean Plaisant and Bederson Proc InfoVis 2002 p 57ndash64]

SpaceTree

[TreeJuxtaposer Scalable Tree Comparison Using Focus+Context With Guaranteed Visibility ACM Trans on Graphics (Proc SIGGRAPH) 22453ndash 462 2003]

TreeJuxtaposer

Present Locate Identify

Path between two nodes

Actions

Targets

SpaceTree

TreeJuxtaposer

Encode Navigate Select Filter AggregateTree

Arrange

Why What How

Encode Navigate Select

Datasets

WhatAttributes

Dataset Types

Data Types

Data and Dataset Types

Tables

Attributes (columns)

Items (rows)

Cell containing value

Networks

Link

Node (item)

Trees

Fields (Continuous)

Geometry (Spatial)

Attributes (columns)

Value in cell

Cell

Multidimensional Table

Value in cell

Items Attributes Links Positions Grids

Attribute Types

Ordering Direction

Categorical

OrderedOrdinal

Quantitative

Sequential

Diverging

Cyclic

Tables Networks amp Trees

Fields Geometry Clusters Sets Lists

Items

Attributes

Items (nodes)

Links

Attributes

Grids

Positions

Attributes

Items

Positions

Items

Grid of positions

Position10

Why

How

What

Dataset Availability

Static Dynamic

Types Datasets and data

11

Tables

Attributes (columns)

Items (rows)

Cell containing value

Dataset Types

Attribute TypesCategorical Ordered

Ordinal Quantitative

Networks

Link

Node (item)

Node (item)

Fields (Continuous)

Attributes (columns)

Value in cell

Cell

Grid of positions

Geometry (Spatial)

Position

SpatialNetworks

12

bull action target pairsndash discover distribution

ndash compare trends

ndash locate outliers

ndash browse topology

Trends

Actions

Analyze

Search

Query

Why

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Network Data

Spatial DataShape

Topology

Paths

Extremes

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

Identify Compare Summarize

tag

Target known Target unknown

Location knownLocation unknown

Lookup

Locate

Browse

Explore

Targets

Why

How

What

13

Actions 1 Analyzebull consume

ndashdiscover vs presentbull classic splitbull aka explore vs explain

ndashenjoybull newcomerbull aka casual social

bull producendashannotate recordndashderive

bull crucial design choice

Analyze

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

tag

Derive

bull donrsquot just draw what yoursquore givenndash decide what the right thing to show isndash create it with a series of transformations from the original datasetndash draw that

bull one of the four major strategies for handling complexity

14Original Data

exports

imports

Derived Data

trade balance = exports imports

trade balance

Analysis example Derive one attribute

15

[Using Strahler numbers for real time visual exploration of huge graphs Auber Proc Intl Conf Computer Vision and Graphics pp 56ndash69 2002]

bull Strahler numberndash centrality metric for treesnetworksndash derived quantitative attribute

ndash draw top 5K of 500K for good skeleton

Task 1

58

54

64

84

24

74

6484

84

94

74

OutQuantitative attribute on nodes

58

54

64

84

24

74

6484

84

94

74

InQuantitative attribute on nodes

Task 2

Derive

WhyWhat

In Tree ReduceSummarize

HowWhyWhat

In Quantitative attribute on nodes TopologyIn Tree

Filter

InTree

OutFiltered TreeRemoved unimportant parts

InTree +

Out Quantitative attribute on nodes Out Filtered Tree

16

Actions II Search

bull what does user knowndash target location

Search

Target known Target unknown

Location known

Location unknown

Lookup

Locate

Browse

Explore

17

Actions III Query

bull what does user knowndash target location

bull how much of the data mattersndash one some all

bull analyze search queryndash independent choices for each

Search

Query

Identify Compare Summarize

Target known Target unknown

Location known

Location unknown

Lookup

Locate

Browse

Explore

Targets

18

Trends

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Extremes

Network Data

Spatial DataShape

Topology

Paths

19

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

How to encode Arrange space map channels

20

Encode

ArrangeExpress Separate

Order Align

Use

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

Encoding visually

bull analyze idiom structure

21

22

Definitions Marks and channelsbull marks

ndash geometric primitives

bull channelsndash control appearance of marks

Horizontal

Position

Vertical Both

Color

Shape Tilt

Size

Length Area Volume

Points Lines Areas

Encoding visually with marks and channels

bull analyze idiom structurendash as combination of marks and channels

23

1 vertical position

mark line

2 vertical positionhorizontal position

mark point

3 vertical positionhorizontal positioncolor hue

mark point

4 vertical positionhorizontal positioncolor huesize (area)

mark point

24

Channels Expressiveness types and effectiveness rankingsMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

25

Channels Matching TypesMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

bull expressiveness principlendash match channel and data characteristics

26

Channels RankingsMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

bull expressiveness principlendash match channel and data characteristics

bull effectiveness principlendash encode most important attributes with

highest ranked channels

27

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

How to handle complexity 3 more strategies

28

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull change view over timebull facet across multiple

viewsbull reduce itemsattributes

within single viewbull derive new data to

show within view

How to handle complexity 3 more strategies

29

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull change over time- most obvious amp flexible

of the 4 strategies

Idiom Animated transitionsbull smooth transition from one state to another

ndash alternative to jump cutsndash support for item tracking when amount of change is limited

bull example multilevel matrix viewsndash scope of what is shown narrows down

bull middle block stretches to fill space additional structure appears withinbull other blocks squish down to increasingly aggregated representations

30[Using Multilevel Call Matrices in Large Software Projects van Ham Proc IEEE Symp Information Visualization (InfoVis) pp 227ndash232 2003]

How to handle complexity 3 more strategies

31

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull facet data across multiple views

Facet

32

Juxtapose

Partition

Superimpose

Coordinate Multiple Side By Side Views

Share Encoding SameDierent

Share Data AllSubsetNone

Share Navigation

Linked Highlighting

Idiom Linked highlighting

33

System EDVbull see how regions

contiguous in one view are distributed within anotherndash powerful and pervasive

interaction idiom

bull encoding differentndashmultiform

bull data all shared

[Visual Exploration of Large Structured Datasets Wills Proc New Techniques and Trends in Statistics (NTTS) pp 237ndash246 IOS Press 1995]

Idiom birdrsquos-eye maps

34

bull encoding samebull data subset sharedbull navigation shared

ndash bidirectional linking

bull differencesndash viewpointndash (size)

bull overview-detail

System Google Maps

[A Review of Overview+Detail Zooming and Focus+Context Interfaces Cockburn Karlson and Bederson ACM Computing Surveys 411 (2008) 1ndash31]

Idiom Small multiplesbull encoding samebull data none shared

ndash different attributes for node colors

ndash (same network layout)

bull navigation shared

35

System Cerebral

[Cerebral Visualizing Multiple Experimental Conditions on a Graph with Biological Context Barsky Munzner Gardy and Kincaid IEEE Trans Visualization and Computer Graphics (Proc InfoVis 2008) 146 (2008) 1253ndash1260]

Coordinate views Design choice interaction

36

All Subset

Same

Multiform

Multiform Overview

Detail

None

Redundant

No Linkage

Small Multiples

OverviewDetail

bull why juxtapose viewsndash benefits eyes vs memory

bull lower cognitive load to move eyes between 2 views than remembering previous state with single changing view

ndash costs display area 2 views side by side each have only half the area of one view

Partition into views

37

bull how to divide data between viewsndash encodes association between items

using spatial proximity ndash major implications for what patterns

are visiblendash split according to attributes

bull design choicesndash how many splits

bull all the way down one mark per regionbull stop earlier for more complex structure

within region

ndash order in which attribs used to splitndash how many views

Partition into Side-by-Side Views

Partitioning List alignmentbull single bar chart with grouped bars

ndash split by state into regionsbull complex glyph within each region showing all ages

ndash compare easy within state hard across ages

bull small-multiple bar chartsndash split by age into regions

bull one chart per region

ndash compare easy within age harder across states

38

110

100

90

80

70

60

50

40

30

20

10

00 CA TK NY FL IL PA

65 Years and Over45 to 64 Years25 to 44 Years18 to 24 Years14 to 17 Years5 to 13 YearsUnder 5 Years

CA TK NY FL IL PA

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

httpblocksorgmbostock3887051 httpblocksorgmbostock4679202

Partitioning Recursive subdivision

bull split by neighborhoodbull then by type bull then time

ndash years as rowsndash months as columns

bull color by price

bull neighborhood patternsndash where itrsquos expensivendash where you pay much more

for detached type

39[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull switch order of splitsndash type then neighborhood

bull switch colorndash by price variation

bull type patternsndash within specific type which

neighborhoods inconsistent

40[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull different encoding for second-level regionsndash choropleth maps

41[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

How to handle complexity 3 more strategies

42

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull reduce what is shown within single view

Reduce items and attributes

43

bull reduceincrease inversesbull filter

ndash pro straightforward and intuitivebull to understand and compute

ndash con out of sight out of mind

bull aggregationndash pro inform about whole setndash con difficult to avoid losing signal

bull not mutually exclusivendash combine filter aggregatendash combine reduce facet change derive

Reduce

Filter

Aggregate

Embed

Reducing Items and Attributes

FilterItems

Attributes

Aggregate

Items

Attributes

Idiom boxplotbull static item aggregationbull task find distributionbull data tablebull derived data

ndash 5 quant attribsbull median central linebull lower and upper quartile boxesbull lower upper fences whiskers

ndash values beyond which items are outliers

ndash outliers beyond fence cutoffs explicitly shown

44

pod and the rug plot looks like the seeds within Kampstra (2008) also suggests a way of comparing two

groups more easily use the left and right sides of the bean to display different distributions A related idea

is the raindrop plot (Barrowman and Myers 2003) but its focus is on the display of error distributions from

complex models

Figure 4 demonstrates these density boxplots applied to 100 numbers drawn from each of four distribu-

tions with mean 0 and standard deviation 1 a standard normal a skew-right distribution (Johnson distri-

bution with skewness 22 and kurtosis 13) a leptikurtic distribution (Johnson distribution with skewness 0

and kurtosis 20) and a bimodal distribution (two normals with mean -095 and 095 and standard devia-

tion 031) Richer displays of density make it much easier to see important variations in the distribution

multi-modality is particularly important and yet completely invisible with the boxplot

n s k mm

2

02

4

n s k mm

2

02

4

n s k mm

4

2

02

4

4

2

02

4

n s k mm

Figure 4 From left to right box plot vase plot violin plot and bean plot Within each plot the distributions from left to

right are standard normal (n) right-skewed (s) leptikurtic (k) and bimodal (mm) A normal kernel and bandwidth of

02 are used in all plots for all groups

A more sophisticated display is the sectioned density plot (Cohen and Cohen 2006) which uses both

colour and space to stack a density estimate into a smaller area hopefully without losing any information

(not formally verified with a perceptual study) The sectioned density plot is similar in spirit to horizon

graphs for time series (Reijner 2008) which have been found to be just as readable as regular line graphs

despite taking up much less space (Heer et al 2009) The density strips of Jackson (2008) provide a similar

compact display that uses colour instead of width to display density These methods are shown in Figure 5

6

[40 years of boxplots Wickham and Stryjewski 2012 hadconz]

Idiom Dimensionality reduction for documents

45

Task 1

InHD data

Out2D data

ProduceIn High- dimensional data

WhyWhat

Derive

In2D data

Task 2

Out 2D data

HowWhyWhat

EncodeNavigateSelect

DiscoverExploreIdentify

In 2D dataOut ScatterplotOut Clusters amp points

OutScatterplotClusters amp points

Task 3

InScatterplotClusters amp points

OutLabels for clusters

WhyWhat

ProduceAnnotate

In ScatterplotIn Clusters amp pointsOut Labels for clusters

wombat

bull attribute aggregationndash derive low-dimensional target space from high-dimensional measured space

46

Datasets

WhatAttributes

Dataset Types

Data Types

Data and Dataset Types

Tables

Attributes (columns)

Items (rows)

Cell containing value

Networks

Link

Node (item)

Trees

Fields (Continuous)

Geometry (Spatial)

Attributes (columns)

Value in cell

Cell

Multidimensional Table

Value in cell

Items Attributes Links Positions Grids

Attribute Types

Ordering Direction

Categorical

OrderedOrdinal

Quantitative

Sequential

Diverging

Cyclic

Tables Networks amp Trees

Fields Geometry Clusters Sets Lists

Items

Attributes

Items (nodes)

Links

Attributes

Grids

Positions

Attributes

Items

Positions

Items

Grid of positions

Position

Trends

Actions

Analyze

Search

Query

Why

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Network Data

Spatial Data

Topology

Paths

Extremes

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

Identify Compare Summarize

tag

Target known Target unknown

Location knownLocation unknown

Lookup

Locate

Browse

Explore

Targets

Why

What

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

algorithm

idiom

abstraction

domain

More Informationbull this talk

httpwwwcsubcca~tmmtalkshtmlvad15d3

bull book page (including tutorial lecture slides)httpwwwcsubcca~tmmvadbook

ndash 20 promo code for book+ebook combo HVN17

ndash httpwwwcrcpresscomproductisbn9781466508910

ndash illustrations Eamonn Maguire

bull papers videos software talks full courses httpwwwcsubccagroupinfovis httpwwwcsubcca~tmm

47Munzner A K Peters Visualization Series CRC Press Visualization Series 2014

Visualization Analysis and Design

tamaramunzner

Page 5: Visualization Analysis & Designii.uib.no/vis_old/publications/pdfs/2016-01-17--vad15d3unconf--edited-by-HH.pdf · Visualization is suitable when there is a need to augment human capabilities

Why represent all the data

bull summaries lose information details matter ndash confirm expected and find unexpected patternsndash assess validity of statistical model

5

Identical statisticsIdentical statisticsx mean 9x variance 10y mean 8y variance 4xy correlation 1

Anscombersquos Quartet

Computer-based visualization systems provide visual representations of datasets designed to help people carry out tasks more effectively

Why represent all the data

bull summaries lose information details matter ndash confirm expected and find unexpected patternsndash assess validity of statistical model

5

Identical statisticsIdentical statisticsx mean 9x variance 10y mean 8y variance 4xy correlation 1

Anscombersquos Quartet

Computer-based visualization systems provide visual representations of datasets designed to help people carry out tasks more effectively

Analysis framework Four levels three questions

bull domain situationndash who are the target users

bull abstractionndash translate from specifics of domain to vocabulary of visbull what is shown data abstraction

bull often donrsquot just draw what yoursquore given transform to new formbull why is the user looking at it task abstraction

bull idiombull how is it shown

bull visual encoding idiom how to draw

bull interaction idiom how to manipulate

bull algorithmndash efficient computation

6

algorithmidiom

abstraction

domain

[A Nested Model of Visualization Design and Validation

Munzner IEEE TVCG 15(6)921-928 2009 (Proc InfoVis 2009) ]

algorithm

idiom

abstraction

domain

[A Multi-Level Typology of Abstract Visualization Tasks

Brehmer and Munzner IEEE TVCG 19(12)2376-2385 2013 (Proc InfoVis 2013) ]

Why is validation difficult

bull different ways to get it wrong at each level

7

Domain situationYou misunderstood their needs

Yoursquore showing them the wrong thing

Visual encodinginteraction idiomThe way you show it doesnrsquot work

AlgorithmYour code is too slow

Datatask abstraction

8

Why is validation difficult

Domain situationObserve target users using existing tools

Visual encodinginteraction idiomJustify design with respect to alternatives

AlgorithmMeasure system timememoryAnalyze computational complexity

Observe target users after deployment ( )

Measure adoption

Analyze results qualitativelyMeasure human time with lab experiment (lab study)

Datatask abstraction

computer science

design

cognitive psychology

anthropologyethnography

anthropologyethnography

technique-driven work

[A Nested Model of Visualization Design and Validation Munzner IEEE TVCG 15(6)921-928 2009 (Proc InfoVis 2009) ]

bull solution use methods from different fields at each level

Hauser
Typewritten Text

8

Why is validation difficult

Domain situationObserve target users using existing tools

Visual encodinginteraction idiomJustify design with respect to alternatives

AlgorithmMeasure system timememoryAnalyze computational complexity

Observe target users after deployment ( )

Measure adoption

Analyze results qualitativelyMeasure human time with lab experiment (lab study)

Datatask abstraction

computer science

design

cognitive psychology

anthropologyethnography

anthropologyethnography

technique-driven work

[A Nested Model of Visualization Design and Validation Munzner IEEE TVCG 15(6)921-928 2009 (Proc InfoVis 2009) ]

bull solution use methods from different fields at each level

Hauser
Typewritten Text

8

Why is validation difficult

Domain situationObserve target users using existing tools

Visual encodinginteraction idiomJustify design with respect to alternatives

AlgorithmMeasure system timememoryAnalyze computational complexity

Observe target users after deployment ( )

Measure adoption

Analyze results qualitativelyMeasure human time with lab experiment (lab study)

Datatask abstraction

computer science

design

cognitive psychology

anthropologyethnography

anthropologyethnography

problem-driven work

technique-driven work

[A Nested Model of Visualization Design and Validation Munzner IEEE TVCG 15(6)921-928 2009 (Proc InfoVis 2009) ]

bull solution use methods from different fields at each level

Why analyze

bull imposes a structure on huge design spacendash scaffold to help you think

systematically about choicesndash analyzing existing as stepping stone

to designing new

Why analyze

bull imposes a structure on huge design spacendash scaffold to help you think

systematically about choicesndash analyzing existing as stepping stone

to designing new

9

[SpaceTree Supporting Exploration in Large Node Link Tree Design Evolution and Empirical Evaluation Grosjean Plaisant and Bederson Proc InfoVis 2002 p 57ndash64]

SpaceTree

[TreeJuxtaposer Scalable Tree Comparison Using Focus+Context With Guaranteed Visibility ACM Trans on Graphics (Proc SIGGRAPH) 22453ndash 462 2003]

TreeJuxtaposer

Present Locate Identify

Path between two nodes

Actions

Targets

SpaceTree

TreeJuxtaposer

Encode Navigate Select Filter AggregateTree

Arrange

Why What How

Encode Navigate Select

Datasets

WhatAttributes

Dataset Types

Data Types

Data and Dataset Types

Tables

Attributes (columns)

Items (rows)

Cell containing value

Networks

Link

Node (item)

Trees

Fields (Continuous)

Geometry (Spatial)

Attributes (columns)

Value in cell

Cell

Multidimensional Table

Value in cell

Items Attributes Links Positions Grids

Attribute Types

Ordering Direction

Categorical

OrderedOrdinal

Quantitative

Sequential

Diverging

Cyclic

Tables Networks amp Trees

Fields Geometry Clusters Sets Lists

Items

Attributes

Items (nodes)

Links

Attributes

Grids

Positions

Attributes

Items

Positions

Items

Grid of positions

Position10

Why

How

What

Dataset Availability

Static Dynamic

Types Datasets and data

11

Tables

Attributes (columns)

Items (rows)

Cell containing value

Dataset Types

Attribute TypesCategorical Ordered

Ordinal Quantitative

Networks

Link

Node (item)

Node (item)

Fields (Continuous)

Attributes (columns)

Value in cell

Cell

Grid of positions

Geometry (Spatial)

Position

SpatialNetworks

12

bull action target pairsndash discover distribution

ndash compare trends

ndash locate outliers

ndash browse topology

Trends

Actions

Analyze

Search

Query

Why

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Network Data

Spatial DataShape

Topology

Paths

Extremes

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

Identify Compare Summarize

tag

Target known Target unknown

Location knownLocation unknown

Lookup

Locate

Browse

Explore

Targets

Why

How

What

13

Actions 1 Analyzebull consume

ndashdiscover vs presentbull classic splitbull aka explore vs explain

ndashenjoybull newcomerbull aka casual social

bull producendashannotate recordndashderive

bull crucial design choice

Analyze

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

tag

Derive

bull donrsquot just draw what yoursquore givenndash decide what the right thing to show isndash create it with a series of transformations from the original datasetndash draw that

bull one of the four major strategies for handling complexity

14Original Data

exports

imports

Derived Data

trade balance = exports imports

trade balance

Analysis example Derive one attribute

15

[Using Strahler numbers for real time visual exploration of huge graphs Auber Proc Intl Conf Computer Vision and Graphics pp 56ndash69 2002]

bull Strahler numberndash centrality metric for treesnetworksndash derived quantitative attribute

ndash draw top 5K of 500K for good skeleton

Task 1

58

54

64

84

24

74

6484

84

94

74

OutQuantitative attribute on nodes

58

54

64

84

24

74

6484

84

94

74

InQuantitative attribute on nodes

Task 2

Derive

WhyWhat

In Tree ReduceSummarize

HowWhyWhat

In Quantitative attribute on nodes TopologyIn Tree

Filter

InTree

OutFiltered TreeRemoved unimportant parts

InTree +

Out Quantitative attribute on nodes Out Filtered Tree

16

Actions II Search

bull what does user knowndash target location

Search

Target known Target unknown

Location known

Location unknown

Lookup

Locate

Browse

Explore

17

Actions III Query

bull what does user knowndash target location

bull how much of the data mattersndash one some all

bull analyze search queryndash independent choices for each

Search

Query

Identify Compare Summarize

Target known Target unknown

Location known

Location unknown

Lookup

Locate

Browse

Explore

Targets

18

Trends

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Extremes

Network Data

Spatial DataShape

Topology

Paths

19

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

How to encode Arrange space map channels

20

Encode

ArrangeExpress Separate

Order Align

Use

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

Encoding visually

bull analyze idiom structure

21

22

Definitions Marks and channelsbull marks

ndash geometric primitives

bull channelsndash control appearance of marks

Horizontal

Position

Vertical Both

Color

Shape Tilt

Size

Length Area Volume

Points Lines Areas

Encoding visually with marks and channels

bull analyze idiom structurendash as combination of marks and channels

23

1 vertical position

mark line

2 vertical positionhorizontal position

mark point

3 vertical positionhorizontal positioncolor hue

mark point

4 vertical positionhorizontal positioncolor huesize (area)

mark point

24

Channels Expressiveness types and effectiveness rankingsMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

25

Channels Matching TypesMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

bull expressiveness principlendash match channel and data characteristics

26

Channels RankingsMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

bull expressiveness principlendash match channel and data characteristics

bull effectiveness principlendash encode most important attributes with

highest ranked channels

27

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

How to handle complexity 3 more strategies

28

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull change view over timebull facet across multiple

viewsbull reduce itemsattributes

within single viewbull derive new data to

show within view

How to handle complexity 3 more strategies

29

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull change over time- most obvious amp flexible

of the 4 strategies

Idiom Animated transitionsbull smooth transition from one state to another

ndash alternative to jump cutsndash support for item tracking when amount of change is limited

bull example multilevel matrix viewsndash scope of what is shown narrows down

bull middle block stretches to fill space additional structure appears withinbull other blocks squish down to increasingly aggregated representations

30[Using Multilevel Call Matrices in Large Software Projects van Ham Proc IEEE Symp Information Visualization (InfoVis) pp 227ndash232 2003]

How to handle complexity 3 more strategies

31

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull facet data across multiple views

Facet

32

Juxtapose

Partition

Superimpose

Coordinate Multiple Side By Side Views

Share Encoding SameDierent

Share Data AllSubsetNone

Share Navigation

Linked Highlighting

Idiom Linked highlighting

33

System EDVbull see how regions

contiguous in one view are distributed within anotherndash powerful and pervasive

interaction idiom

bull encoding differentndashmultiform

bull data all shared

[Visual Exploration of Large Structured Datasets Wills Proc New Techniques and Trends in Statistics (NTTS) pp 237ndash246 IOS Press 1995]

Idiom birdrsquos-eye maps

34

bull encoding samebull data subset sharedbull navigation shared

ndash bidirectional linking

bull differencesndash viewpointndash (size)

bull overview-detail

System Google Maps

[A Review of Overview+Detail Zooming and Focus+Context Interfaces Cockburn Karlson and Bederson ACM Computing Surveys 411 (2008) 1ndash31]

Idiom Small multiplesbull encoding samebull data none shared

ndash different attributes for node colors

ndash (same network layout)

bull navigation shared

35

System Cerebral

[Cerebral Visualizing Multiple Experimental Conditions on a Graph with Biological Context Barsky Munzner Gardy and Kincaid IEEE Trans Visualization and Computer Graphics (Proc InfoVis 2008) 146 (2008) 1253ndash1260]

Coordinate views Design choice interaction

36

All Subset

Same

Multiform

Multiform Overview

Detail

None

Redundant

No Linkage

Small Multiples

OverviewDetail

bull why juxtapose viewsndash benefits eyes vs memory

bull lower cognitive load to move eyes between 2 views than remembering previous state with single changing view

ndash costs display area 2 views side by side each have only half the area of one view

Partition into views

37

bull how to divide data between viewsndash encodes association between items

using spatial proximity ndash major implications for what patterns

are visiblendash split according to attributes

bull design choicesndash how many splits

bull all the way down one mark per regionbull stop earlier for more complex structure

within region

ndash order in which attribs used to splitndash how many views

Partition into Side-by-Side Views

Partitioning List alignmentbull single bar chart with grouped bars

ndash split by state into regionsbull complex glyph within each region showing all ages

ndash compare easy within state hard across ages

bull small-multiple bar chartsndash split by age into regions

bull one chart per region

ndash compare easy within age harder across states

38

110

100

90

80

70

60

50

40

30

20

10

00 CA TK NY FL IL PA

65 Years and Over45 to 64 Years25 to 44 Years18 to 24 Years14 to 17 Years5 to 13 YearsUnder 5 Years

CA TK NY FL IL PA

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

httpblocksorgmbostock3887051 httpblocksorgmbostock4679202

Partitioning Recursive subdivision

bull split by neighborhoodbull then by type bull then time

ndash years as rowsndash months as columns

bull color by price

bull neighborhood patternsndash where itrsquos expensivendash where you pay much more

for detached type

39[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull switch order of splitsndash type then neighborhood

bull switch colorndash by price variation

bull type patternsndash within specific type which

neighborhoods inconsistent

40[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull different encoding for second-level regionsndash choropleth maps

41[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

How to handle complexity 3 more strategies

42

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull reduce what is shown within single view

Reduce items and attributes

43

bull reduceincrease inversesbull filter

ndash pro straightforward and intuitivebull to understand and compute

ndash con out of sight out of mind

bull aggregationndash pro inform about whole setndash con difficult to avoid losing signal

bull not mutually exclusivendash combine filter aggregatendash combine reduce facet change derive

Reduce

Filter

Aggregate

Embed

Reducing Items and Attributes

FilterItems

Attributes

Aggregate

Items

Attributes

Idiom boxplotbull static item aggregationbull task find distributionbull data tablebull derived data

ndash 5 quant attribsbull median central linebull lower and upper quartile boxesbull lower upper fences whiskers

ndash values beyond which items are outliers

ndash outliers beyond fence cutoffs explicitly shown

44

pod and the rug plot looks like the seeds within Kampstra (2008) also suggests a way of comparing two

groups more easily use the left and right sides of the bean to display different distributions A related idea

is the raindrop plot (Barrowman and Myers 2003) but its focus is on the display of error distributions from

complex models

Figure 4 demonstrates these density boxplots applied to 100 numbers drawn from each of four distribu-

tions with mean 0 and standard deviation 1 a standard normal a skew-right distribution (Johnson distri-

bution with skewness 22 and kurtosis 13) a leptikurtic distribution (Johnson distribution with skewness 0

and kurtosis 20) and a bimodal distribution (two normals with mean -095 and 095 and standard devia-

tion 031) Richer displays of density make it much easier to see important variations in the distribution

multi-modality is particularly important and yet completely invisible with the boxplot

n s k mm

2

02

4

n s k mm

2

02

4

n s k mm

4

2

02

4

4

2

02

4

n s k mm

Figure 4 From left to right box plot vase plot violin plot and bean plot Within each plot the distributions from left to

right are standard normal (n) right-skewed (s) leptikurtic (k) and bimodal (mm) A normal kernel and bandwidth of

02 are used in all plots for all groups

A more sophisticated display is the sectioned density plot (Cohen and Cohen 2006) which uses both

colour and space to stack a density estimate into a smaller area hopefully without losing any information

(not formally verified with a perceptual study) The sectioned density plot is similar in spirit to horizon

graphs for time series (Reijner 2008) which have been found to be just as readable as regular line graphs

despite taking up much less space (Heer et al 2009) The density strips of Jackson (2008) provide a similar

compact display that uses colour instead of width to display density These methods are shown in Figure 5

6

[40 years of boxplots Wickham and Stryjewski 2012 hadconz]

Idiom Dimensionality reduction for documents

45

Task 1

InHD data

Out2D data

ProduceIn High- dimensional data

WhyWhat

Derive

In2D data

Task 2

Out 2D data

HowWhyWhat

EncodeNavigateSelect

DiscoverExploreIdentify

In 2D dataOut ScatterplotOut Clusters amp points

OutScatterplotClusters amp points

Task 3

InScatterplotClusters amp points

OutLabels for clusters

WhyWhat

ProduceAnnotate

In ScatterplotIn Clusters amp pointsOut Labels for clusters

wombat

bull attribute aggregationndash derive low-dimensional target space from high-dimensional measured space

46

Datasets

WhatAttributes

Dataset Types

Data Types

Data and Dataset Types

Tables

Attributes (columns)

Items (rows)

Cell containing value

Networks

Link

Node (item)

Trees

Fields (Continuous)

Geometry (Spatial)

Attributes (columns)

Value in cell

Cell

Multidimensional Table

Value in cell

Items Attributes Links Positions Grids

Attribute Types

Ordering Direction

Categorical

OrderedOrdinal

Quantitative

Sequential

Diverging

Cyclic

Tables Networks amp Trees

Fields Geometry Clusters Sets Lists

Items

Attributes

Items (nodes)

Links

Attributes

Grids

Positions

Attributes

Items

Positions

Items

Grid of positions

Position

Trends

Actions

Analyze

Search

Query

Why

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Network Data

Spatial Data

Topology

Paths

Extremes

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

Identify Compare Summarize

tag

Target known Target unknown

Location knownLocation unknown

Lookup

Locate

Browse

Explore

Targets

Why

What

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

algorithm

idiom

abstraction

domain

More Informationbull this talk

httpwwwcsubcca~tmmtalkshtmlvad15d3

bull book page (including tutorial lecture slides)httpwwwcsubcca~tmmvadbook

ndash 20 promo code for book+ebook combo HVN17

ndash httpwwwcrcpresscomproductisbn9781466508910

ndash illustrations Eamonn Maguire

bull papers videos software talks full courses httpwwwcsubccagroupinfovis httpwwwcsubcca~tmm

47Munzner A K Peters Visualization Series CRC Press Visualization Series 2014

Visualization Analysis and Design

tamaramunzner

Page 6: Visualization Analysis & Designii.uib.no/vis_old/publications/pdfs/2016-01-17--vad15d3unconf--edited-by-HH.pdf · Visualization is suitable when there is a need to augment human capabilities

Why represent all the data

bull summaries lose information details matter ndash confirm expected and find unexpected patternsndash assess validity of statistical model

5

Identical statisticsIdentical statisticsx mean 9x variance 10y mean 8y variance 4xy correlation 1

Anscombersquos Quartet

Computer-based visualization systems provide visual representations of datasets designed to help people carry out tasks more effectively

Analysis framework Four levels three questions

bull domain situationndash who are the target users

bull abstractionndash translate from specifics of domain to vocabulary of visbull what is shown data abstraction

bull often donrsquot just draw what yoursquore given transform to new formbull why is the user looking at it task abstraction

bull idiombull how is it shown

bull visual encoding idiom how to draw

bull interaction idiom how to manipulate

bull algorithmndash efficient computation

6

algorithmidiom

abstraction

domain

[A Nested Model of Visualization Design and Validation

Munzner IEEE TVCG 15(6)921-928 2009 (Proc InfoVis 2009) ]

algorithm

idiom

abstraction

domain

[A Multi-Level Typology of Abstract Visualization Tasks

Brehmer and Munzner IEEE TVCG 19(12)2376-2385 2013 (Proc InfoVis 2013) ]

Why is validation difficult

bull different ways to get it wrong at each level

7

Domain situationYou misunderstood their needs

Yoursquore showing them the wrong thing

Visual encodinginteraction idiomThe way you show it doesnrsquot work

AlgorithmYour code is too slow

Datatask abstraction

8

Why is validation difficult

Domain situationObserve target users using existing tools

Visual encodinginteraction idiomJustify design with respect to alternatives

AlgorithmMeasure system timememoryAnalyze computational complexity

Observe target users after deployment ( )

Measure adoption

Analyze results qualitativelyMeasure human time with lab experiment (lab study)

Datatask abstraction

computer science

design

cognitive psychology

anthropologyethnography

anthropologyethnography

technique-driven work

[A Nested Model of Visualization Design and Validation Munzner IEEE TVCG 15(6)921-928 2009 (Proc InfoVis 2009) ]

bull solution use methods from different fields at each level

Hauser
Typewritten Text

8

Why is validation difficult

Domain situationObserve target users using existing tools

Visual encodinginteraction idiomJustify design with respect to alternatives

AlgorithmMeasure system timememoryAnalyze computational complexity

Observe target users after deployment ( )

Measure adoption

Analyze results qualitativelyMeasure human time with lab experiment (lab study)

Datatask abstraction

computer science

design

cognitive psychology

anthropologyethnography

anthropologyethnography

technique-driven work

[A Nested Model of Visualization Design and Validation Munzner IEEE TVCG 15(6)921-928 2009 (Proc InfoVis 2009) ]

bull solution use methods from different fields at each level

Hauser
Typewritten Text

8

Why is validation difficult

Domain situationObserve target users using existing tools

Visual encodinginteraction idiomJustify design with respect to alternatives

AlgorithmMeasure system timememoryAnalyze computational complexity

Observe target users after deployment ( )

Measure adoption

Analyze results qualitativelyMeasure human time with lab experiment (lab study)

Datatask abstraction

computer science

design

cognitive psychology

anthropologyethnography

anthropologyethnography

problem-driven work

technique-driven work

[A Nested Model of Visualization Design and Validation Munzner IEEE TVCG 15(6)921-928 2009 (Proc InfoVis 2009) ]

bull solution use methods from different fields at each level

Why analyze

bull imposes a structure on huge design spacendash scaffold to help you think

systematically about choicesndash analyzing existing as stepping stone

to designing new

Why analyze

bull imposes a structure on huge design spacendash scaffold to help you think

systematically about choicesndash analyzing existing as stepping stone

to designing new

9

[SpaceTree Supporting Exploration in Large Node Link Tree Design Evolution and Empirical Evaluation Grosjean Plaisant and Bederson Proc InfoVis 2002 p 57ndash64]

SpaceTree

[TreeJuxtaposer Scalable Tree Comparison Using Focus+Context With Guaranteed Visibility ACM Trans on Graphics (Proc SIGGRAPH) 22453ndash 462 2003]

TreeJuxtaposer

Present Locate Identify

Path between two nodes

Actions

Targets

SpaceTree

TreeJuxtaposer

Encode Navigate Select Filter AggregateTree

Arrange

Why What How

Encode Navigate Select

Datasets

WhatAttributes

Dataset Types

Data Types

Data and Dataset Types

Tables

Attributes (columns)

Items (rows)

Cell containing value

Networks

Link

Node (item)

Trees

Fields (Continuous)

Geometry (Spatial)

Attributes (columns)

Value in cell

Cell

Multidimensional Table

Value in cell

Items Attributes Links Positions Grids

Attribute Types

Ordering Direction

Categorical

OrderedOrdinal

Quantitative

Sequential

Diverging

Cyclic

Tables Networks amp Trees

Fields Geometry Clusters Sets Lists

Items

Attributes

Items (nodes)

Links

Attributes

Grids

Positions

Attributes

Items

Positions

Items

Grid of positions

Position10

Why

How

What

Dataset Availability

Static Dynamic

Types Datasets and data

11

Tables

Attributes (columns)

Items (rows)

Cell containing value

Dataset Types

Attribute TypesCategorical Ordered

Ordinal Quantitative

Networks

Link

Node (item)

Node (item)

Fields (Continuous)

Attributes (columns)

Value in cell

Cell

Grid of positions

Geometry (Spatial)

Position

SpatialNetworks

12

bull action target pairsndash discover distribution

ndash compare trends

ndash locate outliers

ndash browse topology

Trends

Actions

Analyze

Search

Query

Why

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Network Data

Spatial DataShape

Topology

Paths

Extremes

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

Identify Compare Summarize

tag

Target known Target unknown

Location knownLocation unknown

Lookup

Locate

Browse

Explore

Targets

Why

How

What

13

Actions 1 Analyzebull consume

ndashdiscover vs presentbull classic splitbull aka explore vs explain

ndashenjoybull newcomerbull aka casual social

bull producendashannotate recordndashderive

bull crucial design choice

Analyze

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

tag

Derive

bull donrsquot just draw what yoursquore givenndash decide what the right thing to show isndash create it with a series of transformations from the original datasetndash draw that

bull one of the four major strategies for handling complexity

14Original Data

exports

imports

Derived Data

trade balance = exports imports

trade balance

Analysis example Derive one attribute

15

[Using Strahler numbers for real time visual exploration of huge graphs Auber Proc Intl Conf Computer Vision and Graphics pp 56ndash69 2002]

bull Strahler numberndash centrality metric for treesnetworksndash derived quantitative attribute

ndash draw top 5K of 500K for good skeleton

Task 1

58

54

64

84

24

74

6484

84

94

74

OutQuantitative attribute on nodes

58

54

64

84

24

74

6484

84

94

74

InQuantitative attribute on nodes

Task 2

Derive

WhyWhat

In Tree ReduceSummarize

HowWhyWhat

In Quantitative attribute on nodes TopologyIn Tree

Filter

InTree

OutFiltered TreeRemoved unimportant parts

InTree +

Out Quantitative attribute on nodes Out Filtered Tree

16

Actions II Search

bull what does user knowndash target location

Search

Target known Target unknown

Location known

Location unknown

Lookup

Locate

Browse

Explore

17

Actions III Query

bull what does user knowndash target location

bull how much of the data mattersndash one some all

bull analyze search queryndash independent choices for each

Search

Query

Identify Compare Summarize

Target known Target unknown

Location known

Location unknown

Lookup

Locate

Browse

Explore

Targets

18

Trends

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Extremes

Network Data

Spatial DataShape

Topology

Paths

19

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

How to encode Arrange space map channels

20

Encode

ArrangeExpress Separate

Order Align

Use

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

Encoding visually

bull analyze idiom structure

21

22

Definitions Marks and channelsbull marks

ndash geometric primitives

bull channelsndash control appearance of marks

Horizontal

Position

Vertical Both

Color

Shape Tilt

Size

Length Area Volume

Points Lines Areas

Encoding visually with marks and channels

bull analyze idiom structurendash as combination of marks and channels

23

1 vertical position

mark line

2 vertical positionhorizontal position

mark point

3 vertical positionhorizontal positioncolor hue

mark point

4 vertical positionhorizontal positioncolor huesize (area)

mark point

24

Channels Expressiveness types and effectiveness rankingsMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

25

Channels Matching TypesMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

bull expressiveness principlendash match channel and data characteristics

26

Channels RankingsMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

bull expressiveness principlendash match channel and data characteristics

bull effectiveness principlendash encode most important attributes with

highest ranked channels

27

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

How to handle complexity 3 more strategies

28

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull change view over timebull facet across multiple

viewsbull reduce itemsattributes

within single viewbull derive new data to

show within view

How to handle complexity 3 more strategies

29

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull change over time- most obvious amp flexible

of the 4 strategies

Idiom Animated transitionsbull smooth transition from one state to another

ndash alternative to jump cutsndash support for item tracking when amount of change is limited

bull example multilevel matrix viewsndash scope of what is shown narrows down

bull middle block stretches to fill space additional structure appears withinbull other blocks squish down to increasingly aggregated representations

30[Using Multilevel Call Matrices in Large Software Projects van Ham Proc IEEE Symp Information Visualization (InfoVis) pp 227ndash232 2003]

How to handle complexity 3 more strategies

31

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull facet data across multiple views

Facet

32

Juxtapose

Partition

Superimpose

Coordinate Multiple Side By Side Views

Share Encoding SameDierent

Share Data AllSubsetNone

Share Navigation

Linked Highlighting

Idiom Linked highlighting

33

System EDVbull see how regions

contiguous in one view are distributed within anotherndash powerful and pervasive

interaction idiom

bull encoding differentndashmultiform

bull data all shared

[Visual Exploration of Large Structured Datasets Wills Proc New Techniques and Trends in Statistics (NTTS) pp 237ndash246 IOS Press 1995]

Idiom birdrsquos-eye maps

34

bull encoding samebull data subset sharedbull navigation shared

ndash bidirectional linking

bull differencesndash viewpointndash (size)

bull overview-detail

System Google Maps

[A Review of Overview+Detail Zooming and Focus+Context Interfaces Cockburn Karlson and Bederson ACM Computing Surveys 411 (2008) 1ndash31]

Idiom Small multiplesbull encoding samebull data none shared

ndash different attributes for node colors

ndash (same network layout)

bull navigation shared

35

System Cerebral

[Cerebral Visualizing Multiple Experimental Conditions on a Graph with Biological Context Barsky Munzner Gardy and Kincaid IEEE Trans Visualization and Computer Graphics (Proc InfoVis 2008) 146 (2008) 1253ndash1260]

Coordinate views Design choice interaction

36

All Subset

Same

Multiform

Multiform Overview

Detail

None

Redundant

No Linkage

Small Multiples

OverviewDetail

bull why juxtapose viewsndash benefits eyes vs memory

bull lower cognitive load to move eyes between 2 views than remembering previous state with single changing view

ndash costs display area 2 views side by side each have only half the area of one view

Partition into views

37

bull how to divide data between viewsndash encodes association between items

using spatial proximity ndash major implications for what patterns

are visiblendash split according to attributes

bull design choicesndash how many splits

bull all the way down one mark per regionbull stop earlier for more complex structure

within region

ndash order in which attribs used to splitndash how many views

Partition into Side-by-Side Views

Partitioning List alignmentbull single bar chart with grouped bars

ndash split by state into regionsbull complex glyph within each region showing all ages

ndash compare easy within state hard across ages

bull small-multiple bar chartsndash split by age into regions

bull one chart per region

ndash compare easy within age harder across states

38

110

100

90

80

70

60

50

40

30

20

10

00 CA TK NY FL IL PA

65 Years and Over45 to 64 Years25 to 44 Years18 to 24 Years14 to 17 Years5 to 13 YearsUnder 5 Years

CA TK NY FL IL PA

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

httpblocksorgmbostock3887051 httpblocksorgmbostock4679202

Partitioning Recursive subdivision

bull split by neighborhoodbull then by type bull then time

ndash years as rowsndash months as columns

bull color by price

bull neighborhood patternsndash where itrsquos expensivendash where you pay much more

for detached type

39[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull switch order of splitsndash type then neighborhood

bull switch colorndash by price variation

bull type patternsndash within specific type which

neighborhoods inconsistent

40[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull different encoding for second-level regionsndash choropleth maps

41[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

How to handle complexity 3 more strategies

42

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull reduce what is shown within single view

Reduce items and attributes

43

bull reduceincrease inversesbull filter

ndash pro straightforward and intuitivebull to understand and compute

ndash con out of sight out of mind

bull aggregationndash pro inform about whole setndash con difficult to avoid losing signal

bull not mutually exclusivendash combine filter aggregatendash combine reduce facet change derive

Reduce

Filter

Aggregate

Embed

Reducing Items and Attributes

FilterItems

Attributes

Aggregate

Items

Attributes

Idiom boxplotbull static item aggregationbull task find distributionbull data tablebull derived data

ndash 5 quant attribsbull median central linebull lower and upper quartile boxesbull lower upper fences whiskers

ndash values beyond which items are outliers

ndash outliers beyond fence cutoffs explicitly shown

44

pod and the rug plot looks like the seeds within Kampstra (2008) also suggests a way of comparing two

groups more easily use the left and right sides of the bean to display different distributions A related idea

is the raindrop plot (Barrowman and Myers 2003) but its focus is on the display of error distributions from

complex models

Figure 4 demonstrates these density boxplots applied to 100 numbers drawn from each of four distribu-

tions with mean 0 and standard deviation 1 a standard normal a skew-right distribution (Johnson distri-

bution with skewness 22 and kurtosis 13) a leptikurtic distribution (Johnson distribution with skewness 0

and kurtosis 20) and a bimodal distribution (two normals with mean -095 and 095 and standard devia-

tion 031) Richer displays of density make it much easier to see important variations in the distribution

multi-modality is particularly important and yet completely invisible with the boxplot

n s k mm

2

02

4

n s k mm

2

02

4

n s k mm

4

2

02

4

4

2

02

4

n s k mm

Figure 4 From left to right box plot vase plot violin plot and bean plot Within each plot the distributions from left to

right are standard normal (n) right-skewed (s) leptikurtic (k) and bimodal (mm) A normal kernel and bandwidth of

02 are used in all plots for all groups

A more sophisticated display is the sectioned density plot (Cohen and Cohen 2006) which uses both

colour and space to stack a density estimate into a smaller area hopefully without losing any information

(not formally verified with a perceptual study) The sectioned density plot is similar in spirit to horizon

graphs for time series (Reijner 2008) which have been found to be just as readable as regular line graphs

despite taking up much less space (Heer et al 2009) The density strips of Jackson (2008) provide a similar

compact display that uses colour instead of width to display density These methods are shown in Figure 5

6

[40 years of boxplots Wickham and Stryjewski 2012 hadconz]

Idiom Dimensionality reduction for documents

45

Task 1

InHD data

Out2D data

ProduceIn High- dimensional data

WhyWhat

Derive

In2D data

Task 2

Out 2D data

HowWhyWhat

EncodeNavigateSelect

DiscoverExploreIdentify

In 2D dataOut ScatterplotOut Clusters amp points

OutScatterplotClusters amp points

Task 3

InScatterplotClusters amp points

OutLabels for clusters

WhyWhat

ProduceAnnotate

In ScatterplotIn Clusters amp pointsOut Labels for clusters

wombat

bull attribute aggregationndash derive low-dimensional target space from high-dimensional measured space

46

Datasets

WhatAttributes

Dataset Types

Data Types

Data and Dataset Types

Tables

Attributes (columns)

Items (rows)

Cell containing value

Networks

Link

Node (item)

Trees

Fields (Continuous)

Geometry (Spatial)

Attributes (columns)

Value in cell

Cell

Multidimensional Table

Value in cell

Items Attributes Links Positions Grids

Attribute Types

Ordering Direction

Categorical

OrderedOrdinal

Quantitative

Sequential

Diverging

Cyclic

Tables Networks amp Trees

Fields Geometry Clusters Sets Lists

Items

Attributes

Items (nodes)

Links

Attributes

Grids

Positions

Attributes

Items

Positions

Items

Grid of positions

Position

Trends

Actions

Analyze

Search

Query

Why

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Network Data

Spatial Data

Topology

Paths

Extremes

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

Identify Compare Summarize

tag

Target known Target unknown

Location knownLocation unknown

Lookup

Locate

Browse

Explore

Targets

Why

What

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

algorithm

idiom

abstraction

domain

More Informationbull this talk

httpwwwcsubcca~tmmtalkshtmlvad15d3

bull book page (including tutorial lecture slides)httpwwwcsubcca~tmmvadbook

ndash 20 promo code for book+ebook combo HVN17

ndash httpwwwcrcpresscomproductisbn9781466508910

ndash illustrations Eamonn Maguire

bull papers videos software talks full courses httpwwwcsubccagroupinfovis httpwwwcsubcca~tmm

47Munzner A K Peters Visualization Series CRC Press Visualization Series 2014

Visualization Analysis and Design

tamaramunzner

Page 7: Visualization Analysis & Designii.uib.no/vis_old/publications/pdfs/2016-01-17--vad15d3unconf--edited-by-HH.pdf · Visualization is suitable when there is a need to augment human capabilities

Analysis framework Four levels three questions

bull domain situationndash who are the target users

bull abstractionndash translate from specifics of domain to vocabulary of visbull what is shown data abstraction

bull often donrsquot just draw what yoursquore given transform to new formbull why is the user looking at it task abstraction

bull idiombull how is it shown

bull visual encoding idiom how to draw

bull interaction idiom how to manipulate

bull algorithmndash efficient computation

6

algorithmidiom

abstraction

domain

[A Nested Model of Visualization Design and Validation

Munzner IEEE TVCG 15(6)921-928 2009 (Proc InfoVis 2009) ]

algorithm

idiom

abstraction

domain

[A Multi-Level Typology of Abstract Visualization Tasks

Brehmer and Munzner IEEE TVCG 19(12)2376-2385 2013 (Proc InfoVis 2013) ]

Why is validation difficult

bull different ways to get it wrong at each level

7

Domain situationYou misunderstood their needs

Yoursquore showing them the wrong thing

Visual encodinginteraction idiomThe way you show it doesnrsquot work

AlgorithmYour code is too slow

Datatask abstraction

8

Why is validation difficult

Domain situationObserve target users using existing tools

Visual encodinginteraction idiomJustify design with respect to alternatives

AlgorithmMeasure system timememoryAnalyze computational complexity

Observe target users after deployment ( )

Measure adoption

Analyze results qualitativelyMeasure human time with lab experiment (lab study)

Datatask abstraction

computer science

design

cognitive psychology

anthropologyethnography

anthropologyethnography

technique-driven work

[A Nested Model of Visualization Design and Validation Munzner IEEE TVCG 15(6)921-928 2009 (Proc InfoVis 2009) ]

bull solution use methods from different fields at each level

Hauser
Typewritten Text

8

Why is validation difficult

Domain situationObserve target users using existing tools

Visual encodinginteraction idiomJustify design with respect to alternatives

AlgorithmMeasure system timememoryAnalyze computational complexity

Observe target users after deployment ( )

Measure adoption

Analyze results qualitativelyMeasure human time with lab experiment (lab study)

Datatask abstraction

computer science

design

cognitive psychology

anthropologyethnography

anthropologyethnography

technique-driven work

[A Nested Model of Visualization Design and Validation Munzner IEEE TVCG 15(6)921-928 2009 (Proc InfoVis 2009) ]

bull solution use methods from different fields at each level

Hauser
Typewritten Text

8

Why is validation difficult

Domain situationObserve target users using existing tools

Visual encodinginteraction idiomJustify design with respect to alternatives

AlgorithmMeasure system timememoryAnalyze computational complexity

Observe target users after deployment ( )

Measure adoption

Analyze results qualitativelyMeasure human time with lab experiment (lab study)

Datatask abstraction

computer science

design

cognitive psychology

anthropologyethnography

anthropologyethnography

problem-driven work

technique-driven work

[A Nested Model of Visualization Design and Validation Munzner IEEE TVCG 15(6)921-928 2009 (Proc InfoVis 2009) ]

bull solution use methods from different fields at each level

Why analyze

bull imposes a structure on huge design spacendash scaffold to help you think

systematically about choicesndash analyzing existing as stepping stone

to designing new

Why analyze

bull imposes a structure on huge design spacendash scaffold to help you think

systematically about choicesndash analyzing existing as stepping stone

to designing new

9

[SpaceTree Supporting Exploration in Large Node Link Tree Design Evolution and Empirical Evaluation Grosjean Plaisant and Bederson Proc InfoVis 2002 p 57ndash64]

SpaceTree

[TreeJuxtaposer Scalable Tree Comparison Using Focus+Context With Guaranteed Visibility ACM Trans on Graphics (Proc SIGGRAPH) 22453ndash 462 2003]

TreeJuxtaposer

Present Locate Identify

Path between two nodes

Actions

Targets

SpaceTree

TreeJuxtaposer

Encode Navigate Select Filter AggregateTree

Arrange

Why What How

Encode Navigate Select

Datasets

WhatAttributes

Dataset Types

Data Types

Data and Dataset Types

Tables

Attributes (columns)

Items (rows)

Cell containing value

Networks

Link

Node (item)

Trees

Fields (Continuous)

Geometry (Spatial)

Attributes (columns)

Value in cell

Cell

Multidimensional Table

Value in cell

Items Attributes Links Positions Grids

Attribute Types

Ordering Direction

Categorical

OrderedOrdinal

Quantitative

Sequential

Diverging

Cyclic

Tables Networks amp Trees

Fields Geometry Clusters Sets Lists

Items

Attributes

Items (nodes)

Links

Attributes

Grids

Positions

Attributes

Items

Positions

Items

Grid of positions

Position10

Why

How

What

Dataset Availability

Static Dynamic

Types Datasets and data

11

Tables

Attributes (columns)

Items (rows)

Cell containing value

Dataset Types

Attribute TypesCategorical Ordered

Ordinal Quantitative

Networks

Link

Node (item)

Node (item)

Fields (Continuous)

Attributes (columns)

Value in cell

Cell

Grid of positions

Geometry (Spatial)

Position

SpatialNetworks

12

bull action target pairsndash discover distribution

ndash compare trends

ndash locate outliers

ndash browse topology

Trends

Actions

Analyze

Search

Query

Why

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Network Data

Spatial DataShape

Topology

Paths

Extremes

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

Identify Compare Summarize

tag

Target known Target unknown

Location knownLocation unknown

Lookup

Locate

Browse

Explore

Targets

Why

How

What

13

Actions 1 Analyzebull consume

ndashdiscover vs presentbull classic splitbull aka explore vs explain

ndashenjoybull newcomerbull aka casual social

bull producendashannotate recordndashderive

bull crucial design choice

Analyze

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

tag

Derive

bull donrsquot just draw what yoursquore givenndash decide what the right thing to show isndash create it with a series of transformations from the original datasetndash draw that

bull one of the four major strategies for handling complexity

14Original Data

exports

imports

Derived Data

trade balance = exports imports

trade balance

Analysis example Derive one attribute

15

[Using Strahler numbers for real time visual exploration of huge graphs Auber Proc Intl Conf Computer Vision and Graphics pp 56ndash69 2002]

bull Strahler numberndash centrality metric for treesnetworksndash derived quantitative attribute

ndash draw top 5K of 500K for good skeleton

Task 1

58

54

64

84

24

74

6484

84

94

74

OutQuantitative attribute on nodes

58

54

64

84

24

74

6484

84

94

74

InQuantitative attribute on nodes

Task 2

Derive

WhyWhat

In Tree ReduceSummarize

HowWhyWhat

In Quantitative attribute on nodes TopologyIn Tree

Filter

InTree

OutFiltered TreeRemoved unimportant parts

InTree +

Out Quantitative attribute on nodes Out Filtered Tree

16

Actions II Search

bull what does user knowndash target location

Search

Target known Target unknown

Location known

Location unknown

Lookup

Locate

Browse

Explore

17

Actions III Query

bull what does user knowndash target location

bull how much of the data mattersndash one some all

bull analyze search queryndash independent choices for each

Search

Query

Identify Compare Summarize

Target known Target unknown

Location known

Location unknown

Lookup

Locate

Browse

Explore

Targets

18

Trends

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Extremes

Network Data

Spatial DataShape

Topology

Paths

19

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

How to encode Arrange space map channels

20

Encode

ArrangeExpress Separate

Order Align

Use

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

Encoding visually

bull analyze idiom structure

21

22

Definitions Marks and channelsbull marks

ndash geometric primitives

bull channelsndash control appearance of marks

Horizontal

Position

Vertical Both

Color

Shape Tilt

Size

Length Area Volume

Points Lines Areas

Encoding visually with marks and channels

bull analyze idiom structurendash as combination of marks and channels

23

1 vertical position

mark line

2 vertical positionhorizontal position

mark point

3 vertical positionhorizontal positioncolor hue

mark point

4 vertical positionhorizontal positioncolor huesize (area)

mark point

24

Channels Expressiveness types and effectiveness rankingsMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

25

Channels Matching TypesMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

bull expressiveness principlendash match channel and data characteristics

26

Channels RankingsMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

bull expressiveness principlendash match channel and data characteristics

bull effectiveness principlendash encode most important attributes with

highest ranked channels

27

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

How to handle complexity 3 more strategies

28

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull change view over timebull facet across multiple

viewsbull reduce itemsattributes

within single viewbull derive new data to

show within view

How to handle complexity 3 more strategies

29

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull change over time- most obvious amp flexible

of the 4 strategies

Idiom Animated transitionsbull smooth transition from one state to another

ndash alternative to jump cutsndash support for item tracking when amount of change is limited

bull example multilevel matrix viewsndash scope of what is shown narrows down

bull middle block stretches to fill space additional structure appears withinbull other blocks squish down to increasingly aggregated representations

30[Using Multilevel Call Matrices in Large Software Projects van Ham Proc IEEE Symp Information Visualization (InfoVis) pp 227ndash232 2003]

How to handle complexity 3 more strategies

31

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull facet data across multiple views

Facet

32

Juxtapose

Partition

Superimpose

Coordinate Multiple Side By Side Views

Share Encoding SameDierent

Share Data AllSubsetNone

Share Navigation

Linked Highlighting

Idiom Linked highlighting

33

System EDVbull see how regions

contiguous in one view are distributed within anotherndash powerful and pervasive

interaction idiom

bull encoding differentndashmultiform

bull data all shared

[Visual Exploration of Large Structured Datasets Wills Proc New Techniques and Trends in Statistics (NTTS) pp 237ndash246 IOS Press 1995]

Idiom birdrsquos-eye maps

34

bull encoding samebull data subset sharedbull navigation shared

ndash bidirectional linking

bull differencesndash viewpointndash (size)

bull overview-detail

System Google Maps

[A Review of Overview+Detail Zooming and Focus+Context Interfaces Cockburn Karlson and Bederson ACM Computing Surveys 411 (2008) 1ndash31]

Idiom Small multiplesbull encoding samebull data none shared

ndash different attributes for node colors

ndash (same network layout)

bull navigation shared

35

System Cerebral

[Cerebral Visualizing Multiple Experimental Conditions on a Graph with Biological Context Barsky Munzner Gardy and Kincaid IEEE Trans Visualization and Computer Graphics (Proc InfoVis 2008) 146 (2008) 1253ndash1260]

Coordinate views Design choice interaction

36

All Subset

Same

Multiform

Multiform Overview

Detail

None

Redundant

No Linkage

Small Multiples

OverviewDetail

bull why juxtapose viewsndash benefits eyes vs memory

bull lower cognitive load to move eyes between 2 views than remembering previous state with single changing view

ndash costs display area 2 views side by side each have only half the area of one view

Partition into views

37

bull how to divide data between viewsndash encodes association between items

using spatial proximity ndash major implications for what patterns

are visiblendash split according to attributes

bull design choicesndash how many splits

bull all the way down one mark per regionbull stop earlier for more complex structure

within region

ndash order in which attribs used to splitndash how many views

Partition into Side-by-Side Views

Partitioning List alignmentbull single bar chart with grouped bars

ndash split by state into regionsbull complex glyph within each region showing all ages

ndash compare easy within state hard across ages

bull small-multiple bar chartsndash split by age into regions

bull one chart per region

ndash compare easy within age harder across states

38

110

100

90

80

70

60

50

40

30

20

10

00 CA TK NY FL IL PA

65 Years and Over45 to 64 Years25 to 44 Years18 to 24 Years14 to 17 Years5 to 13 YearsUnder 5 Years

CA TK NY FL IL PA

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

httpblocksorgmbostock3887051 httpblocksorgmbostock4679202

Partitioning Recursive subdivision

bull split by neighborhoodbull then by type bull then time

ndash years as rowsndash months as columns

bull color by price

bull neighborhood patternsndash where itrsquos expensivendash where you pay much more

for detached type

39[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull switch order of splitsndash type then neighborhood

bull switch colorndash by price variation

bull type patternsndash within specific type which

neighborhoods inconsistent

40[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull different encoding for second-level regionsndash choropleth maps

41[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

How to handle complexity 3 more strategies

42

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull reduce what is shown within single view

Reduce items and attributes

43

bull reduceincrease inversesbull filter

ndash pro straightforward and intuitivebull to understand and compute

ndash con out of sight out of mind

bull aggregationndash pro inform about whole setndash con difficult to avoid losing signal

bull not mutually exclusivendash combine filter aggregatendash combine reduce facet change derive

Reduce

Filter

Aggregate

Embed

Reducing Items and Attributes

FilterItems

Attributes

Aggregate

Items

Attributes

Idiom boxplotbull static item aggregationbull task find distributionbull data tablebull derived data

ndash 5 quant attribsbull median central linebull lower and upper quartile boxesbull lower upper fences whiskers

ndash values beyond which items are outliers

ndash outliers beyond fence cutoffs explicitly shown

44

pod and the rug plot looks like the seeds within Kampstra (2008) also suggests a way of comparing two

groups more easily use the left and right sides of the bean to display different distributions A related idea

is the raindrop plot (Barrowman and Myers 2003) but its focus is on the display of error distributions from

complex models

Figure 4 demonstrates these density boxplots applied to 100 numbers drawn from each of four distribu-

tions with mean 0 and standard deviation 1 a standard normal a skew-right distribution (Johnson distri-

bution with skewness 22 and kurtosis 13) a leptikurtic distribution (Johnson distribution with skewness 0

and kurtosis 20) and a bimodal distribution (two normals with mean -095 and 095 and standard devia-

tion 031) Richer displays of density make it much easier to see important variations in the distribution

multi-modality is particularly important and yet completely invisible with the boxplot

n s k mm

2

02

4

n s k mm

2

02

4

n s k mm

4

2

02

4

4

2

02

4

n s k mm

Figure 4 From left to right box plot vase plot violin plot and bean plot Within each plot the distributions from left to

right are standard normal (n) right-skewed (s) leptikurtic (k) and bimodal (mm) A normal kernel and bandwidth of

02 are used in all plots for all groups

A more sophisticated display is the sectioned density plot (Cohen and Cohen 2006) which uses both

colour and space to stack a density estimate into a smaller area hopefully without losing any information

(not formally verified with a perceptual study) The sectioned density plot is similar in spirit to horizon

graphs for time series (Reijner 2008) which have been found to be just as readable as regular line graphs

despite taking up much less space (Heer et al 2009) The density strips of Jackson (2008) provide a similar

compact display that uses colour instead of width to display density These methods are shown in Figure 5

6

[40 years of boxplots Wickham and Stryjewski 2012 hadconz]

Idiom Dimensionality reduction for documents

45

Task 1

InHD data

Out2D data

ProduceIn High- dimensional data

WhyWhat

Derive

In2D data

Task 2

Out 2D data

HowWhyWhat

EncodeNavigateSelect

DiscoverExploreIdentify

In 2D dataOut ScatterplotOut Clusters amp points

OutScatterplotClusters amp points

Task 3

InScatterplotClusters amp points

OutLabels for clusters

WhyWhat

ProduceAnnotate

In ScatterplotIn Clusters amp pointsOut Labels for clusters

wombat

bull attribute aggregationndash derive low-dimensional target space from high-dimensional measured space

46

Datasets

WhatAttributes

Dataset Types

Data Types

Data and Dataset Types

Tables

Attributes (columns)

Items (rows)

Cell containing value

Networks

Link

Node (item)

Trees

Fields (Continuous)

Geometry (Spatial)

Attributes (columns)

Value in cell

Cell

Multidimensional Table

Value in cell

Items Attributes Links Positions Grids

Attribute Types

Ordering Direction

Categorical

OrderedOrdinal

Quantitative

Sequential

Diverging

Cyclic

Tables Networks amp Trees

Fields Geometry Clusters Sets Lists

Items

Attributes

Items (nodes)

Links

Attributes

Grids

Positions

Attributes

Items

Positions

Items

Grid of positions

Position

Trends

Actions

Analyze

Search

Query

Why

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Network Data

Spatial Data

Topology

Paths

Extremes

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

Identify Compare Summarize

tag

Target known Target unknown

Location knownLocation unknown

Lookup

Locate

Browse

Explore

Targets

Why

What

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

algorithm

idiom

abstraction

domain

More Informationbull this talk

httpwwwcsubcca~tmmtalkshtmlvad15d3

bull book page (including tutorial lecture slides)httpwwwcsubcca~tmmvadbook

ndash 20 promo code for book+ebook combo HVN17

ndash httpwwwcrcpresscomproductisbn9781466508910

ndash illustrations Eamonn Maguire

bull papers videos software talks full courses httpwwwcsubccagroupinfovis httpwwwcsubcca~tmm

47Munzner A K Peters Visualization Series CRC Press Visualization Series 2014

Visualization Analysis and Design

tamaramunzner

Page 8: Visualization Analysis & Designii.uib.no/vis_old/publications/pdfs/2016-01-17--vad15d3unconf--edited-by-HH.pdf · Visualization is suitable when there is a need to augment human capabilities

Why is validation difficult

bull different ways to get it wrong at each level

7

Domain situationYou misunderstood their needs

Yoursquore showing them the wrong thing

Visual encodinginteraction idiomThe way you show it doesnrsquot work

AlgorithmYour code is too slow

Datatask abstraction

8

Why is validation difficult

Domain situationObserve target users using existing tools

Visual encodinginteraction idiomJustify design with respect to alternatives

AlgorithmMeasure system timememoryAnalyze computational complexity

Observe target users after deployment ( )

Measure adoption

Analyze results qualitativelyMeasure human time with lab experiment (lab study)

Datatask abstraction

computer science

design

cognitive psychology

anthropologyethnography

anthropologyethnography

technique-driven work

[A Nested Model of Visualization Design and Validation Munzner IEEE TVCG 15(6)921-928 2009 (Proc InfoVis 2009) ]

bull solution use methods from different fields at each level

Hauser
Typewritten Text

8

Why is validation difficult

Domain situationObserve target users using existing tools

Visual encodinginteraction idiomJustify design with respect to alternatives

AlgorithmMeasure system timememoryAnalyze computational complexity

Observe target users after deployment ( )

Measure adoption

Analyze results qualitativelyMeasure human time with lab experiment (lab study)

Datatask abstraction

computer science

design

cognitive psychology

anthropologyethnography

anthropologyethnography

technique-driven work

[A Nested Model of Visualization Design and Validation Munzner IEEE TVCG 15(6)921-928 2009 (Proc InfoVis 2009) ]

bull solution use methods from different fields at each level

Hauser
Typewritten Text

8

Why is validation difficult

Domain situationObserve target users using existing tools

Visual encodinginteraction idiomJustify design with respect to alternatives

AlgorithmMeasure system timememoryAnalyze computational complexity

Observe target users after deployment ( )

Measure adoption

Analyze results qualitativelyMeasure human time with lab experiment (lab study)

Datatask abstraction

computer science

design

cognitive psychology

anthropologyethnography

anthropologyethnography

problem-driven work

technique-driven work

[A Nested Model of Visualization Design and Validation Munzner IEEE TVCG 15(6)921-928 2009 (Proc InfoVis 2009) ]

bull solution use methods from different fields at each level

Why analyze

bull imposes a structure on huge design spacendash scaffold to help you think

systematically about choicesndash analyzing existing as stepping stone

to designing new

Why analyze

bull imposes a structure on huge design spacendash scaffold to help you think

systematically about choicesndash analyzing existing as stepping stone

to designing new

9

[SpaceTree Supporting Exploration in Large Node Link Tree Design Evolution and Empirical Evaluation Grosjean Plaisant and Bederson Proc InfoVis 2002 p 57ndash64]

SpaceTree

[TreeJuxtaposer Scalable Tree Comparison Using Focus+Context With Guaranteed Visibility ACM Trans on Graphics (Proc SIGGRAPH) 22453ndash 462 2003]

TreeJuxtaposer

Present Locate Identify

Path between two nodes

Actions

Targets

SpaceTree

TreeJuxtaposer

Encode Navigate Select Filter AggregateTree

Arrange

Why What How

Encode Navigate Select

Datasets

WhatAttributes

Dataset Types

Data Types

Data and Dataset Types

Tables

Attributes (columns)

Items (rows)

Cell containing value

Networks

Link

Node (item)

Trees

Fields (Continuous)

Geometry (Spatial)

Attributes (columns)

Value in cell

Cell

Multidimensional Table

Value in cell

Items Attributes Links Positions Grids

Attribute Types

Ordering Direction

Categorical

OrderedOrdinal

Quantitative

Sequential

Diverging

Cyclic

Tables Networks amp Trees

Fields Geometry Clusters Sets Lists

Items

Attributes

Items (nodes)

Links

Attributes

Grids

Positions

Attributes

Items

Positions

Items

Grid of positions

Position10

Why

How

What

Dataset Availability

Static Dynamic

Types Datasets and data

11

Tables

Attributes (columns)

Items (rows)

Cell containing value

Dataset Types

Attribute TypesCategorical Ordered

Ordinal Quantitative

Networks

Link

Node (item)

Node (item)

Fields (Continuous)

Attributes (columns)

Value in cell

Cell

Grid of positions

Geometry (Spatial)

Position

SpatialNetworks

12

bull action target pairsndash discover distribution

ndash compare trends

ndash locate outliers

ndash browse topology

Trends

Actions

Analyze

Search

Query

Why

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Network Data

Spatial DataShape

Topology

Paths

Extremes

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

Identify Compare Summarize

tag

Target known Target unknown

Location knownLocation unknown

Lookup

Locate

Browse

Explore

Targets

Why

How

What

13

Actions 1 Analyzebull consume

ndashdiscover vs presentbull classic splitbull aka explore vs explain

ndashenjoybull newcomerbull aka casual social

bull producendashannotate recordndashderive

bull crucial design choice

Analyze

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

tag

Derive

bull donrsquot just draw what yoursquore givenndash decide what the right thing to show isndash create it with a series of transformations from the original datasetndash draw that

bull one of the four major strategies for handling complexity

14Original Data

exports

imports

Derived Data

trade balance = exports imports

trade balance

Analysis example Derive one attribute

15

[Using Strahler numbers for real time visual exploration of huge graphs Auber Proc Intl Conf Computer Vision and Graphics pp 56ndash69 2002]

bull Strahler numberndash centrality metric for treesnetworksndash derived quantitative attribute

ndash draw top 5K of 500K for good skeleton

Task 1

58

54

64

84

24

74

6484

84

94

74

OutQuantitative attribute on nodes

58

54

64

84

24

74

6484

84

94

74

InQuantitative attribute on nodes

Task 2

Derive

WhyWhat

In Tree ReduceSummarize

HowWhyWhat

In Quantitative attribute on nodes TopologyIn Tree

Filter

InTree

OutFiltered TreeRemoved unimportant parts

InTree +

Out Quantitative attribute on nodes Out Filtered Tree

16

Actions II Search

bull what does user knowndash target location

Search

Target known Target unknown

Location known

Location unknown

Lookup

Locate

Browse

Explore

17

Actions III Query

bull what does user knowndash target location

bull how much of the data mattersndash one some all

bull analyze search queryndash independent choices for each

Search

Query

Identify Compare Summarize

Target known Target unknown

Location known

Location unknown

Lookup

Locate

Browse

Explore

Targets

18

Trends

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Extremes

Network Data

Spatial DataShape

Topology

Paths

19

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

How to encode Arrange space map channels

20

Encode

ArrangeExpress Separate

Order Align

Use

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

Encoding visually

bull analyze idiom structure

21

22

Definitions Marks and channelsbull marks

ndash geometric primitives

bull channelsndash control appearance of marks

Horizontal

Position

Vertical Both

Color

Shape Tilt

Size

Length Area Volume

Points Lines Areas

Encoding visually with marks and channels

bull analyze idiom structurendash as combination of marks and channels

23

1 vertical position

mark line

2 vertical positionhorizontal position

mark point

3 vertical positionhorizontal positioncolor hue

mark point

4 vertical positionhorizontal positioncolor huesize (area)

mark point

24

Channels Expressiveness types and effectiveness rankingsMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

25

Channels Matching TypesMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

bull expressiveness principlendash match channel and data characteristics

26

Channels RankingsMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

bull expressiveness principlendash match channel and data characteristics

bull effectiveness principlendash encode most important attributes with

highest ranked channels

27

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

How to handle complexity 3 more strategies

28

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull change view over timebull facet across multiple

viewsbull reduce itemsattributes

within single viewbull derive new data to

show within view

How to handle complexity 3 more strategies

29

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull change over time- most obvious amp flexible

of the 4 strategies

Idiom Animated transitionsbull smooth transition from one state to another

ndash alternative to jump cutsndash support for item tracking when amount of change is limited

bull example multilevel matrix viewsndash scope of what is shown narrows down

bull middle block stretches to fill space additional structure appears withinbull other blocks squish down to increasingly aggregated representations

30[Using Multilevel Call Matrices in Large Software Projects van Ham Proc IEEE Symp Information Visualization (InfoVis) pp 227ndash232 2003]

How to handle complexity 3 more strategies

31

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull facet data across multiple views

Facet

32

Juxtapose

Partition

Superimpose

Coordinate Multiple Side By Side Views

Share Encoding SameDierent

Share Data AllSubsetNone

Share Navigation

Linked Highlighting

Idiom Linked highlighting

33

System EDVbull see how regions

contiguous in one view are distributed within anotherndash powerful and pervasive

interaction idiom

bull encoding differentndashmultiform

bull data all shared

[Visual Exploration of Large Structured Datasets Wills Proc New Techniques and Trends in Statistics (NTTS) pp 237ndash246 IOS Press 1995]

Idiom birdrsquos-eye maps

34

bull encoding samebull data subset sharedbull navigation shared

ndash bidirectional linking

bull differencesndash viewpointndash (size)

bull overview-detail

System Google Maps

[A Review of Overview+Detail Zooming and Focus+Context Interfaces Cockburn Karlson and Bederson ACM Computing Surveys 411 (2008) 1ndash31]

Idiom Small multiplesbull encoding samebull data none shared

ndash different attributes for node colors

ndash (same network layout)

bull navigation shared

35

System Cerebral

[Cerebral Visualizing Multiple Experimental Conditions on a Graph with Biological Context Barsky Munzner Gardy and Kincaid IEEE Trans Visualization and Computer Graphics (Proc InfoVis 2008) 146 (2008) 1253ndash1260]

Coordinate views Design choice interaction

36

All Subset

Same

Multiform

Multiform Overview

Detail

None

Redundant

No Linkage

Small Multiples

OverviewDetail

bull why juxtapose viewsndash benefits eyes vs memory

bull lower cognitive load to move eyes between 2 views than remembering previous state with single changing view

ndash costs display area 2 views side by side each have only half the area of one view

Partition into views

37

bull how to divide data between viewsndash encodes association between items

using spatial proximity ndash major implications for what patterns

are visiblendash split according to attributes

bull design choicesndash how many splits

bull all the way down one mark per regionbull stop earlier for more complex structure

within region

ndash order in which attribs used to splitndash how many views

Partition into Side-by-Side Views

Partitioning List alignmentbull single bar chart with grouped bars

ndash split by state into regionsbull complex glyph within each region showing all ages

ndash compare easy within state hard across ages

bull small-multiple bar chartsndash split by age into regions

bull one chart per region

ndash compare easy within age harder across states

38

110

100

90

80

70

60

50

40

30

20

10

00 CA TK NY FL IL PA

65 Years and Over45 to 64 Years25 to 44 Years18 to 24 Years14 to 17 Years5 to 13 YearsUnder 5 Years

CA TK NY FL IL PA

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

httpblocksorgmbostock3887051 httpblocksorgmbostock4679202

Partitioning Recursive subdivision

bull split by neighborhoodbull then by type bull then time

ndash years as rowsndash months as columns

bull color by price

bull neighborhood patternsndash where itrsquos expensivendash where you pay much more

for detached type

39[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull switch order of splitsndash type then neighborhood

bull switch colorndash by price variation

bull type patternsndash within specific type which

neighborhoods inconsistent

40[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull different encoding for second-level regionsndash choropleth maps

41[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

How to handle complexity 3 more strategies

42

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull reduce what is shown within single view

Reduce items and attributes

43

bull reduceincrease inversesbull filter

ndash pro straightforward and intuitivebull to understand and compute

ndash con out of sight out of mind

bull aggregationndash pro inform about whole setndash con difficult to avoid losing signal

bull not mutually exclusivendash combine filter aggregatendash combine reduce facet change derive

Reduce

Filter

Aggregate

Embed

Reducing Items and Attributes

FilterItems

Attributes

Aggregate

Items

Attributes

Idiom boxplotbull static item aggregationbull task find distributionbull data tablebull derived data

ndash 5 quant attribsbull median central linebull lower and upper quartile boxesbull lower upper fences whiskers

ndash values beyond which items are outliers

ndash outliers beyond fence cutoffs explicitly shown

44

pod and the rug plot looks like the seeds within Kampstra (2008) also suggests a way of comparing two

groups more easily use the left and right sides of the bean to display different distributions A related idea

is the raindrop plot (Barrowman and Myers 2003) but its focus is on the display of error distributions from

complex models

Figure 4 demonstrates these density boxplots applied to 100 numbers drawn from each of four distribu-

tions with mean 0 and standard deviation 1 a standard normal a skew-right distribution (Johnson distri-

bution with skewness 22 and kurtosis 13) a leptikurtic distribution (Johnson distribution with skewness 0

and kurtosis 20) and a bimodal distribution (two normals with mean -095 and 095 and standard devia-

tion 031) Richer displays of density make it much easier to see important variations in the distribution

multi-modality is particularly important and yet completely invisible with the boxplot

n s k mm

2

02

4

n s k mm

2

02

4

n s k mm

4

2

02

4

4

2

02

4

n s k mm

Figure 4 From left to right box plot vase plot violin plot and bean plot Within each plot the distributions from left to

right are standard normal (n) right-skewed (s) leptikurtic (k) and bimodal (mm) A normal kernel and bandwidth of

02 are used in all plots for all groups

A more sophisticated display is the sectioned density plot (Cohen and Cohen 2006) which uses both

colour and space to stack a density estimate into a smaller area hopefully without losing any information

(not formally verified with a perceptual study) The sectioned density plot is similar in spirit to horizon

graphs for time series (Reijner 2008) which have been found to be just as readable as regular line graphs

despite taking up much less space (Heer et al 2009) The density strips of Jackson (2008) provide a similar

compact display that uses colour instead of width to display density These methods are shown in Figure 5

6

[40 years of boxplots Wickham and Stryjewski 2012 hadconz]

Idiom Dimensionality reduction for documents

45

Task 1

InHD data

Out2D data

ProduceIn High- dimensional data

WhyWhat

Derive

In2D data

Task 2

Out 2D data

HowWhyWhat

EncodeNavigateSelect

DiscoverExploreIdentify

In 2D dataOut ScatterplotOut Clusters amp points

OutScatterplotClusters amp points

Task 3

InScatterplotClusters amp points

OutLabels for clusters

WhyWhat

ProduceAnnotate

In ScatterplotIn Clusters amp pointsOut Labels for clusters

wombat

bull attribute aggregationndash derive low-dimensional target space from high-dimensional measured space

46

Datasets

WhatAttributes

Dataset Types

Data Types

Data and Dataset Types

Tables

Attributes (columns)

Items (rows)

Cell containing value

Networks

Link

Node (item)

Trees

Fields (Continuous)

Geometry (Spatial)

Attributes (columns)

Value in cell

Cell

Multidimensional Table

Value in cell

Items Attributes Links Positions Grids

Attribute Types

Ordering Direction

Categorical

OrderedOrdinal

Quantitative

Sequential

Diverging

Cyclic

Tables Networks amp Trees

Fields Geometry Clusters Sets Lists

Items

Attributes

Items (nodes)

Links

Attributes

Grids

Positions

Attributes

Items

Positions

Items

Grid of positions

Position

Trends

Actions

Analyze

Search

Query

Why

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Network Data

Spatial Data

Topology

Paths

Extremes

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

Identify Compare Summarize

tag

Target known Target unknown

Location knownLocation unknown

Lookup

Locate

Browse

Explore

Targets

Why

What

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

algorithm

idiom

abstraction

domain

More Informationbull this talk

httpwwwcsubcca~tmmtalkshtmlvad15d3

bull book page (including tutorial lecture slides)httpwwwcsubcca~tmmvadbook

ndash 20 promo code for book+ebook combo HVN17

ndash httpwwwcrcpresscomproductisbn9781466508910

ndash illustrations Eamonn Maguire

bull papers videos software talks full courses httpwwwcsubccagroupinfovis httpwwwcsubcca~tmm

47Munzner A K Peters Visualization Series CRC Press Visualization Series 2014

Visualization Analysis and Design

tamaramunzner

Page 9: Visualization Analysis & Designii.uib.no/vis_old/publications/pdfs/2016-01-17--vad15d3unconf--edited-by-HH.pdf · Visualization is suitable when there is a need to augment human capabilities

8

Why is validation difficult

Domain situationObserve target users using existing tools

Visual encodinginteraction idiomJustify design with respect to alternatives

AlgorithmMeasure system timememoryAnalyze computational complexity

Observe target users after deployment ( )

Measure adoption

Analyze results qualitativelyMeasure human time with lab experiment (lab study)

Datatask abstraction

computer science

design

cognitive psychology

anthropologyethnography

anthropologyethnography

technique-driven work

[A Nested Model of Visualization Design and Validation Munzner IEEE TVCG 15(6)921-928 2009 (Proc InfoVis 2009) ]

bull solution use methods from different fields at each level

Hauser
Typewritten Text

8

Why is validation difficult

Domain situationObserve target users using existing tools

Visual encodinginteraction idiomJustify design with respect to alternatives

AlgorithmMeasure system timememoryAnalyze computational complexity

Observe target users after deployment ( )

Measure adoption

Analyze results qualitativelyMeasure human time with lab experiment (lab study)

Datatask abstraction

computer science

design

cognitive psychology

anthropologyethnography

anthropologyethnography

technique-driven work

[A Nested Model of Visualization Design and Validation Munzner IEEE TVCG 15(6)921-928 2009 (Proc InfoVis 2009) ]

bull solution use methods from different fields at each level

Hauser
Typewritten Text

8

Why is validation difficult

Domain situationObserve target users using existing tools

Visual encodinginteraction idiomJustify design with respect to alternatives

AlgorithmMeasure system timememoryAnalyze computational complexity

Observe target users after deployment ( )

Measure adoption

Analyze results qualitativelyMeasure human time with lab experiment (lab study)

Datatask abstraction

computer science

design

cognitive psychology

anthropologyethnography

anthropologyethnography

problem-driven work

technique-driven work

[A Nested Model of Visualization Design and Validation Munzner IEEE TVCG 15(6)921-928 2009 (Proc InfoVis 2009) ]

bull solution use methods from different fields at each level

Why analyze

bull imposes a structure on huge design spacendash scaffold to help you think

systematically about choicesndash analyzing existing as stepping stone

to designing new

Why analyze

bull imposes a structure on huge design spacendash scaffold to help you think

systematically about choicesndash analyzing existing as stepping stone

to designing new

9

[SpaceTree Supporting Exploration in Large Node Link Tree Design Evolution and Empirical Evaluation Grosjean Plaisant and Bederson Proc InfoVis 2002 p 57ndash64]

SpaceTree

[TreeJuxtaposer Scalable Tree Comparison Using Focus+Context With Guaranteed Visibility ACM Trans on Graphics (Proc SIGGRAPH) 22453ndash 462 2003]

TreeJuxtaposer

Present Locate Identify

Path between two nodes

Actions

Targets

SpaceTree

TreeJuxtaposer

Encode Navigate Select Filter AggregateTree

Arrange

Why What How

Encode Navigate Select

Datasets

WhatAttributes

Dataset Types

Data Types

Data and Dataset Types

Tables

Attributes (columns)

Items (rows)

Cell containing value

Networks

Link

Node (item)

Trees

Fields (Continuous)

Geometry (Spatial)

Attributes (columns)

Value in cell

Cell

Multidimensional Table

Value in cell

Items Attributes Links Positions Grids

Attribute Types

Ordering Direction

Categorical

OrderedOrdinal

Quantitative

Sequential

Diverging

Cyclic

Tables Networks amp Trees

Fields Geometry Clusters Sets Lists

Items

Attributes

Items (nodes)

Links

Attributes

Grids

Positions

Attributes

Items

Positions

Items

Grid of positions

Position10

Why

How

What

Dataset Availability

Static Dynamic

Types Datasets and data

11

Tables

Attributes (columns)

Items (rows)

Cell containing value

Dataset Types

Attribute TypesCategorical Ordered

Ordinal Quantitative

Networks

Link

Node (item)

Node (item)

Fields (Continuous)

Attributes (columns)

Value in cell

Cell

Grid of positions

Geometry (Spatial)

Position

SpatialNetworks

12

bull action target pairsndash discover distribution

ndash compare trends

ndash locate outliers

ndash browse topology

Trends

Actions

Analyze

Search

Query

Why

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Network Data

Spatial DataShape

Topology

Paths

Extremes

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

Identify Compare Summarize

tag

Target known Target unknown

Location knownLocation unknown

Lookup

Locate

Browse

Explore

Targets

Why

How

What

13

Actions 1 Analyzebull consume

ndashdiscover vs presentbull classic splitbull aka explore vs explain

ndashenjoybull newcomerbull aka casual social

bull producendashannotate recordndashderive

bull crucial design choice

Analyze

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

tag

Derive

bull donrsquot just draw what yoursquore givenndash decide what the right thing to show isndash create it with a series of transformations from the original datasetndash draw that

bull one of the four major strategies for handling complexity

14Original Data

exports

imports

Derived Data

trade balance = exports imports

trade balance

Analysis example Derive one attribute

15

[Using Strahler numbers for real time visual exploration of huge graphs Auber Proc Intl Conf Computer Vision and Graphics pp 56ndash69 2002]

bull Strahler numberndash centrality metric for treesnetworksndash derived quantitative attribute

ndash draw top 5K of 500K for good skeleton

Task 1

58

54

64

84

24

74

6484

84

94

74

OutQuantitative attribute on nodes

58

54

64

84

24

74

6484

84

94

74

InQuantitative attribute on nodes

Task 2

Derive

WhyWhat

In Tree ReduceSummarize

HowWhyWhat

In Quantitative attribute on nodes TopologyIn Tree

Filter

InTree

OutFiltered TreeRemoved unimportant parts

InTree +

Out Quantitative attribute on nodes Out Filtered Tree

16

Actions II Search

bull what does user knowndash target location

Search

Target known Target unknown

Location known

Location unknown

Lookup

Locate

Browse

Explore

17

Actions III Query

bull what does user knowndash target location

bull how much of the data mattersndash one some all

bull analyze search queryndash independent choices for each

Search

Query

Identify Compare Summarize

Target known Target unknown

Location known

Location unknown

Lookup

Locate

Browse

Explore

Targets

18

Trends

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Extremes

Network Data

Spatial DataShape

Topology

Paths

19

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

How to encode Arrange space map channels

20

Encode

ArrangeExpress Separate

Order Align

Use

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

Encoding visually

bull analyze idiom structure

21

22

Definitions Marks and channelsbull marks

ndash geometric primitives

bull channelsndash control appearance of marks

Horizontal

Position

Vertical Both

Color

Shape Tilt

Size

Length Area Volume

Points Lines Areas

Encoding visually with marks and channels

bull analyze idiom structurendash as combination of marks and channels

23

1 vertical position

mark line

2 vertical positionhorizontal position

mark point

3 vertical positionhorizontal positioncolor hue

mark point

4 vertical positionhorizontal positioncolor huesize (area)

mark point

24

Channels Expressiveness types and effectiveness rankingsMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

25

Channels Matching TypesMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

bull expressiveness principlendash match channel and data characteristics

26

Channels RankingsMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

bull expressiveness principlendash match channel and data characteristics

bull effectiveness principlendash encode most important attributes with

highest ranked channels

27

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

How to handle complexity 3 more strategies

28

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull change view over timebull facet across multiple

viewsbull reduce itemsattributes

within single viewbull derive new data to

show within view

How to handle complexity 3 more strategies

29

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull change over time- most obvious amp flexible

of the 4 strategies

Idiom Animated transitionsbull smooth transition from one state to another

ndash alternative to jump cutsndash support for item tracking when amount of change is limited

bull example multilevel matrix viewsndash scope of what is shown narrows down

bull middle block stretches to fill space additional structure appears withinbull other blocks squish down to increasingly aggregated representations

30[Using Multilevel Call Matrices in Large Software Projects van Ham Proc IEEE Symp Information Visualization (InfoVis) pp 227ndash232 2003]

How to handle complexity 3 more strategies

31

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull facet data across multiple views

Facet

32

Juxtapose

Partition

Superimpose

Coordinate Multiple Side By Side Views

Share Encoding SameDierent

Share Data AllSubsetNone

Share Navigation

Linked Highlighting

Idiom Linked highlighting

33

System EDVbull see how regions

contiguous in one view are distributed within anotherndash powerful and pervasive

interaction idiom

bull encoding differentndashmultiform

bull data all shared

[Visual Exploration of Large Structured Datasets Wills Proc New Techniques and Trends in Statistics (NTTS) pp 237ndash246 IOS Press 1995]

Idiom birdrsquos-eye maps

34

bull encoding samebull data subset sharedbull navigation shared

ndash bidirectional linking

bull differencesndash viewpointndash (size)

bull overview-detail

System Google Maps

[A Review of Overview+Detail Zooming and Focus+Context Interfaces Cockburn Karlson and Bederson ACM Computing Surveys 411 (2008) 1ndash31]

Idiom Small multiplesbull encoding samebull data none shared

ndash different attributes for node colors

ndash (same network layout)

bull navigation shared

35

System Cerebral

[Cerebral Visualizing Multiple Experimental Conditions on a Graph with Biological Context Barsky Munzner Gardy and Kincaid IEEE Trans Visualization and Computer Graphics (Proc InfoVis 2008) 146 (2008) 1253ndash1260]

Coordinate views Design choice interaction

36

All Subset

Same

Multiform

Multiform Overview

Detail

None

Redundant

No Linkage

Small Multiples

OverviewDetail

bull why juxtapose viewsndash benefits eyes vs memory

bull lower cognitive load to move eyes between 2 views than remembering previous state with single changing view

ndash costs display area 2 views side by side each have only half the area of one view

Partition into views

37

bull how to divide data between viewsndash encodes association between items

using spatial proximity ndash major implications for what patterns

are visiblendash split according to attributes

bull design choicesndash how many splits

bull all the way down one mark per regionbull stop earlier for more complex structure

within region

ndash order in which attribs used to splitndash how many views

Partition into Side-by-Side Views

Partitioning List alignmentbull single bar chart with grouped bars

ndash split by state into regionsbull complex glyph within each region showing all ages

ndash compare easy within state hard across ages

bull small-multiple bar chartsndash split by age into regions

bull one chart per region

ndash compare easy within age harder across states

38

110

100

90

80

70

60

50

40

30

20

10

00 CA TK NY FL IL PA

65 Years and Over45 to 64 Years25 to 44 Years18 to 24 Years14 to 17 Years5 to 13 YearsUnder 5 Years

CA TK NY FL IL PA

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

httpblocksorgmbostock3887051 httpblocksorgmbostock4679202

Partitioning Recursive subdivision

bull split by neighborhoodbull then by type bull then time

ndash years as rowsndash months as columns

bull color by price

bull neighborhood patternsndash where itrsquos expensivendash where you pay much more

for detached type

39[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull switch order of splitsndash type then neighborhood

bull switch colorndash by price variation

bull type patternsndash within specific type which

neighborhoods inconsistent

40[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull different encoding for second-level regionsndash choropleth maps

41[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

How to handle complexity 3 more strategies

42

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull reduce what is shown within single view

Reduce items and attributes

43

bull reduceincrease inversesbull filter

ndash pro straightforward and intuitivebull to understand and compute

ndash con out of sight out of mind

bull aggregationndash pro inform about whole setndash con difficult to avoid losing signal

bull not mutually exclusivendash combine filter aggregatendash combine reduce facet change derive

Reduce

Filter

Aggregate

Embed

Reducing Items and Attributes

FilterItems

Attributes

Aggregate

Items

Attributes

Idiom boxplotbull static item aggregationbull task find distributionbull data tablebull derived data

ndash 5 quant attribsbull median central linebull lower and upper quartile boxesbull lower upper fences whiskers

ndash values beyond which items are outliers

ndash outliers beyond fence cutoffs explicitly shown

44

pod and the rug plot looks like the seeds within Kampstra (2008) also suggests a way of comparing two

groups more easily use the left and right sides of the bean to display different distributions A related idea

is the raindrop plot (Barrowman and Myers 2003) but its focus is on the display of error distributions from

complex models

Figure 4 demonstrates these density boxplots applied to 100 numbers drawn from each of four distribu-

tions with mean 0 and standard deviation 1 a standard normal a skew-right distribution (Johnson distri-

bution with skewness 22 and kurtosis 13) a leptikurtic distribution (Johnson distribution with skewness 0

and kurtosis 20) and a bimodal distribution (two normals with mean -095 and 095 and standard devia-

tion 031) Richer displays of density make it much easier to see important variations in the distribution

multi-modality is particularly important and yet completely invisible with the boxplot

n s k mm

2

02

4

n s k mm

2

02

4

n s k mm

4

2

02

4

4

2

02

4

n s k mm

Figure 4 From left to right box plot vase plot violin plot and bean plot Within each plot the distributions from left to

right are standard normal (n) right-skewed (s) leptikurtic (k) and bimodal (mm) A normal kernel and bandwidth of

02 are used in all plots for all groups

A more sophisticated display is the sectioned density plot (Cohen and Cohen 2006) which uses both

colour and space to stack a density estimate into a smaller area hopefully without losing any information

(not formally verified with a perceptual study) The sectioned density plot is similar in spirit to horizon

graphs for time series (Reijner 2008) which have been found to be just as readable as regular line graphs

despite taking up much less space (Heer et al 2009) The density strips of Jackson (2008) provide a similar

compact display that uses colour instead of width to display density These methods are shown in Figure 5

6

[40 years of boxplots Wickham and Stryjewski 2012 hadconz]

Idiom Dimensionality reduction for documents

45

Task 1

InHD data

Out2D data

ProduceIn High- dimensional data

WhyWhat

Derive

In2D data

Task 2

Out 2D data

HowWhyWhat

EncodeNavigateSelect

DiscoverExploreIdentify

In 2D dataOut ScatterplotOut Clusters amp points

OutScatterplotClusters amp points

Task 3

InScatterplotClusters amp points

OutLabels for clusters

WhyWhat

ProduceAnnotate

In ScatterplotIn Clusters amp pointsOut Labels for clusters

wombat

bull attribute aggregationndash derive low-dimensional target space from high-dimensional measured space

46

Datasets

WhatAttributes

Dataset Types

Data Types

Data and Dataset Types

Tables

Attributes (columns)

Items (rows)

Cell containing value

Networks

Link

Node (item)

Trees

Fields (Continuous)

Geometry (Spatial)

Attributes (columns)

Value in cell

Cell

Multidimensional Table

Value in cell

Items Attributes Links Positions Grids

Attribute Types

Ordering Direction

Categorical

OrderedOrdinal

Quantitative

Sequential

Diverging

Cyclic

Tables Networks amp Trees

Fields Geometry Clusters Sets Lists

Items

Attributes

Items (nodes)

Links

Attributes

Grids

Positions

Attributes

Items

Positions

Items

Grid of positions

Position

Trends

Actions

Analyze

Search

Query

Why

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Network Data

Spatial Data

Topology

Paths

Extremes

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

Identify Compare Summarize

tag

Target known Target unknown

Location knownLocation unknown

Lookup

Locate

Browse

Explore

Targets

Why

What

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

algorithm

idiom

abstraction

domain

More Informationbull this talk

httpwwwcsubcca~tmmtalkshtmlvad15d3

bull book page (including tutorial lecture slides)httpwwwcsubcca~tmmvadbook

ndash 20 promo code for book+ebook combo HVN17

ndash httpwwwcrcpresscomproductisbn9781466508910

ndash illustrations Eamonn Maguire

bull papers videos software talks full courses httpwwwcsubccagroupinfovis httpwwwcsubcca~tmm

47Munzner A K Peters Visualization Series CRC Press Visualization Series 2014

Visualization Analysis and Design

tamaramunzner

Page 10: Visualization Analysis & Designii.uib.no/vis_old/publications/pdfs/2016-01-17--vad15d3unconf--edited-by-HH.pdf · Visualization is suitable when there is a need to augment human capabilities

8

Why is validation difficult

Domain situationObserve target users using existing tools

Visual encodinginteraction idiomJustify design with respect to alternatives

AlgorithmMeasure system timememoryAnalyze computational complexity

Observe target users after deployment ( )

Measure adoption

Analyze results qualitativelyMeasure human time with lab experiment (lab study)

Datatask abstraction

computer science

design

cognitive psychology

anthropologyethnography

anthropologyethnography

technique-driven work

[A Nested Model of Visualization Design and Validation Munzner IEEE TVCG 15(6)921-928 2009 (Proc InfoVis 2009) ]

bull solution use methods from different fields at each level

Hauser
Typewritten Text

8

Why is validation difficult

Domain situationObserve target users using existing tools

Visual encodinginteraction idiomJustify design with respect to alternatives

AlgorithmMeasure system timememoryAnalyze computational complexity

Observe target users after deployment ( )

Measure adoption

Analyze results qualitativelyMeasure human time with lab experiment (lab study)

Datatask abstraction

computer science

design

cognitive psychology

anthropologyethnography

anthropologyethnography

problem-driven work

technique-driven work

[A Nested Model of Visualization Design and Validation Munzner IEEE TVCG 15(6)921-928 2009 (Proc InfoVis 2009) ]

bull solution use methods from different fields at each level

Why analyze

bull imposes a structure on huge design spacendash scaffold to help you think

systematically about choicesndash analyzing existing as stepping stone

to designing new

Why analyze

bull imposes a structure on huge design spacendash scaffold to help you think

systematically about choicesndash analyzing existing as stepping stone

to designing new

9

[SpaceTree Supporting Exploration in Large Node Link Tree Design Evolution and Empirical Evaluation Grosjean Plaisant and Bederson Proc InfoVis 2002 p 57ndash64]

SpaceTree

[TreeJuxtaposer Scalable Tree Comparison Using Focus+Context With Guaranteed Visibility ACM Trans on Graphics (Proc SIGGRAPH) 22453ndash 462 2003]

TreeJuxtaposer

Present Locate Identify

Path between two nodes

Actions

Targets

SpaceTree

TreeJuxtaposer

Encode Navigate Select Filter AggregateTree

Arrange

Why What How

Encode Navigate Select

Datasets

WhatAttributes

Dataset Types

Data Types

Data and Dataset Types

Tables

Attributes (columns)

Items (rows)

Cell containing value

Networks

Link

Node (item)

Trees

Fields (Continuous)

Geometry (Spatial)

Attributes (columns)

Value in cell

Cell

Multidimensional Table

Value in cell

Items Attributes Links Positions Grids

Attribute Types

Ordering Direction

Categorical

OrderedOrdinal

Quantitative

Sequential

Diverging

Cyclic

Tables Networks amp Trees

Fields Geometry Clusters Sets Lists

Items

Attributes

Items (nodes)

Links

Attributes

Grids

Positions

Attributes

Items

Positions

Items

Grid of positions

Position10

Why

How

What

Dataset Availability

Static Dynamic

Types Datasets and data

11

Tables

Attributes (columns)

Items (rows)

Cell containing value

Dataset Types

Attribute TypesCategorical Ordered

Ordinal Quantitative

Networks

Link

Node (item)

Node (item)

Fields (Continuous)

Attributes (columns)

Value in cell

Cell

Grid of positions

Geometry (Spatial)

Position

SpatialNetworks

12

bull action target pairsndash discover distribution

ndash compare trends

ndash locate outliers

ndash browse topology

Trends

Actions

Analyze

Search

Query

Why

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Network Data

Spatial DataShape

Topology

Paths

Extremes

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

Identify Compare Summarize

tag

Target known Target unknown

Location knownLocation unknown

Lookup

Locate

Browse

Explore

Targets

Why

How

What

13

Actions 1 Analyzebull consume

ndashdiscover vs presentbull classic splitbull aka explore vs explain

ndashenjoybull newcomerbull aka casual social

bull producendashannotate recordndashderive

bull crucial design choice

Analyze

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

tag

Derive

bull donrsquot just draw what yoursquore givenndash decide what the right thing to show isndash create it with a series of transformations from the original datasetndash draw that

bull one of the four major strategies for handling complexity

14Original Data

exports

imports

Derived Data

trade balance = exports imports

trade balance

Analysis example Derive one attribute

15

[Using Strahler numbers for real time visual exploration of huge graphs Auber Proc Intl Conf Computer Vision and Graphics pp 56ndash69 2002]

bull Strahler numberndash centrality metric for treesnetworksndash derived quantitative attribute

ndash draw top 5K of 500K for good skeleton

Task 1

58

54

64

84

24

74

6484

84

94

74

OutQuantitative attribute on nodes

58

54

64

84

24

74

6484

84

94

74

InQuantitative attribute on nodes

Task 2

Derive

WhyWhat

In Tree ReduceSummarize

HowWhyWhat

In Quantitative attribute on nodes TopologyIn Tree

Filter

InTree

OutFiltered TreeRemoved unimportant parts

InTree +

Out Quantitative attribute on nodes Out Filtered Tree

16

Actions II Search

bull what does user knowndash target location

Search

Target known Target unknown

Location known

Location unknown

Lookup

Locate

Browse

Explore

17

Actions III Query

bull what does user knowndash target location

bull how much of the data mattersndash one some all

bull analyze search queryndash independent choices for each

Search

Query

Identify Compare Summarize

Target known Target unknown

Location known

Location unknown

Lookup

Locate

Browse

Explore

Targets

18

Trends

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Extremes

Network Data

Spatial DataShape

Topology

Paths

19

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

How to encode Arrange space map channels

20

Encode

ArrangeExpress Separate

Order Align

Use

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

Encoding visually

bull analyze idiom structure

21

22

Definitions Marks and channelsbull marks

ndash geometric primitives

bull channelsndash control appearance of marks

Horizontal

Position

Vertical Both

Color

Shape Tilt

Size

Length Area Volume

Points Lines Areas

Encoding visually with marks and channels

bull analyze idiom structurendash as combination of marks and channels

23

1 vertical position

mark line

2 vertical positionhorizontal position

mark point

3 vertical positionhorizontal positioncolor hue

mark point

4 vertical positionhorizontal positioncolor huesize (area)

mark point

24

Channels Expressiveness types and effectiveness rankingsMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

25

Channels Matching TypesMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

bull expressiveness principlendash match channel and data characteristics

26

Channels RankingsMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

bull expressiveness principlendash match channel and data characteristics

bull effectiveness principlendash encode most important attributes with

highest ranked channels

27

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

How to handle complexity 3 more strategies

28

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull change view over timebull facet across multiple

viewsbull reduce itemsattributes

within single viewbull derive new data to

show within view

How to handle complexity 3 more strategies

29

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull change over time- most obvious amp flexible

of the 4 strategies

Idiom Animated transitionsbull smooth transition from one state to another

ndash alternative to jump cutsndash support for item tracking when amount of change is limited

bull example multilevel matrix viewsndash scope of what is shown narrows down

bull middle block stretches to fill space additional structure appears withinbull other blocks squish down to increasingly aggregated representations

30[Using Multilevel Call Matrices in Large Software Projects van Ham Proc IEEE Symp Information Visualization (InfoVis) pp 227ndash232 2003]

How to handle complexity 3 more strategies

31

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull facet data across multiple views

Facet

32

Juxtapose

Partition

Superimpose

Coordinate Multiple Side By Side Views

Share Encoding SameDierent

Share Data AllSubsetNone

Share Navigation

Linked Highlighting

Idiom Linked highlighting

33

System EDVbull see how regions

contiguous in one view are distributed within anotherndash powerful and pervasive

interaction idiom

bull encoding differentndashmultiform

bull data all shared

[Visual Exploration of Large Structured Datasets Wills Proc New Techniques and Trends in Statistics (NTTS) pp 237ndash246 IOS Press 1995]

Idiom birdrsquos-eye maps

34

bull encoding samebull data subset sharedbull navigation shared

ndash bidirectional linking

bull differencesndash viewpointndash (size)

bull overview-detail

System Google Maps

[A Review of Overview+Detail Zooming and Focus+Context Interfaces Cockburn Karlson and Bederson ACM Computing Surveys 411 (2008) 1ndash31]

Idiom Small multiplesbull encoding samebull data none shared

ndash different attributes for node colors

ndash (same network layout)

bull navigation shared

35

System Cerebral

[Cerebral Visualizing Multiple Experimental Conditions on a Graph with Biological Context Barsky Munzner Gardy and Kincaid IEEE Trans Visualization and Computer Graphics (Proc InfoVis 2008) 146 (2008) 1253ndash1260]

Coordinate views Design choice interaction

36

All Subset

Same

Multiform

Multiform Overview

Detail

None

Redundant

No Linkage

Small Multiples

OverviewDetail

bull why juxtapose viewsndash benefits eyes vs memory

bull lower cognitive load to move eyes between 2 views than remembering previous state with single changing view

ndash costs display area 2 views side by side each have only half the area of one view

Partition into views

37

bull how to divide data between viewsndash encodes association between items

using spatial proximity ndash major implications for what patterns

are visiblendash split according to attributes

bull design choicesndash how many splits

bull all the way down one mark per regionbull stop earlier for more complex structure

within region

ndash order in which attribs used to splitndash how many views

Partition into Side-by-Side Views

Partitioning List alignmentbull single bar chart with grouped bars

ndash split by state into regionsbull complex glyph within each region showing all ages

ndash compare easy within state hard across ages

bull small-multiple bar chartsndash split by age into regions

bull one chart per region

ndash compare easy within age harder across states

38

110

100

90

80

70

60

50

40

30

20

10

00 CA TK NY FL IL PA

65 Years and Over45 to 64 Years25 to 44 Years18 to 24 Years14 to 17 Years5 to 13 YearsUnder 5 Years

CA TK NY FL IL PA

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

httpblocksorgmbostock3887051 httpblocksorgmbostock4679202

Partitioning Recursive subdivision

bull split by neighborhoodbull then by type bull then time

ndash years as rowsndash months as columns

bull color by price

bull neighborhood patternsndash where itrsquos expensivendash where you pay much more

for detached type

39[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull switch order of splitsndash type then neighborhood

bull switch colorndash by price variation

bull type patternsndash within specific type which

neighborhoods inconsistent

40[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull different encoding for second-level regionsndash choropleth maps

41[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

How to handle complexity 3 more strategies

42

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull reduce what is shown within single view

Reduce items and attributes

43

bull reduceincrease inversesbull filter

ndash pro straightforward and intuitivebull to understand and compute

ndash con out of sight out of mind

bull aggregationndash pro inform about whole setndash con difficult to avoid losing signal

bull not mutually exclusivendash combine filter aggregatendash combine reduce facet change derive

Reduce

Filter

Aggregate

Embed

Reducing Items and Attributes

FilterItems

Attributes

Aggregate

Items

Attributes

Idiom boxplotbull static item aggregationbull task find distributionbull data tablebull derived data

ndash 5 quant attribsbull median central linebull lower and upper quartile boxesbull lower upper fences whiskers

ndash values beyond which items are outliers

ndash outliers beyond fence cutoffs explicitly shown

44

pod and the rug plot looks like the seeds within Kampstra (2008) also suggests a way of comparing two

groups more easily use the left and right sides of the bean to display different distributions A related idea

is the raindrop plot (Barrowman and Myers 2003) but its focus is on the display of error distributions from

complex models

Figure 4 demonstrates these density boxplots applied to 100 numbers drawn from each of four distribu-

tions with mean 0 and standard deviation 1 a standard normal a skew-right distribution (Johnson distri-

bution with skewness 22 and kurtosis 13) a leptikurtic distribution (Johnson distribution with skewness 0

and kurtosis 20) and a bimodal distribution (two normals with mean -095 and 095 and standard devia-

tion 031) Richer displays of density make it much easier to see important variations in the distribution

multi-modality is particularly important and yet completely invisible with the boxplot

n s k mm

2

02

4

n s k mm

2

02

4

n s k mm

4

2

02

4

4

2

02

4

n s k mm

Figure 4 From left to right box plot vase plot violin plot and bean plot Within each plot the distributions from left to

right are standard normal (n) right-skewed (s) leptikurtic (k) and bimodal (mm) A normal kernel and bandwidth of

02 are used in all plots for all groups

A more sophisticated display is the sectioned density plot (Cohen and Cohen 2006) which uses both

colour and space to stack a density estimate into a smaller area hopefully without losing any information

(not formally verified with a perceptual study) The sectioned density plot is similar in spirit to horizon

graphs for time series (Reijner 2008) which have been found to be just as readable as regular line graphs

despite taking up much less space (Heer et al 2009) The density strips of Jackson (2008) provide a similar

compact display that uses colour instead of width to display density These methods are shown in Figure 5

6

[40 years of boxplots Wickham and Stryjewski 2012 hadconz]

Idiom Dimensionality reduction for documents

45

Task 1

InHD data

Out2D data

ProduceIn High- dimensional data

WhyWhat

Derive

In2D data

Task 2

Out 2D data

HowWhyWhat

EncodeNavigateSelect

DiscoverExploreIdentify

In 2D dataOut ScatterplotOut Clusters amp points

OutScatterplotClusters amp points

Task 3

InScatterplotClusters amp points

OutLabels for clusters

WhyWhat

ProduceAnnotate

In ScatterplotIn Clusters amp pointsOut Labels for clusters

wombat

bull attribute aggregationndash derive low-dimensional target space from high-dimensional measured space

46

Datasets

WhatAttributes

Dataset Types

Data Types

Data and Dataset Types

Tables

Attributes (columns)

Items (rows)

Cell containing value

Networks

Link

Node (item)

Trees

Fields (Continuous)

Geometry (Spatial)

Attributes (columns)

Value in cell

Cell

Multidimensional Table

Value in cell

Items Attributes Links Positions Grids

Attribute Types

Ordering Direction

Categorical

OrderedOrdinal

Quantitative

Sequential

Diverging

Cyclic

Tables Networks amp Trees

Fields Geometry Clusters Sets Lists

Items

Attributes

Items (nodes)

Links

Attributes

Grids

Positions

Attributes

Items

Positions

Items

Grid of positions

Position

Trends

Actions

Analyze

Search

Query

Why

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Network Data

Spatial Data

Topology

Paths

Extremes

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

Identify Compare Summarize

tag

Target known Target unknown

Location knownLocation unknown

Lookup

Locate

Browse

Explore

Targets

Why

What

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

algorithm

idiom

abstraction

domain

More Informationbull this talk

httpwwwcsubcca~tmmtalkshtmlvad15d3

bull book page (including tutorial lecture slides)httpwwwcsubcca~tmmvadbook

ndash 20 promo code for book+ebook combo HVN17

ndash httpwwwcrcpresscomproductisbn9781466508910

ndash illustrations Eamonn Maguire

bull papers videos software talks full courses httpwwwcsubccagroupinfovis httpwwwcsubcca~tmm

47Munzner A K Peters Visualization Series CRC Press Visualization Series 2014

Visualization Analysis and Design

tamaramunzner

Page 11: Visualization Analysis & Designii.uib.no/vis_old/publications/pdfs/2016-01-17--vad15d3unconf--edited-by-HH.pdf · Visualization is suitable when there is a need to augment human capabilities

8

Why is validation difficult

Domain situationObserve target users using existing tools

Visual encodinginteraction idiomJustify design with respect to alternatives

AlgorithmMeasure system timememoryAnalyze computational complexity

Observe target users after deployment ( )

Measure adoption

Analyze results qualitativelyMeasure human time with lab experiment (lab study)

Datatask abstraction

computer science

design

cognitive psychology

anthropologyethnography

anthropologyethnography

problem-driven work

technique-driven work

[A Nested Model of Visualization Design and Validation Munzner IEEE TVCG 15(6)921-928 2009 (Proc InfoVis 2009) ]

bull solution use methods from different fields at each level

Why analyze

bull imposes a structure on huge design spacendash scaffold to help you think

systematically about choicesndash analyzing existing as stepping stone

to designing new

Why analyze

bull imposes a structure on huge design spacendash scaffold to help you think

systematically about choicesndash analyzing existing as stepping stone

to designing new

9

[SpaceTree Supporting Exploration in Large Node Link Tree Design Evolution and Empirical Evaluation Grosjean Plaisant and Bederson Proc InfoVis 2002 p 57ndash64]

SpaceTree

[TreeJuxtaposer Scalable Tree Comparison Using Focus+Context With Guaranteed Visibility ACM Trans on Graphics (Proc SIGGRAPH) 22453ndash 462 2003]

TreeJuxtaposer

Present Locate Identify

Path between two nodes

Actions

Targets

SpaceTree

TreeJuxtaposer

Encode Navigate Select Filter AggregateTree

Arrange

Why What How

Encode Navigate Select

Datasets

WhatAttributes

Dataset Types

Data Types

Data and Dataset Types

Tables

Attributes (columns)

Items (rows)

Cell containing value

Networks

Link

Node (item)

Trees

Fields (Continuous)

Geometry (Spatial)

Attributes (columns)

Value in cell

Cell

Multidimensional Table

Value in cell

Items Attributes Links Positions Grids

Attribute Types

Ordering Direction

Categorical

OrderedOrdinal

Quantitative

Sequential

Diverging

Cyclic

Tables Networks amp Trees

Fields Geometry Clusters Sets Lists

Items

Attributes

Items (nodes)

Links

Attributes

Grids

Positions

Attributes

Items

Positions

Items

Grid of positions

Position10

Why

How

What

Dataset Availability

Static Dynamic

Types Datasets and data

11

Tables

Attributes (columns)

Items (rows)

Cell containing value

Dataset Types

Attribute TypesCategorical Ordered

Ordinal Quantitative

Networks

Link

Node (item)

Node (item)

Fields (Continuous)

Attributes (columns)

Value in cell

Cell

Grid of positions

Geometry (Spatial)

Position

SpatialNetworks

12

bull action target pairsndash discover distribution

ndash compare trends

ndash locate outliers

ndash browse topology

Trends

Actions

Analyze

Search

Query

Why

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Network Data

Spatial DataShape

Topology

Paths

Extremes

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

Identify Compare Summarize

tag

Target known Target unknown

Location knownLocation unknown

Lookup

Locate

Browse

Explore

Targets

Why

How

What

13

Actions 1 Analyzebull consume

ndashdiscover vs presentbull classic splitbull aka explore vs explain

ndashenjoybull newcomerbull aka casual social

bull producendashannotate recordndashderive

bull crucial design choice

Analyze

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

tag

Derive

bull donrsquot just draw what yoursquore givenndash decide what the right thing to show isndash create it with a series of transformations from the original datasetndash draw that

bull one of the four major strategies for handling complexity

14Original Data

exports

imports

Derived Data

trade balance = exports imports

trade balance

Analysis example Derive one attribute

15

[Using Strahler numbers for real time visual exploration of huge graphs Auber Proc Intl Conf Computer Vision and Graphics pp 56ndash69 2002]

bull Strahler numberndash centrality metric for treesnetworksndash derived quantitative attribute

ndash draw top 5K of 500K for good skeleton

Task 1

58

54

64

84

24

74

6484

84

94

74

OutQuantitative attribute on nodes

58

54

64

84

24

74

6484

84

94

74

InQuantitative attribute on nodes

Task 2

Derive

WhyWhat

In Tree ReduceSummarize

HowWhyWhat

In Quantitative attribute on nodes TopologyIn Tree

Filter

InTree

OutFiltered TreeRemoved unimportant parts

InTree +

Out Quantitative attribute on nodes Out Filtered Tree

16

Actions II Search

bull what does user knowndash target location

Search

Target known Target unknown

Location known

Location unknown

Lookup

Locate

Browse

Explore

17

Actions III Query

bull what does user knowndash target location

bull how much of the data mattersndash one some all

bull analyze search queryndash independent choices for each

Search

Query

Identify Compare Summarize

Target known Target unknown

Location known

Location unknown

Lookup

Locate

Browse

Explore

Targets

18

Trends

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Extremes

Network Data

Spatial DataShape

Topology

Paths

19

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

How to encode Arrange space map channels

20

Encode

ArrangeExpress Separate

Order Align

Use

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

Encoding visually

bull analyze idiom structure

21

22

Definitions Marks and channelsbull marks

ndash geometric primitives

bull channelsndash control appearance of marks

Horizontal

Position

Vertical Both

Color

Shape Tilt

Size

Length Area Volume

Points Lines Areas

Encoding visually with marks and channels

bull analyze idiom structurendash as combination of marks and channels

23

1 vertical position

mark line

2 vertical positionhorizontal position

mark point

3 vertical positionhorizontal positioncolor hue

mark point

4 vertical positionhorizontal positioncolor huesize (area)

mark point

24

Channels Expressiveness types and effectiveness rankingsMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

25

Channels Matching TypesMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

bull expressiveness principlendash match channel and data characteristics

26

Channels RankingsMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

bull expressiveness principlendash match channel and data characteristics

bull effectiveness principlendash encode most important attributes with

highest ranked channels

27

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

How to handle complexity 3 more strategies

28

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull change view over timebull facet across multiple

viewsbull reduce itemsattributes

within single viewbull derive new data to

show within view

How to handle complexity 3 more strategies

29

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull change over time- most obvious amp flexible

of the 4 strategies

Idiom Animated transitionsbull smooth transition from one state to another

ndash alternative to jump cutsndash support for item tracking when amount of change is limited

bull example multilevel matrix viewsndash scope of what is shown narrows down

bull middle block stretches to fill space additional structure appears withinbull other blocks squish down to increasingly aggregated representations

30[Using Multilevel Call Matrices in Large Software Projects van Ham Proc IEEE Symp Information Visualization (InfoVis) pp 227ndash232 2003]

How to handle complexity 3 more strategies

31

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull facet data across multiple views

Facet

32

Juxtapose

Partition

Superimpose

Coordinate Multiple Side By Side Views

Share Encoding SameDierent

Share Data AllSubsetNone

Share Navigation

Linked Highlighting

Idiom Linked highlighting

33

System EDVbull see how regions

contiguous in one view are distributed within anotherndash powerful and pervasive

interaction idiom

bull encoding differentndashmultiform

bull data all shared

[Visual Exploration of Large Structured Datasets Wills Proc New Techniques and Trends in Statistics (NTTS) pp 237ndash246 IOS Press 1995]

Idiom birdrsquos-eye maps

34

bull encoding samebull data subset sharedbull navigation shared

ndash bidirectional linking

bull differencesndash viewpointndash (size)

bull overview-detail

System Google Maps

[A Review of Overview+Detail Zooming and Focus+Context Interfaces Cockburn Karlson and Bederson ACM Computing Surveys 411 (2008) 1ndash31]

Idiom Small multiplesbull encoding samebull data none shared

ndash different attributes for node colors

ndash (same network layout)

bull navigation shared

35

System Cerebral

[Cerebral Visualizing Multiple Experimental Conditions on a Graph with Biological Context Barsky Munzner Gardy and Kincaid IEEE Trans Visualization and Computer Graphics (Proc InfoVis 2008) 146 (2008) 1253ndash1260]

Coordinate views Design choice interaction

36

All Subset

Same

Multiform

Multiform Overview

Detail

None

Redundant

No Linkage

Small Multiples

OverviewDetail

bull why juxtapose viewsndash benefits eyes vs memory

bull lower cognitive load to move eyes between 2 views than remembering previous state with single changing view

ndash costs display area 2 views side by side each have only half the area of one view

Partition into views

37

bull how to divide data between viewsndash encodes association between items

using spatial proximity ndash major implications for what patterns

are visiblendash split according to attributes

bull design choicesndash how many splits

bull all the way down one mark per regionbull stop earlier for more complex structure

within region

ndash order in which attribs used to splitndash how many views

Partition into Side-by-Side Views

Partitioning List alignmentbull single bar chart with grouped bars

ndash split by state into regionsbull complex glyph within each region showing all ages

ndash compare easy within state hard across ages

bull small-multiple bar chartsndash split by age into regions

bull one chart per region

ndash compare easy within age harder across states

38

110

100

90

80

70

60

50

40

30

20

10

00 CA TK NY FL IL PA

65 Years and Over45 to 64 Years25 to 44 Years18 to 24 Years14 to 17 Years5 to 13 YearsUnder 5 Years

CA TK NY FL IL PA

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

httpblocksorgmbostock3887051 httpblocksorgmbostock4679202

Partitioning Recursive subdivision

bull split by neighborhoodbull then by type bull then time

ndash years as rowsndash months as columns

bull color by price

bull neighborhood patternsndash where itrsquos expensivendash where you pay much more

for detached type

39[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull switch order of splitsndash type then neighborhood

bull switch colorndash by price variation

bull type patternsndash within specific type which

neighborhoods inconsistent

40[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull different encoding for second-level regionsndash choropleth maps

41[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

How to handle complexity 3 more strategies

42

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull reduce what is shown within single view

Reduce items and attributes

43

bull reduceincrease inversesbull filter

ndash pro straightforward and intuitivebull to understand and compute

ndash con out of sight out of mind

bull aggregationndash pro inform about whole setndash con difficult to avoid losing signal

bull not mutually exclusivendash combine filter aggregatendash combine reduce facet change derive

Reduce

Filter

Aggregate

Embed

Reducing Items and Attributes

FilterItems

Attributes

Aggregate

Items

Attributes

Idiom boxplotbull static item aggregationbull task find distributionbull data tablebull derived data

ndash 5 quant attribsbull median central linebull lower and upper quartile boxesbull lower upper fences whiskers

ndash values beyond which items are outliers

ndash outliers beyond fence cutoffs explicitly shown

44

pod and the rug plot looks like the seeds within Kampstra (2008) also suggests a way of comparing two

groups more easily use the left and right sides of the bean to display different distributions A related idea

is the raindrop plot (Barrowman and Myers 2003) but its focus is on the display of error distributions from

complex models

Figure 4 demonstrates these density boxplots applied to 100 numbers drawn from each of four distribu-

tions with mean 0 and standard deviation 1 a standard normal a skew-right distribution (Johnson distri-

bution with skewness 22 and kurtosis 13) a leptikurtic distribution (Johnson distribution with skewness 0

and kurtosis 20) and a bimodal distribution (two normals with mean -095 and 095 and standard devia-

tion 031) Richer displays of density make it much easier to see important variations in the distribution

multi-modality is particularly important and yet completely invisible with the boxplot

n s k mm

2

02

4

n s k mm

2

02

4

n s k mm

4

2

02

4

4

2

02

4

n s k mm

Figure 4 From left to right box plot vase plot violin plot and bean plot Within each plot the distributions from left to

right are standard normal (n) right-skewed (s) leptikurtic (k) and bimodal (mm) A normal kernel and bandwidth of

02 are used in all plots for all groups

A more sophisticated display is the sectioned density plot (Cohen and Cohen 2006) which uses both

colour and space to stack a density estimate into a smaller area hopefully without losing any information

(not formally verified with a perceptual study) The sectioned density plot is similar in spirit to horizon

graphs for time series (Reijner 2008) which have been found to be just as readable as regular line graphs

despite taking up much less space (Heer et al 2009) The density strips of Jackson (2008) provide a similar

compact display that uses colour instead of width to display density These methods are shown in Figure 5

6

[40 years of boxplots Wickham and Stryjewski 2012 hadconz]

Idiom Dimensionality reduction for documents

45

Task 1

InHD data

Out2D data

ProduceIn High- dimensional data

WhyWhat

Derive

In2D data

Task 2

Out 2D data

HowWhyWhat

EncodeNavigateSelect

DiscoverExploreIdentify

In 2D dataOut ScatterplotOut Clusters amp points

OutScatterplotClusters amp points

Task 3

InScatterplotClusters amp points

OutLabels for clusters

WhyWhat

ProduceAnnotate

In ScatterplotIn Clusters amp pointsOut Labels for clusters

wombat

bull attribute aggregationndash derive low-dimensional target space from high-dimensional measured space

46

Datasets

WhatAttributes

Dataset Types

Data Types

Data and Dataset Types

Tables

Attributes (columns)

Items (rows)

Cell containing value

Networks

Link

Node (item)

Trees

Fields (Continuous)

Geometry (Spatial)

Attributes (columns)

Value in cell

Cell

Multidimensional Table

Value in cell

Items Attributes Links Positions Grids

Attribute Types

Ordering Direction

Categorical

OrderedOrdinal

Quantitative

Sequential

Diverging

Cyclic

Tables Networks amp Trees

Fields Geometry Clusters Sets Lists

Items

Attributes

Items (nodes)

Links

Attributes

Grids

Positions

Attributes

Items

Positions

Items

Grid of positions

Position

Trends

Actions

Analyze

Search

Query

Why

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Network Data

Spatial Data

Topology

Paths

Extremes

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

Identify Compare Summarize

tag

Target known Target unknown

Location knownLocation unknown

Lookup

Locate

Browse

Explore

Targets

Why

What

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

algorithm

idiom

abstraction

domain

More Informationbull this talk

httpwwwcsubcca~tmmtalkshtmlvad15d3

bull book page (including tutorial lecture slides)httpwwwcsubcca~tmmvadbook

ndash 20 promo code for book+ebook combo HVN17

ndash httpwwwcrcpresscomproductisbn9781466508910

ndash illustrations Eamonn Maguire

bull papers videos software talks full courses httpwwwcsubccagroupinfovis httpwwwcsubcca~tmm

47Munzner A K Peters Visualization Series CRC Press Visualization Series 2014

Visualization Analysis and Design

tamaramunzner

Page 12: Visualization Analysis & Designii.uib.no/vis_old/publications/pdfs/2016-01-17--vad15d3unconf--edited-by-HH.pdf · Visualization is suitable when there is a need to augment human capabilities

Why analyze

bull imposes a structure on huge design spacendash scaffold to help you think

systematically about choicesndash analyzing existing as stepping stone

to designing new

Why analyze

bull imposes a structure on huge design spacendash scaffold to help you think

systematically about choicesndash analyzing existing as stepping stone

to designing new

9

[SpaceTree Supporting Exploration in Large Node Link Tree Design Evolution and Empirical Evaluation Grosjean Plaisant and Bederson Proc InfoVis 2002 p 57ndash64]

SpaceTree

[TreeJuxtaposer Scalable Tree Comparison Using Focus+Context With Guaranteed Visibility ACM Trans on Graphics (Proc SIGGRAPH) 22453ndash 462 2003]

TreeJuxtaposer

Present Locate Identify

Path between two nodes

Actions

Targets

SpaceTree

TreeJuxtaposer

Encode Navigate Select Filter AggregateTree

Arrange

Why What How

Encode Navigate Select

Datasets

WhatAttributes

Dataset Types

Data Types

Data and Dataset Types

Tables

Attributes (columns)

Items (rows)

Cell containing value

Networks

Link

Node (item)

Trees

Fields (Continuous)

Geometry (Spatial)

Attributes (columns)

Value in cell

Cell

Multidimensional Table

Value in cell

Items Attributes Links Positions Grids

Attribute Types

Ordering Direction

Categorical

OrderedOrdinal

Quantitative

Sequential

Diverging

Cyclic

Tables Networks amp Trees

Fields Geometry Clusters Sets Lists

Items

Attributes

Items (nodes)

Links

Attributes

Grids

Positions

Attributes

Items

Positions

Items

Grid of positions

Position10

Why

How

What

Dataset Availability

Static Dynamic

Types Datasets and data

11

Tables

Attributes (columns)

Items (rows)

Cell containing value

Dataset Types

Attribute TypesCategorical Ordered

Ordinal Quantitative

Networks

Link

Node (item)

Node (item)

Fields (Continuous)

Attributes (columns)

Value in cell

Cell

Grid of positions

Geometry (Spatial)

Position

SpatialNetworks

12

bull action target pairsndash discover distribution

ndash compare trends

ndash locate outliers

ndash browse topology

Trends

Actions

Analyze

Search

Query

Why

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Network Data

Spatial DataShape

Topology

Paths

Extremes

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

Identify Compare Summarize

tag

Target known Target unknown

Location knownLocation unknown

Lookup

Locate

Browse

Explore

Targets

Why

How

What

13

Actions 1 Analyzebull consume

ndashdiscover vs presentbull classic splitbull aka explore vs explain

ndashenjoybull newcomerbull aka casual social

bull producendashannotate recordndashderive

bull crucial design choice

Analyze

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

tag

Derive

bull donrsquot just draw what yoursquore givenndash decide what the right thing to show isndash create it with a series of transformations from the original datasetndash draw that

bull one of the four major strategies for handling complexity

14Original Data

exports

imports

Derived Data

trade balance = exports imports

trade balance

Analysis example Derive one attribute

15

[Using Strahler numbers for real time visual exploration of huge graphs Auber Proc Intl Conf Computer Vision and Graphics pp 56ndash69 2002]

bull Strahler numberndash centrality metric for treesnetworksndash derived quantitative attribute

ndash draw top 5K of 500K for good skeleton

Task 1

58

54

64

84

24

74

6484

84

94

74

OutQuantitative attribute on nodes

58

54

64

84

24

74

6484

84

94

74

InQuantitative attribute on nodes

Task 2

Derive

WhyWhat

In Tree ReduceSummarize

HowWhyWhat

In Quantitative attribute on nodes TopologyIn Tree

Filter

InTree

OutFiltered TreeRemoved unimportant parts

InTree +

Out Quantitative attribute on nodes Out Filtered Tree

16

Actions II Search

bull what does user knowndash target location

Search

Target known Target unknown

Location known

Location unknown

Lookup

Locate

Browse

Explore

17

Actions III Query

bull what does user knowndash target location

bull how much of the data mattersndash one some all

bull analyze search queryndash independent choices for each

Search

Query

Identify Compare Summarize

Target known Target unknown

Location known

Location unknown

Lookup

Locate

Browse

Explore

Targets

18

Trends

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Extremes

Network Data

Spatial DataShape

Topology

Paths

19

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

How to encode Arrange space map channels

20

Encode

ArrangeExpress Separate

Order Align

Use

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

Encoding visually

bull analyze idiom structure

21

22

Definitions Marks and channelsbull marks

ndash geometric primitives

bull channelsndash control appearance of marks

Horizontal

Position

Vertical Both

Color

Shape Tilt

Size

Length Area Volume

Points Lines Areas

Encoding visually with marks and channels

bull analyze idiom structurendash as combination of marks and channels

23

1 vertical position

mark line

2 vertical positionhorizontal position

mark point

3 vertical positionhorizontal positioncolor hue

mark point

4 vertical positionhorizontal positioncolor huesize (area)

mark point

24

Channels Expressiveness types and effectiveness rankingsMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

25

Channels Matching TypesMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

bull expressiveness principlendash match channel and data characteristics

26

Channels RankingsMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

bull expressiveness principlendash match channel and data characteristics

bull effectiveness principlendash encode most important attributes with

highest ranked channels

27

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

How to handle complexity 3 more strategies

28

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull change view over timebull facet across multiple

viewsbull reduce itemsattributes

within single viewbull derive new data to

show within view

How to handle complexity 3 more strategies

29

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull change over time- most obvious amp flexible

of the 4 strategies

Idiom Animated transitionsbull smooth transition from one state to another

ndash alternative to jump cutsndash support for item tracking when amount of change is limited

bull example multilevel matrix viewsndash scope of what is shown narrows down

bull middle block stretches to fill space additional structure appears withinbull other blocks squish down to increasingly aggregated representations

30[Using Multilevel Call Matrices in Large Software Projects van Ham Proc IEEE Symp Information Visualization (InfoVis) pp 227ndash232 2003]

How to handle complexity 3 more strategies

31

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull facet data across multiple views

Facet

32

Juxtapose

Partition

Superimpose

Coordinate Multiple Side By Side Views

Share Encoding SameDierent

Share Data AllSubsetNone

Share Navigation

Linked Highlighting

Idiom Linked highlighting

33

System EDVbull see how regions

contiguous in one view are distributed within anotherndash powerful and pervasive

interaction idiom

bull encoding differentndashmultiform

bull data all shared

[Visual Exploration of Large Structured Datasets Wills Proc New Techniques and Trends in Statistics (NTTS) pp 237ndash246 IOS Press 1995]

Idiom birdrsquos-eye maps

34

bull encoding samebull data subset sharedbull navigation shared

ndash bidirectional linking

bull differencesndash viewpointndash (size)

bull overview-detail

System Google Maps

[A Review of Overview+Detail Zooming and Focus+Context Interfaces Cockburn Karlson and Bederson ACM Computing Surveys 411 (2008) 1ndash31]

Idiom Small multiplesbull encoding samebull data none shared

ndash different attributes for node colors

ndash (same network layout)

bull navigation shared

35

System Cerebral

[Cerebral Visualizing Multiple Experimental Conditions on a Graph with Biological Context Barsky Munzner Gardy and Kincaid IEEE Trans Visualization and Computer Graphics (Proc InfoVis 2008) 146 (2008) 1253ndash1260]

Coordinate views Design choice interaction

36

All Subset

Same

Multiform

Multiform Overview

Detail

None

Redundant

No Linkage

Small Multiples

OverviewDetail

bull why juxtapose viewsndash benefits eyes vs memory

bull lower cognitive load to move eyes between 2 views than remembering previous state with single changing view

ndash costs display area 2 views side by side each have only half the area of one view

Partition into views

37

bull how to divide data between viewsndash encodes association between items

using spatial proximity ndash major implications for what patterns

are visiblendash split according to attributes

bull design choicesndash how many splits

bull all the way down one mark per regionbull stop earlier for more complex structure

within region

ndash order in which attribs used to splitndash how many views

Partition into Side-by-Side Views

Partitioning List alignmentbull single bar chart with grouped bars

ndash split by state into regionsbull complex glyph within each region showing all ages

ndash compare easy within state hard across ages

bull small-multiple bar chartsndash split by age into regions

bull one chart per region

ndash compare easy within age harder across states

38

110

100

90

80

70

60

50

40

30

20

10

00 CA TK NY FL IL PA

65 Years and Over45 to 64 Years25 to 44 Years18 to 24 Years14 to 17 Years5 to 13 YearsUnder 5 Years

CA TK NY FL IL PA

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

httpblocksorgmbostock3887051 httpblocksorgmbostock4679202

Partitioning Recursive subdivision

bull split by neighborhoodbull then by type bull then time

ndash years as rowsndash months as columns

bull color by price

bull neighborhood patternsndash where itrsquos expensivendash where you pay much more

for detached type

39[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull switch order of splitsndash type then neighborhood

bull switch colorndash by price variation

bull type patternsndash within specific type which

neighborhoods inconsistent

40[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull different encoding for second-level regionsndash choropleth maps

41[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

How to handle complexity 3 more strategies

42

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull reduce what is shown within single view

Reduce items and attributes

43

bull reduceincrease inversesbull filter

ndash pro straightforward and intuitivebull to understand and compute

ndash con out of sight out of mind

bull aggregationndash pro inform about whole setndash con difficult to avoid losing signal

bull not mutually exclusivendash combine filter aggregatendash combine reduce facet change derive

Reduce

Filter

Aggregate

Embed

Reducing Items and Attributes

FilterItems

Attributes

Aggregate

Items

Attributes

Idiom boxplotbull static item aggregationbull task find distributionbull data tablebull derived data

ndash 5 quant attribsbull median central linebull lower and upper quartile boxesbull lower upper fences whiskers

ndash values beyond which items are outliers

ndash outliers beyond fence cutoffs explicitly shown

44

pod and the rug plot looks like the seeds within Kampstra (2008) also suggests a way of comparing two

groups more easily use the left and right sides of the bean to display different distributions A related idea

is the raindrop plot (Barrowman and Myers 2003) but its focus is on the display of error distributions from

complex models

Figure 4 demonstrates these density boxplots applied to 100 numbers drawn from each of four distribu-

tions with mean 0 and standard deviation 1 a standard normal a skew-right distribution (Johnson distri-

bution with skewness 22 and kurtosis 13) a leptikurtic distribution (Johnson distribution with skewness 0

and kurtosis 20) and a bimodal distribution (two normals with mean -095 and 095 and standard devia-

tion 031) Richer displays of density make it much easier to see important variations in the distribution

multi-modality is particularly important and yet completely invisible with the boxplot

n s k mm

2

02

4

n s k mm

2

02

4

n s k mm

4

2

02

4

4

2

02

4

n s k mm

Figure 4 From left to right box plot vase plot violin plot and bean plot Within each plot the distributions from left to

right are standard normal (n) right-skewed (s) leptikurtic (k) and bimodal (mm) A normal kernel and bandwidth of

02 are used in all plots for all groups

A more sophisticated display is the sectioned density plot (Cohen and Cohen 2006) which uses both

colour and space to stack a density estimate into a smaller area hopefully without losing any information

(not formally verified with a perceptual study) The sectioned density plot is similar in spirit to horizon

graphs for time series (Reijner 2008) which have been found to be just as readable as regular line graphs

despite taking up much less space (Heer et al 2009) The density strips of Jackson (2008) provide a similar

compact display that uses colour instead of width to display density These methods are shown in Figure 5

6

[40 years of boxplots Wickham and Stryjewski 2012 hadconz]

Idiom Dimensionality reduction for documents

45

Task 1

InHD data

Out2D data

ProduceIn High- dimensional data

WhyWhat

Derive

In2D data

Task 2

Out 2D data

HowWhyWhat

EncodeNavigateSelect

DiscoverExploreIdentify

In 2D dataOut ScatterplotOut Clusters amp points

OutScatterplotClusters amp points

Task 3

InScatterplotClusters amp points

OutLabels for clusters

WhyWhat

ProduceAnnotate

In ScatterplotIn Clusters amp pointsOut Labels for clusters

wombat

bull attribute aggregationndash derive low-dimensional target space from high-dimensional measured space

46

Datasets

WhatAttributes

Dataset Types

Data Types

Data and Dataset Types

Tables

Attributes (columns)

Items (rows)

Cell containing value

Networks

Link

Node (item)

Trees

Fields (Continuous)

Geometry (Spatial)

Attributes (columns)

Value in cell

Cell

Multidimensional Table

Value in cell

Items Attributes Links Positions Grids

Attribute Types

Ordering Direction

Categorical

OrderedOrdinal

Quantitative

Sequential

Diverging

Cyclic

Tables Networks amp Trees

Fields Geometry Clusters Sets Lists

Items

Attributes

Items (nodes)

Links

Attributes

Grids

Positions

Attributes

Items

Positions

Items

Grid of positions

Position

Trends

Actions

Analyze

Search

Query

Why

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Network Data

Spatial Data

Topology

Paths

Extremes

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

Identify Compare Summarize

tag

Target known Target unknown

Location knownLocation unknown

Lookup

Locate

Browse

Explore

Targets

Why

What

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

algorithm

idiom

abstraction

domain

More Informationbull this talk

httpwwwcsubcca~tmmtalkshtmlvad15d3

bull book page (including tutorial lecture slides)httpwwwcsubcca~tmmvadbook

ndash 20 promo code for book+ebook combo HVN17

ndash httpwwwcrcpresscomproductisbn9781466508910

ndash illustrations Eamonn Maguire

bull papers videos software talks full courses httpwwwcsubccagroupinfovis httpwwwcsubcca~tmm

47Munzner A K Peters Visualization Series CRC Press Visualization Series 2014

Visualization Analysis and Design

tamaramunzner

Page 13: Visualization Analysis & Designii.uib.no/vis_old/publications/pdfs/2016-01-17--vad15d3unconf--edited-by-HH.pdf · Visualization is suitable when there is a need to augment human capabilities

Why analyze

bull imposes a structure on huge design spacendash scaffold to help you think

systematically about choicesndash analyzing existing as stepping stone

to designing new

9

[SpaceTree Supporting Exploration in Large Node Link Tree Design Evolution and Empirical Evaluation Grosjean Plaisant and Bederson Proc InfoVis 2002 p 57ndash64]

SpaceTree

[TreeJuxtaposer Scalable Tree Comparison Using Focus+Context With Guaranteed Visibility ACM Trans on Graphics (Proc SIGGRAPH) 22453ndash 462 2003]

TreeJuxtaposer

Present Locate Identify

Path between two nodes

Actions

Targets

SpaceTree

TreeJuxtaposer

Encode Navigate Select Filter AggregateTree

Arrange

Why What How

Encode Navigate Select

Datasets

WhatAttributes

Dataset Types

Data Types

Data and Dataset Types

Tables

Attributes (columns)

Items (rows)

Cell containing value

Networks

Link

Node (item)

Trees

Fields (Continuous)

Geometry (Spatial)

Attributes (columns)

Value in cell

Cell

Multidimensional Table

Value in cell

Items Attributes Links Positions Grids

Attribute Types

Ordering Direction

Categorical

OrderedOrdinal

Quantitative

Sequential

Diverging

Cyclic

Tables Networks amp Trees

Fields Geometry Clusters Sets Lists

Items

Attributes

Items (nodes)

Links

Attributes

Grids

Positions

Attributes

Items

Positions

Items

Grid of positions

Position10

Why

How

What

Dataset Availability

Static Dynamic

Types Datasets and data

11

Tables

Attributes (columns)

Items (rows)

Cell containing value

Dataset Types

Attribute TypesCategorical Ordered

Ordinal Quantitative

Networks

Link

Node (item)

Node (item)

Fields (Continuous)

Attributes (columns)

Value in cell

Cell

Grid of positions

Geometry (Spatial)

Position

SpatialNetworks

12

bull action target pairsndash discover distribution

ndash compare trends

ndash locate outliers

ndash browse topology

Trends

Actions

Analyze

Search

Query

Why

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Network Data

Spatial DataShape

Topology

Paths

Extremes

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

Identify Compare Summarize

tag

Target known Target unknown

Location knownLocation unknown

Lookup

Locate

Browse

Explore

Targets

Why

How

What

13

Actions 1 Analyzebull consume

ndashdiscover vs presentbull classic splitbull aka explore vs explain

ndashenjoybull newcomerbull aka casual social

bull producendashannotate recordndashderive

bull crucial design choice

Analyze

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

tag

Derive

bull donrsquot just draw what yoursquore givenndash decide what the right thing to show isndash create it with a series of transformations from the original datasetndash draw that

bull one of the four major strategies for handling complexity

14Original Data

exports

imports

Derived Data

trade balance = exports imports

trade balance

Analysis example Derive one attribute

15

[Using Strahler numbers for real time visual exploration of huge graphs Auber Proc Intl Conf Computer Vision and Graphics pp 56ndash69 2002]

bull Strahler numberndash centrality metric for treesnetworksndash derived quantitative attribute

ndash draw top 5K of 500K for good skeleton

Task 1

58

54

64

84

24

74

6484

84

94

74

OutQuantitative attribute on nodes

58

54

64

84

24

74

6484

84

94

74

InQuantitative attribute on nodes

Task 2

Derive

WhyWhat

In Tree ReduceSummarize

HowWhyWhat

In Quantitative attribute on nodes TopologyIn Tree

Filter

InTree

OutFiltered TreeRemoved unimportant parts

InTree +

Out Quantitative attribute on nodes Out Filtered Tree

16

Actions II Search

bull what does user knowndash target location

Search

Target known Target unknown

Location known

Location unknown

Lookup

Locate

Browse

Explore

17

Actions III Query

bull what does user knowndash target location

bull how much of the data mattersndash one some all

bull analyze search queryndash independent choices for each

Search

Query

Identify Compare Summarize

Target known Target unknown

Location known

Location unknown

Lookup

Locate

Browse

Explore

Targets

18

Trends

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Extremes

Network Data

Spatial DataShape

Topology

Paths

19

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

How to encode Arrange space map channels

20

Encode

ArrangeExpress Separate

Order Align

Use

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

Encoding visually

bull analyze idiom structure

21

22

Definitions Marks and channelsbull marks

ndash geometric primitives

bull channelsndash control appearance of marks

Horizontal

Position

Vertical Both

Color

Shape Tilt

Size

Length Area Volume

Points Lines Areas

Encoding visually with marks and channels

bull analyze idiom structurendash as combination of marks and channels

23

1 vertical position

mark line

2 vertical positionhorizontal position

mark point

3 vertical positionhorizontal positioncolor hue

mark point

4 vertical positionhorizontal positioncolor huesize (area)

mark point

24

Channels Expressiveness types and effectiveness rankingsMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

25

Channels Matching TypesMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

bull expressiveness principlendash match channel and data characteristics

26

Channels RankingsMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

bull expressiveness principlendash match channel and data characteristics

bull effectiveness principlendash encode most important attributes with

highest ranked channels

27

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

How to handle complexity 3 more strategies

28

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull change view over timebull facet across multiple

viewsbull reduce itemsattributes

within single viewbull derive new data to

show within view

How to handle complexity 3 more strategies

29

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull change over time- most obvious amp flexible

of the 4 strategies

Idiom Animated transitionsbull smooth transition from one state to another

ndash alternative to jump cutsndash support for item tracking when amount of change is limited

bull example multilevel matrix viewsndash scope of what is shown narrows down

bull middle block stretches to fill space additional structure appears withinbull other blocks squish down to increasingly aggregated representations

30[Using Multilevel Call Matrices in Large Software Projects van Ham Proc IEEE Symp Information Visualization (InfoVis) pp 227ndash232 2003]

How to handle complexity 3 more strategies

31

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull facet data across multiple views

Facet

32

Juxtapose

Partition

Superimpose

Coordinate Multiple Side By Side Views

Share Encoding SameDierent

Share Data AllSubsetNone

Share Navigation

Linked Highlighting

Idiom Linked highlighting

33

System EDVbull see how regions

contiguous in one view are distributed within anotherndash powerful and pervasive

interaction idiom

bull encoding differentndashmultiform

bull data all shared

[Visual Exploration of Large Structured Datasets Wills Proc New Techniques and Trends in Statistics (NTTS) pp 237ndash246 IOS Press 1995]

Idiom birdrsquos-eye maps

34

bull encoding samebull data subset sharedbull navigation shared

ndash bidirectional linking

bull differencesndash viewpointndash (size)

bull overview-detail

System Google Maps

[A Review of Overview+Detail Zooming and Focus+Context Interfaces Cockburn Karlson and Bederson ACM Computing Surveys 411 (2008) 1ndash31]

Idiom Small multiplesbull encoding samebull data none shared

ndash different attributes for node colors

ndash (same network layout)

bull navigation shared

35

System Cerebral

[Cerebral Visualizing Multiple Experimental Conditions on a Graph with Biological Context Barsky Munzner Gardy and Kincaid IEEE Trans Visualization and Computer Graphics (Proc InfoVis 2008) 146 (2008) 1253ndash1260]

Coordinate views Design choice interaction

36

All Subset

Same

Multiform

Multiform Overview

Detail

None

Redundant

No Linkage

Small Multiples

OverviewDetail

bull why juxtapose viewsndash benefits eyes vs memory

bull lower cognitive load to move eyes between 2 views than remembering previous state with single changing view

ndash costs display area 2 views side by side each have only half the area of one view

Partition into views

37

bull how to divide data between viewsndash encodes association between items

using spatial proximity ndash major implications for what patterns

are visiblendash split according to attributes

bull design choicesndash how many splits

bull all the way down one mark per regionbull stop earlier for more complex structure

within region

ndash order in which attribs used to splitndash how many views

Partition into Side-by-Side Views

Partitioning List alignmentbull single bar chart with grouped bars

ndash split by state into regionsbull complex glyph within each region showing all ages

ndash compare easy within state hard across ages

bull small-multiple bar chartsndash split by age into regions

bull one chart per region

ndash compare easy within age harder across states

38

110

100

90

80

70

60

50

40

30

20

10

00 CA TK NY FL IL PA

65 Years and Over45 to 64 Years25 to 44 Years18 to 24 Years14 to 17 Years5 to 13 YearsUnder 5 Years

CA TK NY FL IL PA

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

httpblocksorgmbostock3887051 httpblocksorgmbostock4679202

Partitioning Recursive subdivision

bull split by neighborhoodbull then by type bull then time

ndash years as rowsndash months as columns

bull color by price

bull neighborhood patternsndash where itrsquos expensivendash where you pay much more

for detached type

39[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull switch order of splitsndash type then neighborhood

bull switch colorndash by price variation

bull type patternsndash within specific type which

neighborhoods inconsistent

40[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull different encoding for second-level regionsndash choropleth maps

41[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

How to handle complexity 3 more strategies

42

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull reduce what is shown within single view

Reduce items and attributes

43

bull reduceincrease inversesbull filter

ndash pro straightforward and intuitivebull to understand and compute

ndash con out of sight out of mind

bull aggregationndash pro inform about whole setndash con difficult to avoid losing signal

bull not mutually exclusivendash combine filter aggregatendash combine reduce facet change derive

Reduce

Filter

Aggregate

Embed

Reducing Items and Attributes

FilterItems

Attributes

Aggregate

Items

Attributes

Idiom boxplotbull static item aggregationbull task find distributionbull data tablebull derived data

ndash 5 quant attribsbull median central linebull lower and upper quartile boxesbull lower upper fences whiskers

ndash values beyond which items are outliers

ndash outliers beyond fence cutoffs explicitly shown

44

pod and the rug plot looks like the seeds within Kampstra (2008) also suggests a way of comparing two

groups more easily use the left and right sides of the bean to display different distributions A related idea

is the raindrop plot (Barrowman and Myers 2003) but its focus is on the display of error distributions from

complex models

Figure 4 demonstrates these density boxplots applied to 100 numbers drawn from each of four distribu-

tions with mean 0 and standard deviation 1 a standard normal a skew-right distribution (Johnson distri-

bution with skewness 22 and kurtosis 13) a leptikurtic distribution (Johnson distribution with skewness 0

and kurtosis 20) and a bimodal distribution (two normals with mean -095 and 095 and standard devia-

tion 031) Richer displays of density make it much easier to see important variations in the distribution

multi-modality is particularly important and yet completely invisible with the boxplot

n s k mm

2

02

4

n s k mm

2

02

4

n s k mm

4

2

02

4

4

2

02

4

n s k mm

Figure 4 From left to right box plot vase plot violin plot and bean plot Within each plot the distributions from left to

right are standard normal (n) right-skewed (s) leptikurtic (k) and bimodal (mm) A normal kernel and bandwidth of

02 are used in all plots for all groups

A more sophisticated display is the sectioned density plot (Cohen and Cohen 2006) which uses both

colour and space to stack a density estimate into a smaller area hopefully without losing any information

(not formally verified with a perceptual study) The sectioned density plot is similar in spirit to horizon

graphs for time series (Reijner 2008) which have been found to be just as readable as regular line graphs

despite taking up much less space (Heer et al 2009) The density strips of Jackson (2008) provide a similar

compact display that uses colour instead of width to display density These methods are shown in Figure 5

6

[40 years of boxplots Wickham and Stryjewski 2012 hadconz]

Idiom Dimensionality reduction for documents

45

Task 1

InHD data

Out2D data

ProduceIn High- dimensional data

WhyWhat

Derive

In2D data

Task 2

Out 2D data

HowWhyWhat

EncodeNavigateSelect

DiscoverExploreIdentify

In 2D dataOut ScatterplotOut Clusters amp points

OutScatterplotClusters amp points

Task 3

InScatterplotClusters amp points

OutLabels for clusters

WhyWhat

ProduceAnnotate

In ScatterplotIn Clusters amp pointsOut Labels for clusters

wombat

bull attribute aggregationndash derive low-dimensional target space from high-dimensional measured space

46

Datasets

WhatAttributes

Dataset Types

Data Types

Data and Dataset Types

Tables

Attributes (columns)

Items (rows)

Cell containing value

Networks

Link

Node (item)

Trees

Fields (Continuous)

Geometry (Spatial)

Attributes (columns)

Value in cell

Cell

Multidimensional Table

Value in cell

Items Attributes Links Positions Grids

Attribute Types

Ordering Direction

Categorical

OrderedOrdinal

Quantitative

Sequential

Diverging

Cyclic

Tables Networks amp Trees

Fields Geometry Clusters Sets Lists

Items

Attributes

Items (nodes)

Links

Attributes

Grids

Positions

Attributes

Items

Positions

Items

Grid of positions

Position

Trends

Actions

Analyze

Search

Query

Why

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Network Data

Spatial Data

Topology

Paths

Extremes

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

Identify Compare Summarize

tag

Target known Target unknown

Location knownLocation unknown

Lookup

Locate

Browse

Explore

Targets

Why

What

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

algorithm

idiom

abstraction

domain

More Informationbull this talk

httpwwwcsubcca~tmmtalkshtmlvad15d3

bull book page (including tutorial lecture slides)httpwwwcsubcca~tmmvadbook

ndash 20 promo code for book+ebook combo HVN17

ndash httpwwwcrcpresscomproductisbn9781466508910

ndash illustrations Eamonn Maguire

bull papers videos software talks full courses httpwwwcsubccagroupinfovis httpwwwcsubcca~tmm

47Munzner A K Peters Visualization Series CRC Press Visualization Series 2014

Visualization Analysis and Design

tamaramunzner

Page 14: Visualization Analysis & Designii.uib.no/vis_old/publications/pdfs/2016-01-17--vad15d3unconf--edited-by-HH.pdf · Visualization is suitable when there is a need to augment human capabilities

Datasets

WhatAttributes

Dataset Types

Data Types

Data and Dataset Types

Tables

Attributes (columns)

Items (rows)

Cell containing value

Networks

Link

Node (item)

Trees

Fields (Continuous)

Geometry (Spatial)

Attributes (columns)

Value in cell

Cell

Multidimensional Table

Value in cell

Items Attributes Links Positions Grids

Attribute Types

Ordering Direction

Categorical

OrderedOrdinal

Quantitative

Sequential

Diverging

Cyclic

Tables Networks amp Trees

Fields Geometry Clusters Sets Lists

Items

Attributes

Items (nodes)

Links

Attributes

Grids

Positions

Attributes

Items

Positions

Items

Grid of positions

Position10

Why

How

What

Dataset Availability

Static Dynamic

Types Datasets and data

11

Tables

Attributes (columns)

Items (rows)

Cell containing value

Dataset Types

Attribute TypesCategorical Ordered

Ordinal Quantitative

Networks

Link

Node (item)

Node (item)

Fields (Continuous)

Attributes (columns)

Value in cell

Cell

Grid of positions

Geometry (Spatial)

Position

SpatialNetworks

12

bull action target pairsndash discover distribution

ndash compare trends

ndash locate outliers

ndash browse topology

Trends

Actions

Analyze

Search

Query

Why

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Network Data

Spatial DataShape

Topology

Paths

Extremes

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

Identify Compare Summarize

tag

Target known Target unknown

Location knownLocation unknown

Lookup

Locate

Browse

Explore

Targets

Why

How

What

13

Actions 1 Analyzebull consume

ndashdiscover vs presentbull classic splitbull aka explore vs explain

ndashenjoybull newcomerbull aka casual social

bull producendashannotate recordndashderive

bull crucial design choice

Analyze

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

tag

Derive

bull donrsquot just draw what yoursquore givenndash decide what the right thing to show isndash create it with a series of transformations from the original datasetndash draw that

bull one of the four major strategies for handling complexity

14Original Data

exports

imports

Derived Data

trade balance = exports imports

trade balance

Analysis example Derive one attribute

15

[Using Strahler numbers for real time visual exploration of huge graphs Auber Proc Intl Conf Computer Vision and Graphics pp 56ndash69 2002]

bull Strahler numberndash centrality metric for treesnetworksndash derived quantitative attribute

ndash draw top 5K of 500K for good skeleton

Task 1

58

54

64

84

24

74

6484

84

94

74

OutQuantitative attribute on nodes

58

54

64

84

24

74

6484

84

94

74

InQuantitative attribute on nodes

Task 2

Derive

WhyWhat

In Tree ReduceSummarize

HowWhyWhat

In Quantitative attribute on nodes TopologyIn Tree

Filter

InTree

OutFiltered TreeRemoved unimportant parts

InTree +

Out Quantitative attribute on nodes Out Filtered Tree

16

Actions II Search

bull what does user knowndash target location

Search

Target known Target unknown

Location known

Location unknown

Lookup

Locate

Browse

Explore

17

Actions III Query

bull what does user knowndash target location

bull how much of the data mattersndash one some all

bull analyze search queryndash independent choices for each

Search

Query

Identify Compare Summarize

Target known Target unknown

Location known

Location unknown

Lookup

Locate

Browse

Explore

Targets

18

Trends

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Extremes

Network Data

Spatial DataShape

Topology

Paths

19

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

How to encode Arrange space map channels

20

Encode

ArrangeExpress Separate

Order Align

Use

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

Encoding visually

bull analyze idiom structure

21

22

Definitions Marks and channelsbull marks

ndash geometric primitives

bull channelsndash control appearance of marks

Horizontal

Position

Vertical Both

Color

Shape Tilt

Size

Length Area Volume

Points Lines Areas

Encoding visually with marks and channels

bull analyze idiom structurendash as combination of marks and channels

23

1 vertical position

mark line

2 vertical positionhorizontal position

mark point

3 vertical positionhorizontal positioncolor hue

mark point

4 vertical positionhorizontal positioncolor huesize (area)

mark point

24

Channels Expressiveness types and effectiveness rankingsMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

25

Channels Matching TypesMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

bull expressiveness principlendash match channel and data characteristics

26

Channels RankingsMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

bull expressiveness principlendash match channel and data characteristics

bull effectiveness principlendash encode most important attributes with

highest ranked channels

27

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

How to handle complexity 3 more strategies

28

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull change view over timebull facet across multiple

viewsbull reduce itemsattributes

within single viewbull derive new data to

show within view

How to handle complexity 3 more strategies

29

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull change over time- most obvious amp flexible

of the 4 strategies

Idiom Animated transitionsbull smooth transition from one state to another

ndash alternative to jump cutsndash support for item tracking when amount of change is limited

bull example multilevel matrix viewsndash scope of what is shown narrows down

bull middle block stretches to fill space additional structure appears withinbull other blocks squish down to increasingly aggregated representations

30[Using Multilevel Call Matrices in Large Software Projects van Ham Proc IEEE Symp Information Visualization (InfoVis) pp 227ndash232 2003]

How to handle complexity 3 more strategies

31

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull facet data across multiple views

Facet

32

Juxtapose

Partition

Superimpose

Coordinate Multiple Side By Side Views

Share Encoding SameDierent

Share Data AllSubsetNone

Share Navigation

Linked Highlighting

Idiom Linked highlighting

33

System EDVbull see how regions

contiguous in one view are distributed within anotherndash powerful and pervasive

interaction idiom

bull encoding differentndashmultiform

bull data all shared

[Visual Exploration of Large Structured Datasets Wills Proc New Techniques and Trends in Statistics (NTTS) pp 237ndash246 IOS Press 1995]

Idiom birdrsquos-eye maps

34

bull encoding samebull data subset sharedbull navigation shared

ndash bidirectional linking

bull differencesndash viewpointndash (size)

bull overview-detail

System Google Maps

[A Review of Overview+Detail Zooming and Focus+Context Interfaces Cockburn Karlson and Bederson ACM Computing Surveys 411 (2008) 1ndash31]

Idiom Small multiplesbull encoding samebull data none shared

ndash different attributes for node colors

ndash (same network layout)

bull navigation shared

35

System Cerebral

[Cerebral Visualizing Multiple Experimental Conditions on a Graph with Biological Context Barsky Munzner Gardy and Kincaid IEEE Trans Visualization and Computer Graphics (Proc InfoVis 2008) 146 (2008) 1253ndash1260]

Coordinate views Design choice interaction

36

All Subset

Same

Multiform

Multiform Overview

Detail

None

Redundant

No Linkage

Small Multiples

OverviewDetail

bull why juxtapose viewsndash benefits eyes vs memory

bull lower cognitive load to move eyes between 2 views than remembering previous state with single changing view

ndash costs display area 2 views side by side each have only half the area of one view

Partition into views

37

bull how to divide data between viewsndash encodes association between items

using spatial proximity ndash major implications for what patterns

are visiblendash split according to attributes

bull design choicesndash how many splits

bull all the way down one mark per regionbull stop earlier for more complex structure

within region

ndash order in which attribs used to splitndash how many views

Partition into Side-by-Side Views

Partitioning List alignmentbull single bar chart with grouped bars

ndash split by state into regionsbull complex glyph within each region showing all ages

ndash compare easy within state hard across ages

bull small-multiple bar chartsndash split by age into regions

bull one chart per region

ndash compare easy within age harder across states

38

110

100

90

80

70

60

50

40

30

20

10

00 CA TK NY FL IL PA

65 Years and Over45 to 64 Years25 to 44 Years18 to 24 Years14 to 17 Years5 to 13 YearsUnder 5 Years

CA TK NY FL IL PA

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

httpblocksorgmbostock3887051 httpblocksorgmbostock4679202

Partitioning Recursive subdivision

bull split by neighborhoodbull then by type bull then time

ndash years as rowsndash months as columns

bull color by price

bull neighborhood patternsndash where itrsquos expensivendash where you pay much more

for detached type

39[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull switch order of splitsndash type then neighborhood

bull switch colorndash by price variation

bull type patternsndash within specific type which

neighborhoods inconsistent

40[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull different encoding for second-level regionsndash choropleth maps

41[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

How to handle complexity 3 more strategies

42

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull reduce what is shown within single view

Reduce items and attributes

43

bull reduceincrease inversesbull filter

ndash pro straightforward and intuitivebull to understand and compute

ndash con out of sight out of mind

bull aggregationndash pro inform about whole setndash con difficult to avoid losing signal

bull not mutually exclusivendash combine filter aggregatendash combine reduce facet change derive

Reduce

Filter

Aggregate

Embed

Reducing Items and Attributes

FilterItems

Attributes

Aggregate

Items

Attributes

Idiom boxplotbull static item aggregationbull task find distributionbull data tablebull derived data

ndash 5 quant attribsbull median central linebull lower and upper quartile boxesbull lower upper fences whiskers

ndash values beyond which items are outliers

ndash outliers beyond fence cutoffs explicitly shown

44

pod and the rug plot looks like the seeds within Kampstra (2008) also suggests a way of comparing two

groups more easily use the left and right sides of the bean to display different distributions A related idea

is the raindrop plot (Barrowman and Myers 2003) but its focus is on the display of error distributions from

complex models

Figure 4 demonstrates these density boxplots applied to 100 numbers drawn from each of four distribu-

tions with mean 0 and standard deviation 1 a standard normal a skew-right distribution (Johnson distri-

bution with skewness 22 and kurtosis 13) a leptikurtic distribution (Johnson distribution with skewness 0

and kurtosis 20) and a bimodal distribution (two normals with mean -095 and 095 and standard devia-

tion 031) Richer displays of density make it much easier to see important variations in the distribution

multi-modality is particularly important and yet completely invisible with the boxplot

n s k mm

2

02

4

n s k mm

2

02

4

n s k mm

4

2

02

4

4

2

02

4

n s k mm

Figure 4 From left to right box plot vase plot violin plot and bean plot Within each plot the distributions from left to

right are standard normal (n) right-skewed (s) leptikurtic (k) and bimodal (mm) A normal kernel and bandwidth of

02 are used in all plots for all groups

A more sophisticated display is the sectioned density plot (Cohen and Cohen 2006) which uses both

colour and space to stack a density estimate into a smaller area hopefully without losing any information

(not formally verified with a perceptual study) The sectioned density plot is similar in spirit to horizon

graphs for time series (Reijner 2008) which have been found to be just as readable as regular line graphs

despite taking up much less space (Heer et al 2009) The density strips of Jackson (2008) provide a similar

compact display that uses colour instead of width to display density These methods are shown in Figure 5

6

[40 years of boxplots Wickham and Stryjewski 2012 hadconz]

Idiom Dimensionality reduction for documents

45

Task 1

InHD data

Out2D data

ProduceIn High- dimensional data

WhyWhat

Derive

In2D data

Task 2

Out 2D data

HowWhyWhat

EncodeNavigateSelect

DiscoverExploreIdentify

In 2D dataOut ScatterplotOut Clusters amp points

OutScatterplotClusters amp points

Task 3

InScatterplotClusters amp points

OutLabels for clusters

WhyWhat

ProduceAnnotate

In ScatterplotIn Clusters amp pointsOut Labels for clusters

wombat

bull attribute aggregationndash derive low-dimensional target space from high-dimensional measured space

46

Datasets

WhatAttributes

Dataset Types

Data Types

Data and Dataset Types

Tables

Attributes (columns)

Items (rows)

Cell containing value

Networks

Link

Node (item)

Trees

Fields (Continuous)

Geometry (Spatial)

Attributes (columns)

Value in cell

Cell

Multidimensional Table

Value in cell

Items Attributes Links Positions Grids

Attribute Types

Ordering Direction

Categorical

OrderedOrdinal

Quantitative

Sequential

Diverging

Cyclic

Tables Networks amp Trees

Fields Geometry Clusters Sets Lists

Items

Attributes

Items (nodes)

Links

Attributes

Grids

Positions

Attributes

Items

Positions

Items

Grid of positions

Position

Trends

Actions

Analyze

Search

Query

Why

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Network Data

Spatial Data

Topology

Paths

Extremes

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

Identify Compare Summarize

tag

Target known Target unknown

Location knownLocation unknown

Lookup

Locate

Browse

Explore

Targets

Why

What

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

algorithm

idiom

abstraction

domain

More Informationbull this talk

httpwwwcsubcca~tmmtalkshtmlvad15d3

bull book page (including tutorial lecture slides)httpwwwcsubcca~tmmvadbook

ndash 20 promo code for book+ebook combo HVN17

ndash httpwwwcrcpresscomproductisbn9781466508910

ndash illustrations Eamonn Maguire

bull papers videos software talks full courses httpwwwcsubccagroupinfovis httpwwwcsubcca~tmm

47Munzner A K Peters Visualization Series CRC Press Visualization Series 2014

Visualization Analysis and Design

tamaramunzner

Page 15: Visualization Analysis & Designii.uib.no/vis_old/publications/pdfs/2016-01-17--vad15d3unconf--edited-by-HH.pdf · Visualization is suitable when there is a need to augment human capabilities

Types Datasets and data

11

Tables

Attributes (columns)

Items (rows)

Cell containing value

Dataset Types

Attribute TypesCategorical Ordered

Ordinal Quantitative

Networks

Link

Node (item)

Node (item)

Fields (Continuous)

Attributes (columns)

Value in cell

Cell

Grid of positions

Geometry (Spatial)

Position

SpatialNetworks

12

bull action target pairsndash discover distribution

ndash compare trends

ndash locate outliers

ndash browse topology

Trends

Actions

Analyze

Search

Query

Why

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Network Data

Spatial DataShape

Topology

Paths

Extremes

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

Identify Compare Summarize

tag

Target known Target unknown

Location knownLocation unknown

Lookup

Locate

Browse

Explore

Targets

Why

How

What

13

Actions 1 Analyzebull consume

ndashdiscover vs presentbull classic splitbull aka explore vs explain

ndashenjoybull newcomerbull aka casual social

bull producendashannotate recordndashderive

bull crucial design choice

Analyze

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

tag

Derive

bull donrsquot just draw what yoursquore givenndash decide what the right thing to show isndash create it with a series of transformations from the original datasetndash draw that

bull one of the four major strategies for handling complexity

14Original Data

exports

imports

Derived Data

trade balance = exports imports

trade balance

Analysis example Derive one attribute

15

[Using Strahler numbers for real time visual exploration of huge graphs Auber Proc Intl Conf Computer Vision and Graphics pp 56ndash69 2002]

bull Strahler numberndash centrality metric for treesnetworksndash derived quantitative attribute

ndash draw top 5K of 500K for good skeleton

Task 1

58

54

64

84

24

74

6484

84

94

74

OutQuantitative attribute on nodes

58

54

64

84

24

74

6484

84

94

74

InQuantitative attribute on nodes

Task 2

Derive

WhyWhat

In Tree ReduceSummarize

HowWhyWhat

In Quantitative attribute on nodes TopologyIn Tree

Filter

InTree

OutFiltered TreeRemoved unimportant parts

InTree +

Out Quantitative attribute on nodes Out Filtered Tree

16

Actions II Search

bull what does user knowndash target location

Search

Target known Target unknown

Location known

Location unknown

Lookup

Locate

Browse

Explore

17

Actions III Query

bull what does user knowndash target location

bull how much of the data mattersndash one some all

bull analyze search queryndash independent choices for each

Search

Query

Identify Compare Summarize

Target known Target unknown

Location known

Location unknown

Lookup

Locate

Browse

Explore

Targets

18

Trends

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Extremes

Network Data

Spatial DataShape

Topology

Paths

19

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

How to encode Arrange space map channels

20

Encode

ArrangeExpress Separate

Order Align

Use

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

Encoding visually

bull analyze idiom structure

21

22

Definitions Marks and channelsbull marks

ndash geometric primitives

bull channelsndash control appearance of marks

Horizontal

Position

Vertical Both

Color

Shape Tilt

Size

Length Area Volume

Points Lines Areas

Encoding visually with marks and channels

bull analyze idiom structurendash as combination of marks and channels

23

1 vertical position

mark line

2 vertical positionhorizontal position

mark point

3 vertical positionhorizontal positioncolor hue

mark point

4 vertical positionhorizontal positioncolor huesize (area)

mark point

24

Channels Expressiveness types and effectiveness rankingsMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

25

Channels Matching TypesMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

bull expressiveness principlendash match channel and data characteristics

26

Channels RankingsMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

bull expressiveness principlendash match channel and data characteristics

bull effectiveness principlendash encode most important attributes with

highest ranked channels

27

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

How to handle complexity 3 more strategies

28

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull change view over timebull facet across multiple

viewsbull reduce itemsattributes

within single viewbull derive new data to

show within view

How to handle complexity 3 more strategies

29

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull change over time- most obvious amp flexible

of the 4 strategies

Idiom Animated transitionsbull smooth transition from one state to another

ndash alternative to jump cutsndash support for item tracking when amount of change is limited

bull example multilevel matrix viewsndash scope of what is shown narrows down

bull middle block stretches to fill space additional structure appears withinbull other blocks squish down to increasingly aggregated representations

30[Using Multilevel Call Matrices in Large Software Projects van Ham Proc IEEE Symp Information Visualization (InfoVis) pp 227ndash232 2003]

How to handle complexity 3 more strategies

31

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull facet data across multiple views

Facet

32

Juxtapose

Partition

Superimpose

Coordinate Multiple Side By Side Views

Share Encoding SameDierent

Share Data AllSubsetNone

Share Navigation

Linked Highlighting

Idiom Linked highlighting

33

System EDVbull see how regions

contiguous in one view are distributed within anotherndash powerful and pervasive

interaction idiom

bull encoding differentndashmultiform

bull data all shared

[Visual Exploration of Large Structured Datasets Wills Proc New Techniques and Trends in Statistics (NTTS) pp 237ndash246 IOS Press 1995]

Idiom birdrsquos-eye maps

34

bull encoding samebull data subset sharedbull navigation shared

ndash bidirectional linking

bull differencesndash viewpointndash (size)

bull overview-detail

System Google Maps

[A Review of Overview+Detail Zooming and Focus+Context Interfaces Cockburn Karlson and Bederson ACM Computing Surveys 411 (2008) 1ndash31]

Idiom Small multiplesbull encoding samebull data none shared

ndash different attributes for node colors

ndash (same network layout)

bull navigation shared

35

System Cerebral

[Cerebral Visualizing Multiple Experimental Conditions on a Graph with Biological Context Barsky Munzner Gardy and Kincaid IEEE Trans Visualization and Computer Graphics (Proc InfoVis 2008) 146 (2008) 1253ndash1260]

Coordinate views Design choice interaction

36

All Subset

Same

Multiform

Multiform Overview

Detail

None

Redundant

No Linkage

Small Multiples

OverviewDetail

bull why juxtapose viewsndash benefits eyes vs memory

bull lower cognitive load to move eyes between 2 views than remembering previous state with single changing view

ndash costs display area 2 views side by side each have only half the area of one view

Partition into views

37

bull how to divide data between viewsndash encodes association between items

using spatial proximity ndash major implications for what patterns

are visiblendash split according to attributes

bull design choicesndash how many splits

bull all the way down one mark per regionbull stop earlier for more complex structure

within region

ndash order in which attribs used to splitndash how many views

Partition into Side-by-Side Views

Partitioning List alignmentbull single bar chart with grouped bars

ndash split by state into regionsbull complex glyph within each region showing all ages

ndash compare easy within state hard across ages

bull small-multiple bar chartsndash split by age into regions

bull one chart per region

ndash compare easy within age harder across states

38

110

100

90

80

70

60

50

40

30

20

10

00 CA TK NY FL IL PA

65 Years and Over45 to 64 Years25 to 44 Years18 to 24 Years14 to 17 Years5 to 13 YearsUnder 5 Years

CA TK NY FL IL PA

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

httpblocksorgmbostock3887051 httpblocksorgmbostock4679202

Partitioning Recursive subdivision

bull split by neighborhoodbull then by type bull then time

ndash years as rowsndash months as columns

bull color by price

bull neighborhood patternsndash where itrsquos expensivendash where you pay much more

for detached type

39[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull switch order of splitsndash type then neighborhood

bull switch colorndash by price variation

bull type patternsndash within specific type which

neighborhoods inconsistent

40[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull different encoding for second-level regionsndash choropleth maps

41[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

How to handle complexity 3 more strategies

42

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull reduce what is shown within single view

Reduce items and attributes

43

bull reduceincrease inversesbull filter

ndash pro straightforward and intuitivebull to understand and compute

ndash con out of sight out of mind

bull aggregationndash pro inform about whole setndash con difficult to avoid losing signal

bull not mutually exclusivendash combine filter aggregatendash combine reduce facet change derive

Reduce

Filter

Aggregate

Embed

Reducing Items and Attributes

FilterItems

Attributes

Aggregate

Items

Attributes

Idiom boxplotbull static item aggregationbull task find distributionbull data tablebull derived data

ndash 5 quant attribsbull median central linebull lower and upper quartile boxesbull lower upper fences whiskers

ndash values beyond which items are outliers

ndash outliers beyond fence cutoffs explicitly shown

44

pod and the rug plot looks like the seeds within Kampstra (2008) also suggests a way of comparing two

groups more easily use the left and right sides of the bean to display different distributions A related idea

is the raindrop plot (Barrowman and Myers 2003) but its focus is on the display of error distributions from

complex models

Figure 4 demonstrates these density boxplots applied to 100 numbers drawn from each of four distribu-

tions with mean 0 and standard deviation 1 a standard normal a skew-right distribution (Johnson distri-

bution with skewness 22 and kurtosis 13) a leptikurtic distribution (Johnson distribution with skewness 0

and kurtosis 20) and a bimodal distribution (two normals with mean -095 and 095 and standard devia-

tion 031) Richer displays of density make it much easier to see important variations in the distribution

multi-modality is particularly important and yet completely invisible with the boxplot

n s k mm

2

02

4

n s k mm

2

02

4

n s k mm

4

2

02

4

4

2

02

4

n s k mm

Figure 4 From left to right box plot vase plot violin plot and bean plot Within each plot the distributions from left to

right are standard normal (n) right-skewed (s) leptikurtic (k) and bimodal (mm) A normal kernel and bandwidth of

02 are used in all plots for all groups

A more sophisticated display is the sectioned density plot (Cohen and Cohen 2006) which uses both

colour and space to stack a density estimate into a smaller area hopefully without losing any information

(not formally verified with a perceptual study) The sectioned density plot is similar in spirit to horizon

graphs for time series (Reijner 2008) which have been found to be just as readable as regular line graphs

despite taking up much less space (Heer et al 2009) The density strips of Jackson (2008) provide a similar

compact display that uses colour instead of width to display density These methods are shown in Figure 5

6

[40 years of boxplots Wickham and Stryjewski 2012 hadconz]

Idiom Dimensionality reduction for documents

45

Task 1

InHD data

Out2D data

ProduceIn High- dimensional data

WhyWhat

Derive

In2D data

Task 2

Out 2D data

HowWhyWhat

EncodeNavigateSelect

DiscoverExploreIdentify

In 2D dataOut ScatterplotOut Clusters amp points

OutScatterplotClusters amp points

Task 3

InScatterplotClusters amp points

OutLabels for clusters

WhyWhat

ProduceAnnotate

In ScatterplotIn Clusters amp pointsOut Labels for clusters

wombat

bull attribute aggregationndash derive low-dimensional target space from high-dimensional measured space

46

Datasets

WhatAttributes

Dataset Types

Data Types

Data and Dataset Types

Tables

Attributes (columns)

Items (rows)

Cell containing value

Networks

Link

Node (item)

Trees

Fields (Continuous)

Geometry (Spatial)

Attributes (columns)

Value in cell

Cell

Multidimensional Table

Value in cell

Items Attributes Links Positions Grids

Attribute Types

Ordering Direction

Categorical

OrderedOrdinal

Quantitative

Sequential

Diverging

Cyclic

Tables Networks amp Trees

Fields Geometry Clusters Sets Lists

Items

Attributes

Items (nodes)

Links

Attributes

Grids

Positions

Attributes

Items

Positions

Items

Grid of positions

Position

Trends

Actions

Analyze

Search

Query

Why

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Network Data

Spatial Data

Topology

Paths

Extremes

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

Identify Compare Summarize

tag

Target known Target unknown

Location knownLocation unknown

Lookup

Locate

Browse

Explore

Targets

Why

What

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

algorithm

idiom

abstraction

domain

More Informationbull this talk

httpwwwcsubcca~tmmtalkshtmlvad15d3

bull book page (including tutorial lecture slides)httpwwwcsubcca~tmmvadbook

ndash 20 promo code for book+ebook combo HVN17

ndash httpwwwcrcpresscomproductisbn9781466508910

ndash illustrations Eamonn Maguire

bull papers videos software talks full courses httpwwwcsubccagroupinfovis httpwwwcsubcca~tmm

47Munzner A K Peters Visualization Series CRC Press Visualization Series 2014

Visualization Analysis and Design

tamaramunzner

Page 16: Visualization Analysis & Designii.uib.no/vis_old/publications/pdfs/2016-01-17--vad15d3unconf--edited-by-HH.pdf · Visualization is suitable when there is a need to augment human capabilities

12

bull action target pairsndash discover distribution

ndash compare trends

ndash locate outliers

ndash browse topology

Trends

Actions

Analyze

Search

Query

Why

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Network Data

Spatial DataShape

Topology

Paths

Extremes

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

Identify Compare Summarize

tag

Target known Target unknown

Location knownLocation unknown

Lookup

Locate

Browse

Explore

Targets

Why

How

What

13

Actions 1 Analyzebull consume

ndashdiscover vs presentbull classic splitbull aka explore vs explain

ndashenjoybull newcomerbull aka casual social

bull producendashannotate recordndashderive

bull crucial design choice

Analyze

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

tag

Derive

bull donrsquot just draw what yoursquore givenndash decide what the right thing to show isndash create it with a series of transformations from the original datasetndash draw that

bull one of the four major strategies for handling complexity

14Original Data

exports

imports

Derived Data

trade balance = exports imports

trade balance

Analysis example Derive one attribute

15

[Using Strahler numbers for real time visual exploration of huge graphs Auber Proc Intl Conf Computer Vision and Graphics pp 56ndash69 2002]

bull Strahler numberndash centrality metric for treesnetworksndash derived quantitative attribute

ndash draw top 5K of 500K for good skeleton

Task 1

58

54

64

84

24

74

6484

84

94

74

OutQuantitative attribute on nodes

58

54

64

84

24

74

6484

84

94

74

InQuantitative attribute on nodes

Task 2

Derive

WhyWhat

In Tree ReduceSummarize

HowWhyWhat

In Quantitative attribute on nodes TopologyIn Tree

Filter

InTree

OutFiltered TreeRemoved unimportant parts

InTree +

Out Quantitative attribute on nodes Out Filtered Tree

16

Actions II Search

bull what does user knowndash target location

Search

Target known Target unknown

Location known

Location unknown

Lookup

Locate

Browse

Explore

17

Actions III Query

bull what does user knowndash target location

bull how much of the data mattersndash one some all

bull analyze search queryndash independent choices for each

Search

Query

Identify Compare Summarize

Target known Target unknown

Location known

Location unknown

Lookup

Locate

Browse

Explore

Targets

18

Trends

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Extremes

Network Data

Spatial DataShape

Topology

Paths

19

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

How to encode Arrange space map channels

20

Encode

ArrangeExpress Separate

Order Align

Use

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

Encoding visually

bull analyze idiom structure

21

22

Definitions Marks and channelsbull marks

ndash geometric primitives

bull channelsndash control appearance of marks

Horizontal

Position

Vertical Both

Color

Shape Tilt

Size

Length Area Volume

Points Lines Areas

Encoding visually with marks and channels

bull analyze idiom structurendash as combination of marks and channels

23

1 vertical position

mark line

2 vertical positionhorizontal position

mark point

3 vertical positionhorizontal positioncolor hue

mark point

4 vertical positionhorizontal positioncolor huesize (area)

mark point

24

Channels Expressiveness types and effectiveness rankingsMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

25

Channels Matching TypesMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

bull expressiveness principlendash match channel and data characteristics

26

Channels RankingsMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

bull expressiveness principlendash match channel and data characteristics

bull effectiveness principlendash encode most important attributes with

highest ranked channels

27

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

How to handle complexity 3 more strategies

28

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull change view over timebull facet across multiple

viewsbull reduce itemsattributes

within single viewbull derive new data to

show within view

How to handle complexity 3 more strategies

29

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull change over time- most obvious amp flexible

of the 4 strategies

Idiom Animated transitionsbull smooth transition from one state to another

ndash alternative to jump cutsndash support for item tracking when amount of change is limited

bull example multilevel matrix viewsndash scope of what is shown narrows down

bull middle block stretches to fill space additional structure appears withinbull other blocks squish down to increasingly aggregated representations

30[Using Multilevel Call Matrices in Large Software Projects van Ham Proc IEEE Symp Information Visualization (InfoVis) pp 227ndash232 2003]

How to handle complexity 3 more strategies

31

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull facet data across multiple views

Facet

32

Juxtapose

Partition

Superimpose

Coordinate Multiple Side By Side Views

Share Encoding SameDierent

Share Data AllSubsetNone

Share Navigation

Linked Highlighting

Idiom Linked highlighting

33

System EDVbull see how regions

contiguous in one view are distributed within anotherndash powerful and pervasive

interaction idiom

bull encoding differentndashmultiform

bull data all shared

[Visual Exploration of Large Structured Datasets Wills Proc New Techniques and Trends in Statistics (NTTS) pp 237ndash246 IOS Press 1995]

Idiom birdrsquos-eye maps

34

bull encoding samebull data subset sharedbull navigation shared

ndash bidirectional linking

bull differencesndash viewpointndash (size)

bull overview-detail

System Google Maps

[A Review of Overview+Detail Zooming and Focus+Context Interfaces Cockburn Karlson and Bederson ACM Computing Surveys 411 (2008) 1ndash31]

Idiom Small multiplesbull encoding samebull data none shared

ndash different attributes for node colors

ndash (same network layout)

bull navigation shared

35

System Cerebral

[Cerebral Visualizing Multiple Experimental Conditions on a Graph with Biological Context Barsky Munzner Gardy and Kincaid IEEE Trans Visualization and Computer Graphics (Proc InfoVis 2008) 146 (2008) 1253ndash1260]

Coordinate views Design choice interaction

36

All Subset

Same

Multiform

Multiform Overview

Detail

None

Redundant

No Linkage

Small Multiples

OverviewDetail

bull why juxtapose viewsndash benefits eyes vs memory

bull lower cognitive load to move eyes between 2 views than remembering previous state with single changing view

ndash costs display area 2 views side by side each have only half the area of one view

Partition into views

37

bull how to divide data between viewsndash encodes association between items

using spatial proximity ndash major implications for what patterns

are visiblendash split according to attributes

bull design choicesndash how many splits

bull all the way down one mark per regionbull stop earlier for more complex structure

within region

ndash order in which attribs used to splitndash how many views

Partition into Side-by-Side Views

Partitioning List alignmentbull single bar chart with grouped bars

ndash split by state into regionsbull complex glyph within each region showing all ages

ndash compare easy within state hard across ages

bull small-multiple bar chartsndash split by age into regions

bull one chart per region

ndash compare easy within age harder across states

38

110

100

90

80

70

60

50

40

30

20

10

00 CA TK NY FL IL PA

65 Years and Over45 to 64 Years25 to 44 Years18 to 24 Years14 to 17 Years5 to 13 YearsUnder 5 Years

CA TK NY FL IL PA

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

httpblocksorgmbostock3887051 httpblocksorgmbostock4679202

Partitioning Recursive subdivision

bull split by neighborhoodbull then by type bull then time

ndash years as rowsndash months as columns

bull color by price

bull neighborhood patternsndash where itrsquos expensivendash where you pay much more

for detached type

39[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull switch order of splitsndash type then neighborhood

bull switch colorndash by price variation

bull type patternsndash within specific type which

neighborhoods inconsistent

40[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull different encoding for second-level regionsndash choropleth maps

41[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

How to handle complexity 3 more strategies

42

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull reduce what is shown within single view

Reduce items and attributes

43

bull reduceincrease inversesbull filter

ndash pro straightforward and intuitivebull to understand and compute

ndash con out of sight out of mind

bull aggregationndash pro inform about whole setndash con difficult to avoid losing signal

bull not mutually exclusivendash combine filter aggregatendash combine reduce facet change derive

Reduce

Filter

Aggregate

Embed

Reducing Items and Attributes

FilterItems

Attributes

Aggregate

Items

Attributes

Idiom boxplotbull static item aggregationbull task find distributionbull data tablebull derived data

ndash 5 quant attribsbull median central linebull lower and upper quartile boxesbull lower upper fences whiskers

ndash values beyond which items are outliers

ndash outliers beyond fence cutoffs explicitly shown

44

pod and the rug plot looks like the seeds within Kampstra (2008) also suggests a way of comparing two

groups more easily use the left and right sides of the bean to display different distributions A related idea

is the raindrop plot (Barrowman and Myers 2003) but its focus is on the display of error distributions from

complex models

Figure 4 demonstrates these density boxplots applied to 100 numbers drawn from each of four distribu-

tions with mean 0 and standard deviation 1 a standard normal a skew-right distribution (Johnson distri-

bution with skewness 22 and kurtosis 13) a leptikurtic distribution (Johnson distribution with skewness 0

and kurtosis 20) and a bimodal distribution (two normals with mean -095 and 095 and standard devia-

tion 031) Richer displays of density make it much easier to see important variations in the distribution

multi-modality is particularly important and yet completely invisible with the boxplot

n s k mm

2

02

4

n s k mm

2

02

4

n s k mm

4

2

02

4

4

2

02

4

n s k mm

Figure 4 From left to right box plot vase plot violin plot and bean plot Within each plot the distributions from left to

right are standard normal (n) right-skewed (s) leptikurtic (k) and bimodal (mm) A normal kernel and bandwidth of

02 are used in all plots for all groups

A more sophisticated display is the sectioned density plot (Cohen and Cohen 2006) which uses both

colour and space to stack a density estimate into a smaller area hopefully without losing any information

(not formally verified with a perceptual study) The sectioned density plot is similar in spirit to horizon

graphs for time series (Reijner 2008) which have been found to be just as readable as regular line graphs

despite taking up much less space (Heer et al 2009) The density strips of Jackson (2008) provide a similar

compact display that uses colour instead of width to display density These methods are shown in Figure 5

6

[40 years of boxplots Wickham and Stryjewski 2012 hadconz]

Idiom Dimensionality reduction for documents

45

Task 1

InHD data

Out2D data

ProduceIn High- dimensional data

WhyWhat

Derive

In2D data

Task 2

Out 2D data

HowWhyWhat

EncodeNavigateSelect

DiscoverExploreIdentify

In 2D dataOut ScatterplotOut Clusters amp points

OutScatterplotClusters amp points

Task 3

InScatterplotClusters amp points

OutLabels for clusters

WhyWhat

ProduceAnnotate

In ScatterplotIn Clusters amp pointsOut Labels for clusters

wombat

bull attribute aggregationndash derive low-dimensional target space from high-dimensional measured space

46

Datasets

WhatAttributes

Dataset Types

Data Types

Data and Dataset Types

Tables

Attributes (columns)

Items (rows)

Cell containing value

Networks

Link

Node (item)

Trees

Fields (Continuous)

Geometry (Spatial)

Attributes (columns)

Value in cell

Cell

Multidimensional Table

Value in cell

Items Attributes Links Positions Grids

Attribute Types

Ordering Direction

Categorical

OrderedOrdinal

Quantitative

Sequential

Diverging

Cyclic

Tables Networks amp Trees

Fields Geometry Clusters Sets Lists

Items

Attributes

Items (nodes)

Links

Attributes

Grids

Positions

Attributes

Items

Positions

Items

Grid of positions

Position

Trends

Actions

Analyze

Search

Query

Why

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Network Data

Spatial Data

Topology

Paths

Extremes

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

Identify Compare Summarize

tag

Target known Target unknown

Location knownLocation unknown

Lookup

Locate

Browse

Explore

Targets

Why

What

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

algorithm

idiom

abstraction

domain

More Informationbull this talk

httpwwwcsubcca~tmmtalkshtmlvad15d3

bull book page (including tutorial lecture slides)httpwwwcsubcca~tmmvadbook

ndash 20 promo code for book+ebook combo HVN17

ndash httpwwwcrcpresscomproductisbn9781466508910

ndash illustrations Eamonn Maguire

bull papers videos software talks full courses httpwwwcsubccagroupinfovis httpwwwcsubcca~tmm

47Munzner A K Peters Visualization Series CRC Press Visualization Series 2014

Visualization Analysis and Design

tamaramunzner

Page 17: Visualization Analysis & Designii.uib.no/vis_old/publications/pdfs/2016-01-17--vad15d3unconf--edited-by-HH.pdf · Visualization is suitable when there is a need to augment human capabilities

13

Actions 1 Analyzebull consume

ndashdiscover vs presentbull classic splitbull aka explore vs explain

ndashenjoybull newcomerbull aka casual social

bull producendashannotate recordndashderive

bull crucial design choice

Analyze

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

tag

Derive

bull donrsquot just draw what yoursquore givenndash decide what the right thing to show isndash create it with a series of transformations from the original datasetndash draw that

bull one of the four major strategies for handling complexity

14Original Data

exports

imports

Derived Data

trade balance = exports imports

trade balance

Analysis example Derive one attribute

15

[Using Strahler numbers for real time visual exploration of huge graphs Auber Proc Intl Conf Computer Vision and Graphics pp 56ndash69 2002]

bull Strahler numberndash centrality metric for treesnetworksndash derived quantitative attribute

ndash draw top 5K of 500K for good skeleton

Task 1

58

54

64

84

24

74

6484

84

94

74

OutQuantitative attribute on nodes

58

54

64

84

24

74

6484

84

94

74

InQuantitative attribute on nodes

Task 2

Derive

WhyWhat

In Tree ReduceSummarize

HowWhyWhat

In Quantitative attribute on nodes TopologyIn Tree

Filter

InTree

OutFiltered TreeRemoved unimportant parts

InTree +

Out Quantitative attribute on nodes Out Filtered Tree

16

Actions II Search

bull what does user knowndash target location

Search

Target known Target unknown

Location known

Location unknown

Lookup

Locate

Browse

Explore

17

Actions III Query

bull what does user knowndash target location

bull how much of the data mattersndash one some all

bull analyze search queryndash independent choices for each

Search

Query

Identify Compare Summarize

Target known Target unknown

Location known

Location unknown

Lookup

Locate

Browse

Explore

Targets

18

Trends

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Extremes

Network Data

Spatial DataShape

Topology

Paths

19

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

How to encode Arrange space map channels

20

Encode

ArrangeExpress Separate

Order Align

Use

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

Encoding visually

bull analyze idiom structure

21

22

Definitions Marks and channelsbull marks

ndash geometric primitives

bull channelsndash control appearance of marks

Horizontal

Position

Vertical Both

Color

Shape Tilt

Size

Length Area Volume

Points Lines Areas

Encoding visually with marks and channels

bull analyze idiom structurendash as combination of marks and channels

23

1 vertical position

mark line

2 vertical positionhorizontal position

mark point

3 vertical positionhorizontal positioncolor hue

mark point

4 vertical positionhorizontal positioncolor huesize (area)

mark point

24

Channels Expressiveness types and effectiveness rankingsMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

25

Channels Matching TypesMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

bull expressiveness principlendash match channel and data characteristics

26

Channels RankingsMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

bull expressiveness principlendash match channel and data characteristics

bull effectiveness principlendash encode most important attributes with

highest ranked channels

27

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

How to handle complexity 3 more strategies

28

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull change view over timebull facet across multiple

viewsbull reduce itemsattributes

within single viewbull derive new data to

show within view

How to handle complexity 3 more strategies

29

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull change over time- most obvious amp flexible

of the 4 strategies

Idiom Animated transitionsbull smooth transition from one state to another

ndash alternative to jump cutsndash support for item tracking when amount of change is limited

bull example multilevel matrix viewsndash scope of what is shown narrows down

bull middle block stretches to fill space additional structure appears withinbull other blocks squish down to increasingly aggregated representations

30[Using Multilevel Call Matrices in Large Software Projects van Ham Proc IEEE Symp Information Visualization (InfoVis) pp 227ndash232 2003]

How to handle complexity 3 more strategies

31

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull facet data across multiple views

Facet

32

Juxtapose

Partition

Superimpose

Coordinate Multiple Side By Side Views

Share Encoding SameDierent

Share Data AllSubsetNone

Share Navigation

Linked Highlighting

Idiom Linked highlighting

33

System EDVbull see how regions

contiguous in one view are distributed within anotherndash powerful and pervasive

interaction idiom

bull encoding differentndashmultiform

bull data all shared

[Visual Exploration of Large Structured Datasets Wills Proc New Techniques and Trends in Statistics (NTTS) pp 237ndash246 IOS Press 1995]

Idiom birdrsquos-eye maps

34

bull encoding samebull data subset sharedbull navigation shared

ndash bidirectional linking

bull differencesndash viewpointndash (size)

bull overview-detail

System Google Maps

[A Review of Overview+Detail Zooming and Focus+Context Interfaces Cockburn Karlson and Bederson ACM Computing Surveys 411 (2008) 1ndash31]

Idiom Small multiplesbull encoding samebull data none shared

ndash different attributes for node colors

ndash (same network layout)

bull navigation shared

35

System Cerebral

[Cerebral Visualizing Multiple Experimental Conditions on a Graph with Biological Context Barsky Munzner Gardy and Kincaid IEEE Trans Visualization and Computer Graphics (Proc InfoVis 2008) 146 (2008) 1253ndash1260]

Coordinate views Design choice interaction

36

All Subset

Same

Multiform

Multiform Overview

Detail

None

Redundant

No Linkage

Small Multiples

OverviewDetail

bull why juxtapose viewsndash benefits eyes vs memory

bull lower cognitive load to move eyes between 2 views than remembering previous state with single changing view

ndash costs display area 2 views side by side each have only half the area of one view

Partition into views

37

bull how to divide data between viewsndash encodes association between items

using spatial proximity ndash major implications for what patterns

are visiblendash split according to attributes

bull design choicesndash how many splits

bull all the way down one mark per regionbull stop earlier for more complex structure

within region

ndash order in which attribs used to splitndash how many views

Partition into Side-by-Side Views

Partitioning List alignmentbull single bar chart with grouped bars

ndash split by state into regionsbull complex glyph within each region showing all ages

ndash compare easy within state hard across ages

bull small-multiple bar chartsndash split by age into regions

bull one chart per region

ndash compare easy within age harder across states

38

110

100

90

80

70

60

50

40

30

20

10

00 CA TK NY FL IL PA

65 Years and Over45 to 64 Years25 to 44 Years18 to 24 Years14 to 17 Years5 to 13 YearsUnder 5 Years

CA TK NY FL IL PA

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

httpblocksorgmbostock3887051 httpblocksorgmbostock4679202

Partitioning Recursive subdivision

bull split by neighborhoodbull then by type bull then time

ndash years as rowsndash months as columns

bull color by price

bull neighborhood patternsndash where itrsquos expensivendash where you pay much more

for detached type

39[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull switch order of splitsndash type then neighborhood

bull switch colorndash by price variation

bull type patternsndash within specific type which

neighborhoods inconsistent

40[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull different encoding for second-level regionsndash choropleth maps

41[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

How to handle complexity 3 more strategies

42

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull reduce what is shown within single view

Reduce items and attributes

43

bull reduceincrease inversesbull filter

ndash pro straightforward and intuitivebull to understand and compute

ndash con out of sight out of mind

bull aggregationndash pro inform about whole setndash con difficult to avoid losing signal

bull not mutually exclusivendash combine filter aggregatendash combine reduce facet change derive

Reduce

Filter

Aggregate

Embed

Reducing Items and Attributes

FilterItems

Attributes

Aggregate

Items

Attributes

Idiom boxplotbull static item aggregationbull task find distributionbull data tablebull derived data

ndash 5 quant attribsbull median central linebull lower and upper quartile boxesbull lower upper fences whiskers

ndash values beyond which items are outliers

ndash outliers beyond fence cutoffs explicitly shown

44

pod and the rug plot looks like the seeds within Kampstra (2008) also suggests a way of comparing two

groups more easily use the left and right sides of the bean to display different distributions A related idea

is the raindrop plot (Barrowman and Myers 2003) but its focus is on the display of error distributions from

complex models

Figure 4 demonstrates these density boxplots applied to 100 numbers drawn from each of four distribu-

tions with mean 0 and standard deviation 1 a standard normal a skew-right distribution (Johnson distri-

bution with skewness 22 and kurtosis 13) a leptikurtic distribution (Johnson distribution with skewness 0

and kurtosis 20) and a bimodal distribution (two normals with mean -095 and 095 and standard devia-

tion 031) Richer displays of density make it much easier to see important variations in the distribution

multi-modality is particularly important and yet completely invisible with the boxplot

n s k mm

2

02

4

n s k mm

2

02

4

n s k mm

4

2

02

4

4

2

02

4

n s k mm

Figure 4 From left to right box plot vase plot violin plot and bean plot Within each plot the distributions from left to

right are standard normal (n) right-skewed (s) leptikurtic (k) and bimodal (mm) A normal kernel and bandwidth of

02 are used in all plots for all groups

A more sophisticated display is the sectioned density plot (Cohen and Cohen 2006) which uses both

colour and space to stack a density estimate into a smaller area hopefully without losing any information

(not formally verified with a perceptual study) The sectioned density plot is similar in spirit to horizon

graphs for time series (Reijner 2008) which have been found to be just as readable as regular line graphs

despite taking up much less space (Heer et al 2009) The density strips of Jackson (2008) provide a similar

compact display that uses colour instead of width to display density These methods are shown in Figure 5

6

[40 years of boxplots Wickham and Stryjewski 2012 hadconz]

Idiom Dimensionality reduction for documents

45

Task 1

InHD data

Out2D data

ProduceIn High- dimensional data

WhyWhat

Derive

In2D data

Task 2

Out 2D data

HowWhyWhat

EncodeNavigateSelect

DiscoverExploreIdentify

In 2D dataOut ScatterplotOut Clusters amp points

OutScatterplotClusters amp points

Task 3

InScatterplotClusters amp points

OutLabels for clusters

WhyWhat

ProduceAnnotate

In ScatterplotIn Clusters amp pointsOut Labels for clusters

wombat

bull attribute aggregationndash derive low-dimensional target space from high-dimensional measured space

46

Datasets

WhatAttributes

Dataset Types

Data Types

Data and Dataset Types

Tables

Attributes (columns)

Items (rows)

Cell containing value

Networks

Link

Node (item)

Trees

Fields (Continuous)

Geometry (Spatial)

Attributes (columns)

Value in cell

Cell

Multidimensional Table

Value in cell

Items Attributes Links Positions Grids

Attribute Types

Ordering Direction

Categorical

OrderedOrdinal

Quantitative

Sequential

Diverging

Cyclic

Tables Networks amp Trees

Fields Geometry Clusters Sets Lists

Items

Attributes

Items (nodes)

Links

Attributes

Grids

Positions

Attributes

Items

Positions

Items

Grid of positions

Position

Trends

Actions

Analyze

Search

Query

Why

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Network Data

Spatial Data

Topology

Paths

Extremes

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

Identify Compare Summarize

tag

Target known Target unknown

Location knownLocation unknown

Lookup

Locate

Browse

Explore

Targets

Why

What

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

algorithm

idiom

abstraction

domain

More Informationbull this talk

httpwwwcsubcca~tmmtalkshtmlvad15d3

bull book page (including tutorial lecture slides)httpwwwcsubcca~tmmvadbook

ndash 20 promo code for book+ebook combo HVN17

ndash httpwwwcrcpresscomproductisbn9781466508910

ndash illustrations Eamonn Maguire

bull papers videos software talks full courses httpwwwcsubccagroupinfovis httpwwwcsubcca~tmm

47Munzner A K Peters Visualization Series CRC Press Visualization Series 2014

Visualization Analysis and Design

tamaramunzner

Page 18: Visualization Analysis & Designii.uib.no/vis_old/publications/pdfs/2016-01-17--vad15d3unconf--edited-by-HH.pdf · Visualization is suitable when there is a need to augment human capabilities

Derive

bull donrsquot just draw what yoursquore givenndash decide what the right thing to show isndash create it with a series of transformations from the original datasetndash draw that

bull one of the four major strategies for handling complexity

14Original Data

exports

imports

Derived Data

trade balance = exports imports

trade balance

Analysis example Derive one attribute

15

[Using Strahler numbers for real time visual exploration of huge graphs Auber Proc Intl Conf Computer Vision and Graphics pp 56ndash69 2002]

bull Strahler numberndash centrality metric for treesnetworksndash derived quantitative attribute

ndash draw top 5K of 500K for good skeleton

Task 1

58

54

64

84

24

74

6484

84

94

74

OutQuantitative attribute on nodes

58

54

64

84

24

74

6484

84

94

74

InQuantitative attribute on nodes

Task 2

Derive

WhyWhat

In Tree ReduceSummarize

HowWhyWhat

In Quantitative attribute on nodes TopologyIn Tree

Filter

InTree

OutFiltered TreeRemoved unimportant parts

InTree +

Out Quantitative attribute on nodes Out Filtered Tree

16

Actions II Search

bull what does user knowndash target location

Search

Target known Target unknown

Location known

Location unknown

Lookup

Locate

Browse

Explore

17

Actions III Query

bull what does user knowndash target location

bull how much of the data mattersndash one some all

bull analyze search queryndash independent choices for each

Search

Query

Identify Compare Summarize

Target known Target unknown

Location known

Location unknown

Lookup

Locate

Browse

Explore

Targets

18

Trends

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Extremes

Network Data

Spatial DataShape

Topology

Paths

19

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

How to encode Arrange space map channels

20

Encode

ArrangeExpress Separate

Order Align

Use

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

Encoding visually

bull analyze idiom structure

21

22

Definitions Marks and channelsbull marks

ndash geometric primitives

bull channelsndash control appearance of marks

Horizontal

Position

Vertical Both

Color

Shape Tilt

Size

Length Area Volume

Points Lines Areas

Encoding visually with marks and channels

bull analyze idiom structurendash as combination of marks and channels

23

1 vertical position

mark line

2 vertical positionhorizontal position

mark point

3 vertical positionhorizontal positioncolor hue

mark point

4 vertical positionhorizontal positioncolor huesize (area)

mark point

24

Channels Expressiveness types and effectiveness rankingsMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

25

Channels Matching TypesMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

bull expressiveness principlendash match channel and data characteristics

26

Channels RankingsMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

bull expressiveness principlendash match channel and data characteristics

bull effectiveness principlendash encode most important attributes with

highest ranked channels

27

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

How to handle complexity 3 more strategies

28

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull change view over timebull facet across multiple

viewsbull reduce itemsattributes

within single viewbull derive new data to

show within view

How to handle complexity 3 more strategies

29

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull change over time- most obvious amp flexible

of the 4 strategies

Idiom Animated transitionsbull smooth transition from one state to another

ndash alternative to jump cutsndash support for item tracking when amount of change is limited

bull example multilevel matrix viewsndash scope of what is shown narrows down

bull middle block stretches to fill space additional structure appears withinbull other blocks squish down to increasingly aggregated representations

30[Using Multilevel Call Matrices in Large Software Projects van Ham Proc IEEE Symp Information Visualization (InfoVis) pp 227ndash232 2003]

How to handle complexity 3 more strategies

31

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull facet data across multiple views

Facet

32

Juxtapose

Partition

Superimpose

Coordinate Multiple Side By Side Views

Share Encoding SameDierent

Share Data AllSubsetNone

Share Navigation

Linked Highlighting

Idiom Linked highlighting

33

System EDVbull see how regions

contiguous in one view are distributed within anotherndash powerful and pervasive

interaction idiom

bull encoding differentndashmultiform

bull data all shared

[Visual Exploration of Large Structured Datasets Wills Proc New Techniques and Trends in Statistics (NTTS) pp 237ndash246 IOS Press 1995]

Idiom birdrsquos-eye maps

34

bull encoding samebull data subset sharedbull navigation shared

ndash bidirectional linking

bull differencesndash viewpointndash (size)

bull overview-detail

System Google Maps

[A Review of Overview+Detail Zooming and Focus+Context Interfaces Cockburn Karlson and Bederson ACM Computing Surveys 411 (2008) 1ndash31]

Idiom Small multiplesbull encoding samebull data none shared

ndash different attributes for node colors

ndash (same network layout)

bull navigation shared

35

System Cerebral

[Cerebral Visualizing Multiple Experimental Conditions on a Graph with Biological Context Barsky Munzner Gardy and Kincaid IEEE Trans Visualization and Computer Graphics (Proc InfoVis 2008) 146 (2008) 1253ndash1260]

Coordinate views Design choice interaction

36

All Subset

Same

Multiform

Multiform Overview

Detail

None

Redundant

No Linkage

Small Multiples

OverviewDetail

bull why juxtapose viewsndash benefits eyes vs memory

bull lower cognitive load to move eyes between 2 views than remembering previous state with single changing view

ndash costs display area 2 views side by side each have only half the area of one view

Partition into views

37

bull how to divide data between viewsndash encodes association between items

using spatial proximity ndash major implications for what patterns

are visiblendash split according to attributes

bull design choicesndash how many splits

bull all the way down one mark per regionbull stop earlier for more complex structure

within region

ndash order in which attribs used to splitndash how many views

Partition into Side-by-Side Views

Partitioning List alignmentbull single bar chart with grouped bars

ndash split by state into regionsbull complex glyph within each region showing all ages

ndash compare easy within state hard across ages

bull small-multiple bar chartsndash split by age into regions

bull one chart per region

ndash compare easy within age harder across states

38

110

100

90

80

70

60

50

40

30

20

10

00 CA TK NY FL IL PA

65 Years and Over45 to 64 Years25 to 44 Years18 to 24 Years14 to 17 Years5 to 13 YearsUnder 5 Years

CA TK NY FL IL PA

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

httpblocksorgmbostock3887051 httpblocksorgmbostock4679202

Partitioning Recursive subdivision

bull split by neighborhoodbull then by type bull then time

ndash years as rowsndash months as columns

bull color by price

bull neighborhood patternsndash where itrsquos expensivendash where you pay much more

for detached type

39[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull switch order of splitsndash type then neighborhood

bull switch colorndash by price variation

bull type patternsndash within specific type which

neighborhoods inconsistent

40[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull different encoding for second-level regionsndash choropleth maps

41[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

How to handle complexity 3 more strategies

42

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull reduce what is shown within single view

Reduce items and attributes

43

bull reduceincrease inversesbull filter

ndash pro straightforward and intuitivebull to understand and compute

ndash con out of sight out of mind

bull aggregationndash pro inform about whole setndash con difficult to avoid losing signal

bull not mutually exclusivendash combine filter aggregatendash combine reduce facet change derive

Reduce

Filter

Aggregate

Embed

Reducing Items and Attributes

FilterItems

Attributes

Aggregate

Items

Attributes

Idiom boxplotbull static item aggregationbull task find distributionbull data tablebull derived data

ndash 5 quant attribsbull median central linebull lower and upper quartile boxesbull lower upper fences whiskers

ndash values beyond which items are outliers

ndash outliers beyond fence cutoffs explicitly shown

44

pod and the rug plot looks like the seeds within Kampstra (2008) also suggests a way of comparing two

groups more easily use the left and right sides of the bean to display different distributions A related idea

is the raindrop plot (Barrowman and Myers 2003) but its focus is on the display of error distributions from

complex models

Figure 4 demonstrates these density boxplots applied to 100 numbers drawn from each of four distribu-

tions with mean 0 and standard deviation 1 a standard normal a skew-right distribution (Johnson distri-

bution with skewness 22 and kurtosis 13) a leptikurtic distribution (Johnson distribution with skewness 0

and kurtosis 20) and a bimodal distribution (two normals with mean -095 and 095 and standard devia-

tion 031) Richer displays of density make it much easier to see important variations in the distribution

multi-modality is particularly important and yet completely invisible with the boxplot

n s k mm

2

02

4

n s k mm

2

02

4

n s k mm

4

2

02

4

4

2

02

4

n s k mm

Figure 4 From left to right box plot vase plot violin plot and bean plot Within each plot the distributions from left to

right are standard normal (n) right-skewed (s) leptikurtic (k) and bimodal (mm) A normal kernel and bandwidth of

02 are used in all plots for all groups

A more sophisticated display is the sectioned density plot (Cohen and Cohen 2006) which uses both

colour and space to stack a density estimate into a smaller area hopefully without losing any information

(not formally verified with a perceptual study) The sectioned density plot is similar in spirit to horizon

graphs for time series (Reijner 2008) which have been found to be just as readable as regular line graphs

despite taking up much less space (Heer et al 2009) The density strips of Jackson (2008) provide a similar

compact display that uses colour instead of width to display density These methods are shown in Figure 5

6

[40 years of boxplots Wickham and Stryjewski 2012 hadconz]

Idiom Dimensionality reduction for documents

45

Task 1

InHD data

Out2D data

ProduceIn High- dimensional data

WhyWhat

Derive

In2D data

Task 2

Out 2D data

HowWhyWhat

EncodeNavigateSelect

DiscoverExploreIdentify

In 2D dataOut ScatterplotOut Clusters amp points

OutScatterplotClusters amp points

Task 3

InScatterplotClusters amp points

OutLabels for clusters

WhyWhat

ProduceAnnotate

In ScatterplotIn Clusters amp pointsOut Labels for clusters

wombat

bull attribute aggregationndash derive low-dimensional target space from high-dimensional measured space

46

Datasets

WhatAttributes

Dataset Types

Data Types

Data and Dataset Types

Tables

Attributes (columns)

Items (rows)

Cell containing value

Networks

Link

Node (item)

Trees

Fields (Continuous)

Geometry (Spatial)

Attributes (columns)

Value in cell

Cell

Multidimensional Table

Value in cell

Items Attributes Links Positions Grids

Attribute Types

Ordering Direction

Categorical

OrderedOrdinal

Quantitative

Sequential

Diverging

Cyclic

Tables Networks amp Trees

Fields Geometry Clusters Sets Lists

Items

Attributes

Items (nodes)

Links

Attributes

Grids

Positions

Attributes

Items

Positions

Items

Grid of positions

Position

Trends

Actions

Analyze

Search

Query

Why

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Network Data

Spatial Data

Topology

Paths

Extremes

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

Identify Compare Summarize

tag

Target known Target unknown

Location knownLocation unknown

Lookup

Locate

Browse

Explore

Targets

Why

What

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

algorithm

idiom

abstraction

domain

More Informationbull this talk

httpwwwcsubcca~tmmtalkshtmlvad15d3

bull book page (including tutorial lecture slides)httpwwwcsubcca~tmmvadbook

ndash 20 promo code for book+ebook combo HVN17

ndash httpwwwcrcpresscomproductisbn9781466508910

ndash illustrations Eamonn Maguire

bull papers videos software talks full courses httpwwwcsubccagroupinfovis httpwwwcsubcca~tmm

47Munzner A K Peters Visualization Series CRC Press Visualization Series 2014

Visualization Analysis and Design

tamaramunzner

Page 19: Visualization Analysis & Designii.uib.no/vis_old/publications/pdfs/2016-01-17--vad15d3unconf--edited-by-HH.pdf · Visualization is suitable when there is a need to augment human capabilities

Analysis example Derive one attribute

15

[Using Strahler numbers for real time visual exploration of huge graphs Auber Proc Intl Conf Computer Vision and Graphics pp 56ndash69 2002]

bull Strahler numberndash centrality metric for treesnetworksndash derived quantitative attribute

ndash draw top 5K of 500K for good skeleton

Task 1

58

54

64

84

24

74

6484

84

94

74

OutQuantitative attribute on nodes

58

54

64

84

24

74

6484

84

94

74

InQuantitative attribute on nodes

Task 2

Derive

WhyWhat

In Tree ReduceSummarize

HowWhyWhat

In Quantitative attribute on nodes TopologyIn Tree

Filter

InTree

OutFiltered TreeRemoved unimportant parts

InTree +

Out Quantitative attribute on nodes Out Filtered Tree

16

Actions II Search

bull what does user knowndash target location

Search

Target known Target unknown

Location known

Location unknown

Lookup

Locate

Browse

Explore

17

Actions III Query

bull what does user knowndash target location

bull how much of the data mattersndash one some all

bull analyze search queryndash independent choices for each

Search

Query

Identify Compare Summarize

Target known Target unknown

Location known

Location unknown

Lookup

Locate

Browse

Explore

Targets

18

Trends

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Extremes

Network Data

Spatial DataShape

Topology

Paths

19

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

How to encode Arrange space map channels

20

Encode

ArrangeExpress Separate

Order Align

Use

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

Encoding visually

bull analyze idiom structure

21

22

Definitions Marks and channelsbull marks

ndash geometric primitives

bull channelsndash control appearance of marks

Horizontal

Position

Vertical Both

Color

Shape Tilt

Size

Length Area Volume

Points Lines Areas

Encoding visually with marks and channels

bull analyze idiom structurendash as combination of marks and channels

23

1 vertical position

mark line

2 vertical positionhorizontal position

mark point

3 vertical positionhorizontal positioncolor hue

mark point

4 vertical positionhorizontal positioncolor huesize (area)

mark point

24

Channels Expressiveness types and effectiveness rankingsMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

25

Channels Matching TypesMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

bull expressiveness principlendash match channel and data characteristics

26

Channels RankingsMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

bull expressiveness principlendash match channel and data characteristics

bull effectiveness principlendash encode most important attributes with

highest ranked channels

27

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

How to handle complexity 3 more strategies

28

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull change view over timebull facet across multiple

viewsbull reduce itemsattributes

within single viewbull derive new data to

show within view

How to handle complexity 3 more strategies

29

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull change over time- most obvious amp flexible

of the 4 strategies

Idiom Animated transitionsbull smooth transition from one state to another

ndash alternative to jump cutsndash support for item tracking when amount of change is limited

bull example multilevel matrix viewsndash scope of what is shown narrows down

bull middle block stretches to fill space additional structure appears withinbull other blocks squish down to increasingly aggregated representations

30[Using Multilevel Call Matrices in Large Software Projects van Ham Proc IEEE Symp Information Visualization (InfoVis) pp 227ndash232 2003]

How to handle complexity 3 more strategies

31

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull facet data across multiple views

Facet

32

Juxtapose

Partition

Superimpose

Coordinate Multiple Side By Side Views

Share Encoding SameDierent

Share Data AllSubsetNone

Share Navigation

Linked Highlighting

Idiom Linked highlighting

33

System EDVbull see how regions

contiguous in one view are distributed within anotherndash powerful and pervasive

interaction idiom

bull encoding differentndashmultiform

bull data all shared

[Visual Exploration of Large Structured Datasets Wills Proc New Techniques and Trends in Statistics (NTTS) pp 237ndash246 IOS Press 1995]

Idiom birdrsquos-eye maps

34

bull encoding samebull data subset sharedbull navigation shared

ndash bidirectional linking

bull differencesndash viewpointndash (size)

bull overview-detail

System Google Maps

[A Review of Overview+Detail Zooming and Focus+Context Interfaces Cockburn Karlson and Bederson ACM Computing Surveys 411 (2008) 1ndash31]

Idiom Small multiplesbull encoding samebull data none shared

ndash different attributes for node colors

ndash (same network layout)

bull navigation shared

35

System Cerebral

[Cerebral Visualizing Multiple Experimental Conditions on a Graph with Biological Context Barsky Munzner Gardy and Kincaid IEEE Trans Visualization and Computer Graphics (Proc InfoVis 2008) 146 (2008) 1253ndash1260]

Coordinate views Design choice interaction

36

All Subset

Same

Multiform

Multiform Overview

Detail

None

Redundant

No Linkage

Small Multiples

OverviewDetail

bull why juxtapose viewsndash benefits eyes vs memory

bull lower cognitive load to move eyes between 2 views than remembering previous state with single changing view

ndash costs display area 2 views side by side each have only half the area of one view

Partition into views

37

bull how to divide data between viewsndash encodes association between items

using spatial proximity ndash major implications for what patterns

are visiblendash split according to attributes

bull design choicesndash how many splits

bull all the way down one mark per regionbull stop earlier for more complex structure

within region

ndash order in which attribs used to splitndash how many views

Partition into Side-by-Side Views

Partitioning List alignmentbull single bar chart with grouped bars

ndash split by state into regionsbull complex glyph within each region showing all ages

ndash compare easy within state hard across ages

bull small-multiple bar chartsndash split by age into regions

bull one chart per region

ndash compare easy within age harder across states

38

110

100

90

80

70

60

50

40

30

20

10

00 CA TK NY FL IL PA

65 Years and Over45 to 64 Years25 to 44 Years18 to 24 Years14 to 17 Years5 to 13 YearsUnder 5 Years

CA TK NY FL IL PA

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

httpblocksorgmbostock3887051 httpblocksorgmbostock4679202

Partitioning Recursive subdivision

bull split by neighborhoodbull then by type bull then time

ndash years as rowsndash months as columns

bull color by price

bull neighborhood patternsndash where itrsquos expensivendash where you pay much more

for detached type

39[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull switch order of splitsndash type then neighborhood

bull switch colorndash by price variation

bull type patternsndash within specific type which

neighborhoods inconsistent

40[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull different encoding for second-level regionsndash choropleth maps

41[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

How to handle complexity 3 more strategies

42

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull reduce what is shown within single view

Reduce items and attributes

43

bull reduceincrease inversesbull filter

ndash pro straightforward and intuitivebull to understand and compute

ndash con out of sight out of mind

bull aggregationndash pro inform about whole setndash con difficult to avoid losing signal

bull not mutually exclusivendash combine filter aggregatendash combine reduce facet change derive

Reduce

Filter

Aggregate

Embed

Reducing Items and Attributes

FilterItems

Attributes

Aggregate

Items

Attributes

Idiom boxplotbull static item aggregationbull task find distributionbull data tablebull derived data

ndash 5 quant attribsbull median central linebull lower and upper quartile boxesbull lower upper fences whiskers

ndash values beyond which items are outliers

ndash outliers beyond fence cutoffs explicitly shown

44

pod and the rug plot looks like the seeds within Kampstra (2008) also suggests a way of comparing two

groups more easily use the left and right sides of the bean to display different distributions A related idea

is the raindrop plot (Barrowman and Myers 2003) but its focus is on the display of error distributions from

complex models

Figure 4 demonstrates these density boxplots applied to 100 numbers drawn from each of four distribu-

tions with mean 0 and standard deviation 1 a standard normal a skew-right distribution (Johnson distri-

bution with skewness 22 and kurtosis 13) a leptikurtic distribution (Johnson distribution with skewness 0

and kurtosis 20) and a bimodal distribution (two normals with mean -095 and 095 and standard devia-

tion 031) Richer displays of density make it much easier to see important variations in the distribution

multi-modality is particularly important and yet completely invisible with the boxplot

n s k mm

2

02

4

n s k mm

2

02

4

n s k mm

4

2

02

4

4

2

02

4

n s k mm

Figure 4 From left to right box plot vase plot violin plot and bean plot Within each plot the distributions from left to

right are standard normal (n) right-skewed (s) leptikurtic (k) and bimodal (mm) A normal kernel and bandwidth of

02 are used in all plots for all groups

A more sophisticated display is the sectioned density plot (Cohen and Cohen 2006) which uses both

colour and space to stack a density estimate into a smaller area hopefully without losing any information

(not formally verified with a perceptual study) The sectioned density plot is similar in spirit to horizon

graphs for time series (Reijner 2008) which have been found to be just as readable as regular line graphs

despite taking up much less space (Heer et al 2009) The density strips of Jackson (2008) provide a similar

compact display that uses colour instead of width to display density These methods are shown in Figure 5

6

[40 years of boxplots Wickham and Stryjewski 2012 hadconz]

Idiom Dimensionality reduction for documents

45

Task 1

InHD data

Out2D data

ProduceIn High- dimensional data

WhyWhat

Derive

In2D data

Task 2

Out 2D data

HowWhyWhat

EncodeNavigateSelect

DiscoverExploreIdentify

In 2D dataOut ScatterplotOut Clusters amp points

OutScatterplotClusters amp points

Task 3

InScatterplotClusters amp points

OutLabels for clusters

WhyWhat

ProduceAnnotate

In ScatterplotIn Clusters amp pointsOut Labels for clusters

wombat

bull attribute aggregationndash derive low-dimensional target space from high-dimensional measured space

46

Datasets

WhatAttributes

Dataset Types

Data Types

Data and Dataset Types

Tables

Attributes (columns)

Items (rows)

Cell containing value

Networks

Link

Node (item)

Trees

Fields (Continuous)

Geometry (Spatial)

Attributes (columns)

Value in cell

Cell

Multidimensional Table

Value in cell

Items Attributes Links Positions Grids

Attribute Types

Ordering Direction

Categorical

OrderedOrdinal

Quantitative

Sequential

Diverging

Cyclic

Tables Networks amp Trees

Fields Geometry Clusters Sets Lists

Items

Attributes

Items (nodes)

Links

Attributes

Grids

Positions

Attributes

Items

Positions

Items

Grid of positions

Position

Trends

Actions

Analyze

Search

Query

Why

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Network Data

Spatial Data

Topology

Paths

Extremes

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

Identify Compare Summarize

tag

Target known Target unknown

Location knownLocation unknown

Lookup

Locate

Browse

Explore

Targets

Why

What

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

algorithm

idiom

abstraction

domain

More Informationbull this talk

httpwwwcsubcca~tmmtalkshtmlvad15d3

bull book page (including tutorial lecture slides)httpwwwcsubcca~tmmvadbook

ndash 20 promo code for book+ebook combo HVN17

ndash httpwwwcrcpresscomproductisbn9781466508910

ndash illustrations Eamonn Maguire

bull papers videos software talks full courses httpwwwcsubccagroupinfovis httpwwwcsubcca~tmm

47Munzner A K Peters Visualization Series CRC Press Visualization Series 2014

Visualization Analysis and Design

tamaramunzner

Page 20: Visualization Analysis & Designii.uib.no/vis_old/publications/pdfs/2016-01-17--vad15d3unconf--edited-by-HH.pdf · Visualization is suitable when there is a need to augment human capabilities

16

Actions II Search

bull what does user knowndash target location

Search

Target known Target unknown

Location known

Location unknown

Lookup

Locate

Browse

Explore

17

Actions III Query

bull what does user knowndash target location

bull how much of the data mattersndash one some all

bull analyze search queryndash independent choices for each

Search

Query

Identify Compare Summarize

Target known Target unknown

Location known

Location unknown

Lookup

Locate

Browse

Explore

Targets

18

Trends

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Extremes

Network Data

Spatial DataShape

Topology

Paths

19

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

How to encode Arrange space map channels

20

Encode

ArrangeExpress Separate

Order Align

Use

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

Encoding visually

bull analyze idiom structure

21

22

Definitions Marks and channelsbull marks

ndash geometric primitives

bull channelsndash control appearance of marks

Horizontal

Position

Vertical Both

Color

Shape Tilt

Size

Length Area Volume

Points Lines Areas

Encoding visually with marks and channels

bull analyze idiom structurendash as combination of marks and channels

23

1 vertical position

mark line

2 vertical positionhorizontal position

mark point

3 vertical positionhorizontal positioncolor hue

mark point

4 vertical positionhorizontal positioncolor huesize (area)

mark point

24

Channels Expressiveness types and effectiveness rankingsMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

25

Channels Matching TypesMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

bull expressiveness principlendash match channel and data characteristics

26

Channels RankingsMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

bull expressiveness principlendash match channel and data characteristics

bull effectiveness principlendash encode most important attributes with

highest ranked channels

27

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

How to handle complexity 3 more strategies

28

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull change view over timebull facet across multiple

viewsbull reduce itemsattributes

within single viewbull derive new data to

show within view

How to handle complexity 3 more strategies

29

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull change over time- most obvious amp flexible

of the 4 strategies

Idiom Animated transitionsbull smooth transition from one state to another

ndash alternative to jump cutsndash support for item tracking when amount of change is limited

bull example multilevel matrix viewsndash scope of what is shown narrows down

bull middle block stretches to fill space additional structure appears withinbull other blocks squish down to increasingly aggregated representations

30[Using Multilevel Call Matrices in Large Software Projects van Ham Proc IEEE Symp Information Visualization (InfoVis) pp 227ndash232 2003]

How to handle complexity 3 more strategies

31

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull facet data across multiple views

Facet

32

Juxtapose

Partition

Superimpose

Coordinate Multiple Side By Side Views

Share Encoding SameDierent

Share Data AllSubsetNone

Share Navigation

Linked Highlighting

Idiom Linked highlighting

33

System EDVbull see how regions

contiguous in one view are distributed within anotherndash powerful and pervasive

interaction idiom

bull encoding differentndashmultiform

bull data all shared

[Visual Exploration of Large Structured Datasets Wills Proc New Techniques and Trends in Statistics (NTTS) pp 237ndash246 IOS Press 1995]

Idiom birdrsquos-eye maps

34

bull encoding samebull data subset sharedbull navigation shared

ndash bidirectional linking

bull differencesndash viewpointndash (size)

bull overview-detail

System Google Maps

[A Review of Overview+Detail Zooming and Focus+Context Interfaces Cockburn Karlson and Bederson ACM Computing Surveys 411 (2008) 1ndash31]

Idiom Small multiplesbull encoding samebull data none shared

ndash different attributes for node colors

ndash (same network layout)

bull navigation shared

35

System Cerebral

[Cerebral Visualizing Multiple Experimental Conditions on a Graph with Biological Context Barsky Munzner Gardy and Kincaid IEEE Trans Visualization and Computer Graphics (Proc InfoVis 2008) 146 (2008) 1253ndash1260]

Coordinate views Design choice interaction

36

All Subset

Same

Multiform

Multiform Overview

Detail

None

Redundant

No Linkage

Small Multiples

OverviewDetail

bull why juxtapose viewsndash benefits eyes vs memory

bull lower cognitive load to move eyes between 2 views than remembering previous state with single changing view

ndash costs display area 2 views side by side each have only half the area of one view

Partition into views

37

bull how to divide data between viewsndash encodes association between items

using spatial proximity ndash major implications for what patterns

are visiblendash split according to attributes

bull design choicesndash how many splits

bull all the way down one mark per regionbull stop earlier for more complex structure

within region

ndash order in which attribs used to splitndash how many views

Partition into Side-by-Side Views

Partitioning List alignmentbull single bar chart with grouped bars

ndash split by state into regionsbull complex glyph within each region showing all ages

ndash compare easy within state hard across ages

bull small-multiple bar chartsndash split by age into regions

bull one chart per region

ndash compare easy within age harder across states

38

110

100

90

80

70

60

50

40

30

20

10

00 CA TK NY FL IL PA

65 Years and Over45 to 64 Years25 to 44 Years18 to 24 Years14 to 17 Years5 to 13 YearsUnder 5 Years

CA TK NY FL IL PA

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

httpblocksorgmbostock3887051 httpblocksorgmbostock4679202

Partitioning Recursive subdivision

bull split by neighborhoodbull then by type bull then time

ndash years as rowsndash months as columns

bull color by price

bull neighborhood patternsndash where itrsquos expensivendash where you pay much more

for detached type

39[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull switch order of splitsndash type then neighborhood

bull switch colorndash by price variation

bull type patternsndash within specific type which

neighborhoods inconsistent

40[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull different encoding for second-level regionsndash choropleth maps

41[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

How to handle complexity 3 more strategies

42

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull reduce what is shown within single view

Reduce items and attributes

43

bull reduceincrease inversesbull filter

ndash pro straightforward and intuitivebull to understand and compute

ndash con out of sight out of mind

bull aggregationndash pro inform about whole setndash con difficult to avoid losing signal

bull not mutually exclusivendash combine filter aggregatendash combine reduce facet change derive

Reduce

Filter

Aggregate

Embed

Reducing Items and Attributes

FilterItems

Attributes

Aggregate

Items

Attributes

Idiom boxplotbull static item aggregationbull task find distributionbull data tablebull derived data

ndash 5 quant attribsbull median central linebull lower and upper quartile boxesbull lower upper fences whiskers

ndash values beyond which items are outliers

ndash outliers beyond fence cutoffs explicitly shown

44

pod and the rug plot looks like the seeds within Kampstra (2008) also suggests a way of comparing two

groups more easily use the left and right sides of the bean to display different distributions A related idea

is the raindrop plot (Barrowman and Myers 2003) but its focus is on the display of error distributions from

complex models

Figure 4 demonstrates these density boxplots applied to 100 numbers drawn from each of four distribu-

tions with mean 0 and standard deviation 1 a standard normal a skew-right distribution (Johnson distri-

bution with skewness 22 and kurtosis 13) a leptikurtic distribution (Johnson distribution with skewness 0

and kurtosis 20) and a bimodal distribution (two normals with mean -095 and 095 and standard devia-

tion 031) Richer displays of density make it much easier to see important variations in the distribution

multi-modality is particularly important and yet completely invisible with the boxplot

n s k mm

2

02

4

n s k mm

2

02

4

n s k mm

4

2

02

4

4

2

02

4

n s k mm

Figure 4 From left to right box plot vase plot violin plot and bean plot Within each plot the distributions from left to

right are standard normal (n) right-skewed (s) leptikurtic (k) and bimodal (mm) A normal kernel and bandwidth of

02 are used in all plots for all groups

A more sophisticated display is the sectioned density plot (Cohen and Cohen 2006) which uses both

colour and space to stack a density estimate into a smaller area hopefully without losing any information

(not formally verified with a perceptual study) The sectioned density plot is similar in spirit to horizon

graphs for time series (Reijner 2008) which have been found to be just as readable as regular line graphs

despite taking up much less space (Heer et al 2009) The density strips of Jackson (2008) provide a similar

compact display that uses colour instead of width to display density These methods are shown in Figure 5

6

[40 years of boxplots Wickham and Stryjewski 2012 hadconz]

Idiom Dimensionality reduction for documents

45

Task 1

InHD data

Out2D data

ProduceIn High- dimensional data

WhyWhat

Derive

In2D data

Task 2

Out 2D data

HowWhyWhat

EncodeNavigateSelect

DiscoverExploreIdentify

In 2D dataOut ScatterplotOut Clusters amp points

OutScatterplotClusters amp points

Task 3

InScatterplotClusters amp points

OutLabels for clusters

WhyWhat

ProduceAnnotate

In ScatterplotIn Clusters amp pointsOut Labels for clusters

wombat

bull attribute aggregationndash derive low-dimensional target space from high-dimensional measured space

46

Datasets

WhatAttributes

Dataset Types

Data Types

Data and Dataset Types

Tables

Attributes (columns)

Items (rows)

Cell containing value

Networks

Link

Node (item)

Trees

Fields (Continuous)

Geometry (Spatial)

Attributes (columns)

Value in cell

Cell

Multidimensional Table

Value in cell

Items Attributes Links Positions Grids

Attribute Types

Ordering Direction

Categorical

OrderedOrdinal

Quantitative

Sequential

Diverging

Cyclic

Tables Networks amp Trees

Fields Geometry Clusters Sets Lists

Items

Attributes

Items (nodes)

Links

Attributes

Grids

Positions

Attributes

Items

Positions

Items

Grid of positions

Position

Trends

Actions

Analyze

Search

Query

Why

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Network Data

Spatial Data

Topology

Paths

Extremes

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

Identify Compare Summarize

tag

Target known Target unknown

Location knownLocation unknown

Lookup

Locate

Browse

Explore

Targets

Why

What

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

algorithm

idiom

abstraction

domain

More Informationbull this talk

httpwwwcsubcca~tmmtalkshtmlvad15d3

bull book page (including tutorial lecture slides)httpwwwcsubcca~tmmvadbook

ndash 20 promo code for book+ebook combo HVN17

ndash httpwwwcrcpresscomproductisbn9781466508910

ndash illustrations Eamonn Maguire

bull papers videos software talks full courses httpwwwcsubccagroupinfovis httpwwwcsubcca~tmm

47Munzner A K Peters Visualization Series CRC Press Visualization Series 2014

Visualization Analysis and Design

tamaramunzner

Page 21: Visualization Analysis & Designii.uib.no/vis_old/publications/pdfs/2016-01-17--vad15d3unconf--edited-by-HH.pdf · Visualization is suitable when there is a need to augment human capabilities

17

Actions III Query

bull what does user knowndash target location

bull how much of the data mattersndash one some all

bull analyze search queryndash independent choices for each

Search

Query

Identify Compare Summarize

Target known Target unknown

Location known

Location unknown

Lookup

Locate

Browse

Explore

Targets

18

Trends

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Extremes

Network Data

Spatial DataShape

Topology

Paths

19

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

How to encode Arrange space map channels

20

Encode

ArrangeExpress Separate

Order Align

Use

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

Encoding visually

bull analyze idiom structure

21

22

Definitions Marks and channelsbull marks

ndash geometric primitives

bull channelsndash control appearance of marks

Horizontal

Position

Vertical Both

Color

Shape Tilt

Size

Length Area Volume

Points Lines Areas

Encoding visually with marks and channels

bull analyze idiom structurendash as combination of marks and channels

23

1 vertical position

mark line

2 vertical positionhorizontal position

mark point

3 vertical positionhorizontal positioncolor hue

mark point

4 vertical positionhorizontal positioncolor huesize (area)

mark point

24

Channels Expressiveness types and effectiveness rankingsMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

25

Channels Matching TypesMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

bull expressiveness principlendash match channel and data characteristics

26

Channels RankingsMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

bull expressiveness principlendash match channel and data characteristics

bull effectiveness principlendash encode most important attributes with

highest ranked channels

27

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

How to handle complexity 3 more strategies

28

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull change view over timebull facet across multiple

viewsbull reduce itemsattributes

within single viewbull derive new data to

show within view

How to handle complexity 3 more strategies

29

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull change over time- most obvious amp flexible

of the 4 strategies

Idiom Animated transitionsbull smooth transition from one state to another

ndash alternative to jump cutsndash support for item tracking when amount of change is limited

bull example multilevel matrix viewsndash scope of what is shown narrows down

bull middle block stretches to fill space additional structure appears withinbull other blocks squish down to increasingly aggregated representations

30[Using Multilevel Call Matrices in Large Software Projects van Ham Proc IEEE Symp Information Visualization (InfoVis) pp 227ndash232 2003]

How to handle complexity 3 more strategies

31

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull facet data across multiple views

Facet

32

Juxtapose

Partition

Superimpose

Coordinate Multiple Side By Side Views

Share Encoding SameDierent

Share Data AllSubsetNone

Share Navigation

Linked Highlighting

Idiom Linked highlighting

33

System EDVbull see how regions

contiguous in one view are distributed within anotherndash powerful and pervasive

interaction idiom

bull encoding differentndashmultiform

bull data all shared

[Visual Exploration of Large Structured Datasets Wills Proc New Techniques and Trends in Statistics (NTTS) pp 237ndash246 IOS Press 1995]

Idiom birdrsquos-eye maps

34

bull encoding samebull data subset sharedbull navigation shared

ndash bidirectional linking

bull differencesndash viewpointndash (size)

bull overview-detail

System Google Maps

[A Review of Overview+Detail Zooming and Focus+Context Interfaces Cockburn Karlson and Bederson ACM Computing Surveys 411 (2008) 1ndash31]

Idiom Small multiplesbull encoding samebull data none shared

ndash different attributes for node colors

ndash (same network layout)

bull navigation shared

35

System Cerebral

[Cerebral Visualizing Multiple Experimental Conditions on a Graph with Biological Context Barsky Munzner Gardy and Kincaid IEEE Trans Visualization and Computer Graphics (Proc InfoVis 2008) 146 (2008) 1253ndash1260]

Coordinate views Design choice interaction

36

All Subset

Same

Multiform

Multiform Overview

Detail

None

Redundant

No Linkage

Small Multiples

OverviewDetail

bull why juxtapose viewsndash benefits eyes vs memory

bull lower cognitive load to move eyes between 2 views than remembering previous state with single changing view

ndash costs display area 2 views side by side each have only half the area of one view

Partition into views

37

bull how to divide data between viewsndash encodes association between items

using spatial proximity ndash major implications for what patterns

are visiblendash split according to attributes

bull design choicesndash how many splits

bull all the way down one mark per regionbull stop earlier for more complex structure

within region

ndash order in which attribs used to splitndash how many views

Partition into Side-by-Side Views

Partitioning List alignmentbull single bar chart with grouped bars

ndash split by state into regionsbull complex glyph within each region showing all ages

ndash compare easy within state hard across ages

bull small-multiple bar chartsndash split by age into regions

bull one chart per region

ndash compare easy within age harder across states

38

110

100

90

80

70

60

50

40

30

20

10

00 CA TK NY FL IL PA

65 Years and Over45 to 64 Years25 to 44 Years18 to 24 Years14 to 17 Years5 to 13 YearsUnder 5 Years

CA TK NY FL IL PA

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

httpblocksorgmbostock3887051 httpblocksorgmbostock4679202

Partitioning Recursive subdivision

bull split by neighborhoodbull then by type bull then time

ndash years as rowsndash months as columns

bull color by price

bull neighborhood patternsndash where itrsquos expensivendash where you pay much more

for detached type

39[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull switch order of splitsndash type then neighborhood

bull switch colorndash by price variation

bull type patternsndash within specific type which

neighborhoods inconsistent

40[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull different encoding for second-level regionsndash choropleth maps

41[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

How to handle complexity 3 more strategies

42

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull reduce what is shown within single view

Reduce items and attributes

43

bull reduceincrease inversesbull filter

ndash pro straightforward and intuitivebull to understand and compute

ndash con out of sight out of mind

bull aggregationndash pro inform about whole setndash con difficult to avoid losing signal

bull not mutually exclusivendash combine filter aggregatendash combine reduce facet change derive

Reduce

Filter

Aggregate

Embed

Reducing Items and Attributes

FilterItems

Attributes

Aggregate

Items

Attributes

Idiom boxplotbull static item aggregationbull task find distributionbull data tablebull derived data

ndash 5 quant attribsbull median central linebull lower and upper quartile boxesbull lower upper fences whiskers

ndash values beyond which items are outliers

ndash outliers beyond fence cutoffs explicitly shown

44

pod and the rug plot looks like the seeds within Kampstra (2008) also suggests a way of comparing two

groups more easily use the left and right sides of the bean to display different distributions A related idea

is the raindrop plot (Barrowman and Myers 2003) but its focus is on the display of error distributions from

complex models

Figure 4 demonstrates these density boxplots applied to 100 numbers drawn from each of four distribu-

tions with mean 0 and standard deviation 1 a standard normal a skew-right distribution (Johnson distri-

bution with skewness 22 and kurtosis 13) a leptikurtic distribution (Johnson distribution with skewness 0

and kurtosis 20) and a bimodal distribution (two normals with mean -095 and 095 and standard devia-

tion 031) Richer displays of density make it much easier to see important variations in the distribution

multi-modality is particularly important and yet completely invisible with the boxplot

n s k mm

2

02

4

n s k mm

2

02

4

n s k mm

4

2

02

4

4

2

02

4

n s k mm

Figure 4 From left to right box plot vase plot violin plot and bean plot Within each plot the distributions from left to

right are standard normal (n) right-skewed (s) leptikurtic (k) and bimodal (mm) A normal kernel and bandwidth of

02 are used in all plots for all groups

A more sophisticated display is the sectioned density plot (Cohen and Cohen 2006) which uses both

colour and space to stack a density estimate into a smaller area hopefully without losing any information

(not formally verified with a perceptual study) The sectioned density plot is similar in spirit to horizon

graphs for time series (Reijner 2008) which have been found to be just as readable as regular line graphs

despite taking up much less space (Heer et al 2009) The density strips of Jackson (2008) provide a similar

compact display that uses colour instead of width to display density These methods are shown in Figure 5

6

[40 years of boxplots Wickham and Stryjewski 2012 hadconz]

Idiom Dimensionality reduction for documents

45

Task 1

InHD data

Out2D data

ProduceIn High- dimensional data

WhyWhat

Derive

In2D data

Task 2

Out 2D data

HowWhyWhat

EncodeNavigateSelect

DiscoverExploreIdentify

In 2D dataOut ScatterplotOut Clusters amp points

OutScatterplotClusters amp points

Task 3

InScatterplotClusters amp points

OutLabels for clusters

WhyWhat

ProduceAnnotate

In ScatterplotIn Clusters amp pointsOut Labels for clusters

wombat

bull attribute aggregationndash derive low-dimensional target space from high-dimensional measured space

46

Datasets

WhatAttributes

Dataset Types

Data Types

Data and Dataset Types

Tables

Attributes (columns)

Items (rows)

Cell containing value

Networks

Link

Node (item)

Trees

Fields (Continuous)

Geometry (Spatial)

Attributes (columns)

Value in cell

Cell

Multidimensional Table

Value in cell

Items Attributes Links Positions Grids

Attribute Types

Ordering Direction

Categorical

OrderedOrdinal

Quantitative

Sequential

Diverging

Cyclic

Tables Networks amp Trees

Fields Geometry Clusters Sets Lists

Items

Attributes

Items (nodes)

Links

Attributes

Grids

Positions

Attributes

Items

Positions

Items

Grid of positions

Position

Trends

Actions

Analyze

Search

Query

Why

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Network Data

Spatial Data

Topology

Paths

Extremes

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

Identify Compare Summarize

tag

Target known Target unknown

Location knownLocation unknown

Lookup

Locate

Browse

Explore

Targets

Why

What

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

algorithm

idiom

abstraction

domain

More Informationbull this talk

httpwwwcsubcca~tmmtalkshtmlvad15d3

bull book page (including tutorial lecture slides)httpwwwcsubcca~tmmvadbook

ndash 20 promo code for book+ebook combo HVN17

ndash httpwwwcrcpresscomproductisbn9781466508910

ndash illustrations Eamonn Maguire

bull papers videos software talks full courses httpwwwcsubccagroupinfovis httpwwwcsubcca~tmm

47Munzner A K Peters Visualization Series CRC Press Visualization Series 2014

Visualization Analysis and Design

tamaramunzner

Page 22: Visualization Analysis & Designii.uib.no/vis_old/publications/pdfs/2016-01-17--vad15d3unconf--edited-by-HH.pdf · Visualization is suitable when there is a need to augment human capabilities

Targets

18

Trends

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Extremes

Network Data

Spatial DataShape

Topology

Paths

19

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

How to encode Arrange space map channels

20

Encode

ArrangeExpress Separate

Order Align

Use

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

Encoding visually

bull analyze idiom structure

21

22

Definitions Marks and channelsbull marks

ndash geometric primitives

bull channelsndash control appearance of marks

Horizontal

Position

Vertical Both

Color

Shape Tilt

Size

Length Area Volume

Points Lines Areas

Encoding visually with marks and channels

bull analyze idiom structurendash as combination of marks and channels

23

1 vertical position

mark line

2 vertical positionhorizontal position

mark point

3 vertical positionhorizontal positioncolor hue

mark point

4 vertical positionhorizontal positioncolor huesize (area)

mark point

24

Channels Expressiveness types and effectiveness rankingsMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

25

Channels Matching TypesMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

bull expressiveness principlendash match channel and data characteristics

26

Channels RankingsMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

bull expressiveness principlendash match channel and data characteristics

bull effectiveness principlendash encode most important attributes with

highest ranked channels

27

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

How to handle complexity 3 more strategies

28

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull change view over timebull facet across multiple

viewsbull reduce itemsattributes

within single viewbull derive new data to

show within view

How to handle complexity 3 more strategies

29

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull change over time- most obvious amp flexible

of the 4 strategies

Idiom Animated transitionsbull smooth transition from one state to another

ndash alternative to jump cutsndash support for item tracking when amount of change is limited

bull example multilevel matrix viewsndash scope of what is shown narrows down

bull middle block stretches to fill space additional structure appears withinbull other blocks squish down to increasingly aggregated representations

30[Using Multilevel Call Matrices in Large Software Projects van Ham Proc IEEE Symp Information Visualization (InfoVis) pp 227ndash232 2003]

How to handle complexity 3 more strategies

31

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull facet data across multiple views

Facet

32

Juxtapose

Partition

Superimpose

Coordinate Multiple Side By Side Views

Share Encoding SameDierent

Share Data AllSubsetNone

Share Navigation

Linked Highlighting

Idiom Linked highlighting

33

System EDVbull see how regions

contiguous in one view are distributed within anotherndash powerful and pervasive

interaction idiom

bull encoding differentndashmultiform

bull data all shared

[Visual Exploration of Large Structured Datasets Wills Proc New Techniques and Trends in Statistics (NTTS) pp 237ndash246 IOS Press 1995]

Idiom birdrsquos-eye maps

34

bull encoding samebull data subset sharedbull navigation shared

ndash bidirectional linking

bull differencesndash viewpointndash (size)

bull overview-detail

System Google Maps

[A Review of Overview+Detail Zooming and Focus+Context Interfaces Cockburn Karlson and Bederson ACM Computing Surveys 411 (2008) 1ndash31]

Idiom Small multiplesbull encoding samebull data none shared

ndash different attributes for node colors

ndash (same network layout)

bull navigation shared

35

System Cerebral

[Cerebral Visualizing Multiple Experimental Conditions on a Graph with Biological Context Barsky Munzner Gardy and Kincaid IEEE Trans Visualization and Computer Graphics (Proc InfoVis 2008) 146 (2008) 1253ndash1260]

Coordinate views Design choice interaction

36

All Subset

Same

Multiform

Multiform Overview

Detail

None

Redundant

No Linkage

Small Multiples

OverviewDetail

bull why juxtapose viewsndash benefits eyes vs memory

bull lower cognitive load to move eyes between 2 views than remembering previous state with single changing view

ndash costs display area 2 views side by side each have only half the area of one view

Partition into views

37

bull how to divide data between viewsndash encodes association between items

using spatial proximity ndash major implications for what patterns

are visiblendash split according to attributes

bull design choicesndash how many splits

bull all the way down one mark per regionbull stop earlier for more complex structure

within region

ndash order in which attribs used to splitndash how many views

Partition into Side-by-Side Views

Partitioning List alignmentbull single bar chart with grouped bars

ndash split by state into regionsbull complex glyph within each region showing all ages

ndash compare easy within state hard across ages

bull small-multiple bar chartsndash split by age into regions

bull one chart per region

ndash compare easy within age harder across states

38

110

100

90

80

70

60

50

40

30

20

10

00 CA TK NY FL IL PA

65 Years and Over45 to 64 Years25 to 44 Years18 to 24 Years14 to 17 Years5 to 13 YearsUnder 5 Years

CA TK NY FL IL PA

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

httpblocksorgmbostock3887051 httpblocksorgmbostock4679202

Partitioning Recursive subdivision

bull split by neighborhoodbull then by type bull then time

ndash years as rowsndash months as columns

bull color by price

bull neighborhood patternsndash where itrsquos expensivendash where you pay much more

for detached type

39[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull switch order of splitsndash type then neighborhood

bull switch colorndash by price variation

bull type patternsndash within specific type which

neighborhoods inconsistent

40[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull different encoding for second-level regionsndash choropleth maps

41[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

How to handle complexity 3 more strategies

42

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull reduce what is shown within single view

Reduce items and attributes

43

bull reduceincrease inversesbull filter

ndash pro straightforward and intuitivebull to understand and compute

ndash con out of sight out of mind

bull aggregationndash pro inform about whole setndash con difficult to avoid losing signal

bull not mutually exclusivendash combine filter aggregatendash combine reduce facet change derive

Reduce

Filter

Aggregate

Embed

Reducing Items and Attributes

FilterItems

Attributes

Aggregate

Items

Attributes

Idiom boxplotbull static item aggregationbull task find distributionbull data tablebull derived data

ndash 5 quant attribsbull median central linebull lower and upper quartile boxesbull lower upper fences whiskers

ndash values beyond which items are outliers

ndash outliers beyond fence cutoffs explicitly shown

44

pod and the rug plot looks like the seeds within Kampstra (2008) also suggests a way of comparing two

groups more easily use the left and right sides of the bean to display different distributions A related idea

is the raindrop plot (Barrowman and Myers 2003) but its focus is on the display of error distributions from

complex models

Figure 4 demonstrates these density boxplots applied to 100 numbers drawn from each of four distribu-

tions with mean 0 and standard deviation 1 a standard normal a skew-right distribution (Johnson distri-

bution with skewness 22 and kurtosis 13) a leptikurtic distribution (Johnson distribution with skewness 0

and kurtosis 20) and a bimodal distribution (two normals with mean -095 and 095 and standard devia-

tion 031) Richer displays of density make it much easier to see important variations in the distribution

multi-modality is particularly important and yet completely invisible with the boxplot

n s k mm

2

02

4

n s k mm

2

02

4

n s k mm

4

2

02

4

4

2

02

4

n s k mm

Figure 4 From left to right box plot vase plot violin plot and bean plot Within each plot the distributions from left to

right are standard normal (n) right-skewed (s) leptikurtic (k) and bimodal (mm) A normal kernel and bandwidth of

02 are used in all plots for all groups

A more sophisticated display is the sectioned density plot (Cohen and Cohen 2006) which uses both

colour and space to stack a density estimate into a smaller area hopefully without losing any information

(not formally verified with a perceptual study) The sectioned density plot is similar in spirit to horizon

graphs for time series (Reijner 2008) which have been found to be just as readable as regular line graphs

despite taking up much less space (Heer et al 2009) The density strips of Jackson (2008) provide a similar

compact display that uses colour instead of width to display density These methods are shown in Figure 5

6

[40 years of boxplots Wickham and Stryjewski 2012 hadconz]

Idiom Dimensionality reduction for documents

45

Task 1

InHD data

Out2D data

ProduceIn High- dimensional data

WhyWhat

Derive

In2D data

Task 2

Out 2D data

HowWhyWhat

EncodeNavigateSelect

DiscoverExploreIdentify

In 2D dataOut ScatterplotOut Clusters amp points

OutScatterplotClusters amp points

Task 3

InScatterplotClusters amp points

OutLabels for clusters

WhyWhat

ProduceAnnotate

In ScatterplotIn Clusters amp pointsOut Labels for clusters

wombat

bull attribute aggregationndash derive low-dimensional target space from high-dimensional measured space

46

Datasets

WhatAttributes

Dataset Types

Data Types

Data and Dataset Types

Tables

Attributes (columns)

Items (rows)

Cell containing value

Networks

Link

Node (item)

Trees

Fields (Continuous)

Geometry (Spatial)

Attributes (columns)

Value in cell

Cell

Multidimensional Table

Value in cell

Items Attributes Links Positions Grids

Attribute Types

Ordering Direction

Categorical

OrderedOrdinal

Quantitative

Sequential

Diverging

Cyclic

Tables Networks amp Trees

Fields Geometry Clusters Sets Lists

Items

Attributes

Items (nodes)

Links

Attributes

Grids

Positions

Attributes

Items

Positions

Items

Grid of positions

Position

Trends

Actions

Analyze

Search

Query

Why

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Network Data

Spatial Data

Topology

Paths

Extremes

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

Identify Compare Summarize

tag

Target known Target unknown

Location knownLocation unknown

Lookup

Locate

Browse

Explore

Targets

Why

What

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

algorithm

idiom

abstraction

domain

More Informationbull this talk

httpwwwcsubcca~tmmtalkshtmlvad15d3

bull book page (including tutorial lecture slides)httpwwwcsubcca~tmmvadbook

ndash 20 promo code for book+ebook combo HVN17

ndash httpwwwcrcpresscomproductisbn9781466508910

ndash illustrations Eamonn Maguire

bull papers videos software talks full courses httpwwwcsubccagroupinfovis httpwwwcsubcca~tmm

47Munzner A K Peters Visualization Series CRC Press Visualization Series 2014

Visualization Analysis and Design

tamaramunzner

Page 23: Visualization Analysis & Designii.uib.no/vis_old/publications/pdfs/2016-01-17--vad15d3unconf--edited-by-HH.pdf · Visualization is suitable when there is a need to augment human capabilities

19

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

How to encode Arrange space map channels

20

Encode

ArrangeExpress Separate

Order Align

Use

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

Encoding visually

bull analyze idiom structure

21

22

Definitions Marks and channelsbull marks

ndash geometric primitives

bull channelsndash control appearance of marks

Horizontal

Position

Vertical Both

Color

Shape Tilt

Size

Length Area Volume

Points Lines Areas

Encoding visually with marks and channels

bull analyze idiom structurendash as combination of marks and channels

23

1 vertical position

mark line

2 vertical positionhorizontal position

mark point

3 vertical positionhorizontal positioncolor hue

mark point

4 vertical positionhorizontal positioncolor huesize (area)

mark point

24

Channels Expressiveness types and effectiveness rankingsMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

25

Channels Matching TypesMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

bull expressiveness principlendash match channel and data characteristics

26

Channels RankingsMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

bull expressiveness principlendash match channel and data characteristics

bull effectiveness principlendash encode most important attributes with

highest ranked channels

27

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

How to handle complexity 3 more strategies

28

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull change view over timebull facet across multiple

viewsbull reduce itemsattributes

within single viewbull derive new data to

show within view

How to handle complexity 3 more strategies

29

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull change over time- most obvious amp flexible

of the 4 strategies

Idiom Animated transitionsbull smooth transition from one state to another

ndash alternative to jump cutsndash support for item tracking when amount of change is limited

bull example multilevel matrix viewsndash scope of what is shown narrows down

bull middle block stretches to fill space additional structure appears withinbull other blocks squish down to increasingly aggregated representations

30[Using Multilevel Call Matrices in Large Software Projects van Ham Proc IEEE Symp Information Visualization (InfoVis) pp 227ndash232 2003]

How to handle complexity 3 more strategies

31

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull facet data across multiple views

Facet

32

Juxtapose

Partition

Superimpose

Coordinate Multiple Side By Side Views

Share Encoding SameDierent

Share Data AllSubsetNone

Share Navigation

Linked Highlighting

Idiom Linked highlighting

33

System EDVbull see how regions

contiguous in one view are distributed within anotherndash powerful and pervasive

interaction idiom

bull encoding differentndashmultiform

bull data all shared

[Visual Exploration of Large Structured Datasets Wills Proc New Techniques and Trends in Statistics (NTTS) pp 237ndash246 IOS Press 1995]

Idiom birdrsquos-eye maps

34

bull encoding samebull data subset sharedbull navigation shared

ndash bidirectional linking

bull differencesndash viewpointndash (size)

bull overview-detail

System Google Maps

[A Review of Overview+Detail Zooming and Focus+Context Interfaces Cockburn Karlson and Bederson ACM Computing Surveys 411 (2008) 1ndash31]

Idiom Small multiplesbull encoding samebull data none shared

ndash different attributes for node colors

ndash (same network layout)

bull navigation shared

35

System Cerebral

[Cerebral Visualizing Multiple Experimental Conditions on a Graph with Biological Context Barsky Munzner Gardy and Kincaid IEEE Trans Visualization and Computer Graphics (Proc InfoVis 2008) 146 (2008) 1253ndash1260]

Coordinate views Design choice interaction

36

All Subset

Same

Multiform

Multiform Overview

Detail

None

Redundant

No Linkage

Small Multiples

OverviewDetail

bull why juxtapose viewsndash benefits eyes vs memory

bull lower cognitive load to move eyes between 2 views than remembering previous state with single changing view

ndash costs display area 2 views side by side each have only half the area of one view

Partition into views

37

bull how to divide data between viewsndash encodes association between items

using spatial proximity ndash major implications for what patterns

are visiblendash split according to attributes

bull design choicesndash how many splits

bull all the way down one mark per regionbull stop earlier for more complex structure

within region

ndash order in which attribs used to splitndash how many views

Partition into Side-by-Side Views

Partitioning List alignmentbull single bar chart with grouped bars

ndash split by state into regionsbull complex glyph within each region showing all ages

ndash compare easy within state hard across ages

bull small-multiple bar chartsndash split by age into regions

bull one chart per region

ndash compare easy within age harder across states

38

110

100

90

80

70

60

50

40

30

20

10

00 CA TK NY FL IL PA

65 Years and Over45 to 64 Years25 to 44 Years18 to 24 Years14 to 17 Years5 to 13 YearsUnder 5 Years

CA TK NY FL IL PA

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

httpblocksorgmbostock3887051 httpblocksorgmbostock4679202

Partitioning Recursive subdivision

bull split by neighborhoodbull then by type bull then time

ndash years as rowsndash months as columns

bull color by price

bull neighborhood patternsndash where itrsquos expensivendash where you pay much more

for detached type

39[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull switch order of splitsndash type then neighborhood

bull switch colorndash by price variation

bull type patternsndash within specific type which

neighborhoods inconsistent

40[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull different encoding for second-level regionsndash choropleth maps

41[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

How to handle complexity 3 more strategies

42

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull reduce what is shown within single view

Reduce items and attributes

43

bull reduceincrease inversesbull filter

ndash pro straightforward and intuitivebull to understand and compute

ndash con out of sight out of mind

bull aggregationndash pro inform about whole setndash con difficult to avoid losing signal

bull not mutually exclusivendash combine filter aggregatendash combine reduce facet change derive

Reduce

Filter

Aggregate

Embed

Reducing Items and Attributes

FilterItems

Attributes

Aggregate

Items

Attributes

Idiom boxplotbull static item aggregationbull task find distributionbull data tablebull derived data

ndash 5 quant attribsbull median central linebull lower and upper quartile boxesbull lower upper fences whiskers

ndash values beyond which items are outliers

ndash outliers beyond fence cutoffs explicitly shown

44

pod and the rug plot looks like the seeds within Kampstra (2008) also suggests a way of comparing two

groups more easily use the left and right sides of the bean to display different distributions A related idea

is the raindrop plot (Barrowman and Myers 2003) but its focus is on the display of error distributions from

complex models

Figure 4 demonstrates these density boxplots applied to 100 numbers drawn from each of four distribu-

tions with mean 0 and standard deviation 1 a standard normal a skew-right distribution (Johnson distri-

bution with skewness 22 and kurtosis 13) a leptikurtic distribution (Johnson distribution with skewness 0

and kurtosis 20) and a bimodal distribution (two normals with mean -095 and 095 and standard devia-

tion 031) Richer displays of density make it much easier to see important variations in the distribution

multi-modality is particularly important and yet completely invisible with the boxplot

n s k mm

2

02

4

n s k mm

2

02

4

n s k mm

4

2

02

4

4

2

02

4

n s k mm

Figure 4 From left to right box plot vase plot violin plot and bean plot Within each plot the distributions from left to

right are standard normal (n) right-skewed (s) leptikurtic (k) and bimodal (mm) A normal kernel and bandwidth of

02 are used in all plots for all groups

A more sophisticated display is the sectioned density plot (Cohen and Cohen 2006) which uses both

colour and space to stack a density estimate into a smaller area hopefully without losing any information

(not formally verified with a perceptual study) The sectioned density plot is similar in spirit to horizon

graphs for time series (Reijner 2008) which have been found to be just as readable as regular line graphs

despite taking up much less space (Heer et al 2009) The density strips of Jackson (2008) provide a similar

compact display that uses colour instead of width to display density These methods are shown in Figure 5

6

[40 years of boxplots Wickham and Stryjewski 2012 hadconz]

Idiom Dimensionality reduction for documents

45

Task 1

InHD data

Out2D data

ProduceIn High- dimensional data

WhyWhat

Derive

In2D data

Task 2

Out 2D data

HowWhyWhat

EncodeNavigateSelect

DiscoverExploreIdentify

In 2D dataOut ScatterplotOut Clusters amp points

OutScatterplotClusters amp points

Task 3

InScatterplotClusters amp points

OutLabels for clusters

WhyWhat

ProduceAnnotate

In ScatterplotIn Clusters amp pointsOut Labels for clusters

wombat

bull attribute aggregationndash derive low-dimensional target space from high-dimensional measured space

46

Datasets

WhatAttributes

Dataset Types

Data Types

Data and Dataset Types

Tables

Attributes (columns)

Items (rows)

Cell containing value

Networks

Link

Node (item)

Trees

Fields (Continuous)

Geometry (Spatial)

Attributes (columns)

Value in cell

Cell

Multidimensional Table

Value in cell

Items Attributes Links Positions Grids

Attribute Types

Ordering Direction

Categorical

OrderedOrdinal

Quantitative

Sequential

Diverging

Cyclic

Tables Networks amp Trees

Fields Geometry Clusters Sets Lists

Items

Attributes

Items (nodes)

Links

Attributes

Grids

Positions

Attributes

Items

Positions

Items

Grid of positions

Position

Trends

Actions

Analyze

Search

Query

Why

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Network Data

Spatial Data

Topology

Paths

Extremes

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

Identify Compare Summarize

tag

Target known Target unknown

Location knownLocation unknown

Lookup

Locate

Browse

Explore

Targets

Why

What

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

algorithm

idiom

abstraction

domain

More Informationbull this talk

httpwwwcsubcca~tmmtalkshtmlvad15d3

bull book page (including tutorial lecture slides)httpwwwcsubcca~tmmvadbook

ndash 20 promo code for book+ebook combo HVN17

ndash httpwwwcrcpresscomproductisbn9781466508910

ndash illustrations Eamonn Maguire

bull papers videos software talks full courses httpwwwcsubccagroupinfovis httpwwwcsubcca~tmm

47Munzner A K Peters Visualization Series CRC Press Visualization Series 2014

Visualization Analysis and Design

tamaramunzner

Page 24: Visualization Analysis & Designii.uib.no/vis_old/publications/pdfs/2016-01-17--vad15d3unconf--edited-by-HH.pdf · Visualization is suitable when there is a need to augment human capabilities

How to encode Arrange space map channels

20

Encode

ArrangeExpress Separate

Order Align

Use

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

Encoding visually

bull analyze idiom structure

21

22

Definitions Marks and channelsbull marks

ndash geometric primitives

bull channelsndash control appearance of marks

Horizontal

Position

Vertical Both

Color

Shape Tilt

Size

Length Area Volume

Points Lines Areas

Encoding visually with marks and channels

bull analyze idiom structurendash as combination of marks and channels

23

1 vertical position

mark line

2 vertical positionhorizontal position

mark point

3 vertical positionhorizontal positioncolor hue

mark point

4 vertical positionhorizontal positioncolor huesize (area)

mark point

24

Channels Expressiveness types and effectiveness rankingsMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

25

Channels Matching TypesMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

bull expressiveness principlendash match channel and data characteristics

26

Channels RankingsMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

bull expressiveness principlendash match channel and data characteristics

bull effectiveness principlendash encode most important attributes with

highest ranked channels

27

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

How to handle complexity 3 more strategies

28

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull change view over timebull facet across multiple

viewsbull reduce itemsattributes

within single viewbull derive new data to

show within view

How to handle complexity 3 more strategies

29

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull change over time- most obvious amp flexible

of the 4 strategies

Idiom Animated transitionsbull smooth transition from one state to another

ndash alternative to jump cutsndash support for item tracking when amount of change is limited

bull example multilevel matrix viewsndash scope of what is shown narrows down

bull middle block stretches to fill space additional structure appears withinbull other blocks squish down to increasingly aggregated representations

30[Using Multilevel Call Matrices in Large Software Projects van Ham Proc IEEE Symp Information Visualization (InfoVis) pp 227ndash232 2003]

How to handle complexity 3 more strategies

31

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull facet data across multiple views

Facet

32

Juxtapose

Partition

Superimpose

Coordinate Multiple Side By Side Views

Share Encoding SameDierent

Share Data AllSubsetNone

Share Navigation

Linked Highlighting

Idiom Linked highlighting

33

System EDVbull see how regions

contiguous in one view are distributed within anotherndash powerful and pervasive

interaction idiom

bull encoding differentndashmultiform

bull data all shared

[Visual Exploration of Large Structured Datasets Wills Proc New Techniques and Trends in Statistics (NTTS) pp 237ndash246 IOS Press 1995]

Idiom birdrsquos-eye maps

34

bull encoding samebull data subset sharedbull navigation shared

ndash bidirectional linking

bull differencesndash viewpointndash (size)

bull overview-detail

System Google Maps

[A Review of Overview+Detail Zooming and Focus+Context Interfaces Cockburn Karlson and Bederson ACM Computing Surveys 411 (2008) 1ndash31]

Idiom Small multiplesbull encoding samebull data none shared

ndash different attributes for node colors

ndash (same network layout)

bull navigation shared

35

System Cerebral

[Cerebral Visualizing Multiple Experimental Conditions on a Graph with Biological Context Barsky Munzner Gardy and Kincaid IEEE Trans Visualization and Computer Graphics (Proc InfoVis 2008) 146 (2008) 1253ndash1260]

Coordinate views Design choice interaction

36

All Subset

Same

Multiform

Multiform Overview

Detail

None

Redundant

No Linkage

Small Multiples

OverviewDetail

bull why juxtapose viewsndash benefits eyes vs memory

bull lower cognitive load to move eyes between 2 views than remembering previous state with single changing view

ndash costs display area 2 views side by side each have only half the area of one view

Partition into views

37

bull how to divide data between viewsndash encodes association between items

using spatial proximity ndash major implications for what patterns

are visiblendash split according to attributes

bull design choicesndash how many splits

bull all the way down one mark per regionbull stop earlier for more complex structure

within region

ndash order in which attribs used to splitndash how many views

Partition into Side-by-Side Views

Partitioning List alignmentbull single bar chart with grouped bars

ndash split by state into regionsbull complex glyph within each region showing all ages

ndash compare easy within state hard across ages

bull small-multiple bar chartsndash split by age into regions

bull one chart per region

ndash compare easy within age harder across states

38

110

100

90

80

70

60

50

40

30

20

10

00 CA TK NY FL IL PA

65 Years and Over45 to 64 Years25 to 44 Years18 to 24 Years14 to 17 Years5 to 13 YearsUnder 5 Years

CA TK NY FL IL PA

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

httpblocksorgmbostock3887051 httpblocksorgmbostock4679202

Partitioning Recursive subdivision

bull split by neighborhoodbull then by type bull then time

ndash years as rowsndash months as columns

bull color by price

bull neighborhood patternsndash where itrsquos expensivendash where you pay much more

for detached type

39[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull switch order of splitsndash type then neighborhood

bull switch colorndash by price variation

bull type patternsndash within specific type which

neighborhoods inconsistent

40[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull different encoding for second-level regionsndash choropleth maps

41[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

How to handle complexity 3 more strategies

42

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull reduce what is shown within single view

Reduce items and attributes

43

bull reduceincrease inversesbull filter

ndash pro straightforward and intuitivebull to understand and compute

ndash con out of sight out of mind

bull aggregationndash pro inform about whole setndash con difficult to avoid losing signal

bull not mutually exclusivendash combine filter aggregatendash combine reduce facet change derive

Reduce

Filter

Aggregate

Embed

Reducing Items and Attributes

FilterItems

Attributes

Aggregate

Items

Attributes

Idiom boxplotbull static item aggregationbull task find distributionbull data tablebull derived data

ndash 5 quant attribsbull median central linebull lower and upper quartile boxesbull lower upper fences whiskers

ndash values beyond which items are outliers

ndash outliers beyond fence cutoffs explicitly shown

44

pod and the rug plot looks like the seeds within Kampstra (2008) also suggests a way of comparing two

groups more easily use the left and right sides of the bean to display different distributions A related idea

is the raindrop plot (Barrowman and Myers 2003) but its focus is on the display of error distributions from

complex models

Figure 4 demonstrates these density boxplots applied to 100 numbers drawn from each of four distribu-

tions with mean 0 and standard deviation 1 a standard normal a skew-right distribution (Johnson distri-

bution with skewness 22 and kurtosis 13) a leptikurtic distribution (Johnson distribution with skewness 0

and kurtosis 20) and a bimodal distribution (two normals with mean -095 and 095 and standard devia-

tion 031) Richer displays of density make it much easier to see important variations in the distribution

multi-modality is particularly important and yet completely invisible with the boxplot

n s k mm

2

02

4

n s k mm

2

02

4

n s k mm

4

2

02

4

4

2

02

4

n s k mm

Figure 4 From left to right box plot vase plot violin plot and bean plot Within each plot the distributions from left to

right are standard normal (n) right-skewed (s) leptikurtic (k) and bimodal (mm) A normal kernel and bandwidth of

02 are used in all plots for all groups

A more sophisticated display is the sectioned density plot (Cohen and Cohen 2006) which uses both

colour and space to stack a density estimate into a smaller area hopefully without losing any information

(not formally verified with a perceptual study) The sectioned density plot is similar in spirit to horizon

graphs for time series (Reijner 2008) which have been found to be just as readable as regular line graphs

despite taking up much less space (Heer et al 2009) The density strips of Jackson (2008) provide a similar

compact display that uses colour instead of width to display density These methods are shown in Figure 5

6

[40 years of boxplots Wickham and Stryjewski 2012 hadconz]

Idiom Dimensionality reduction for documents

45

Task 1

InHD data

Out2D data

ProduceIn High- dimensional data

WhyWhat

Derive

In2D data

Task 2

Out 2D data

HowWhyWhat

EncodeNavigateSelect

DiscoverExploreIdentify

In 2D dataOut ScatterplotOut Clusters amp points

OutScatterplotClusters amp points

Task 3

InScatterplotClusters amp points

OutLabels for clusters

WhyWhat

ProduceAnnotate

In ScatterplotIn Clusters amp pointsOut Labels for clusters

wombat

bull attribute aggregationndash derive low-dimensional target space from high-dimensional measured space

46

Datasets

WhatAttributes

Dataset Types

Data Types

Data and Dataset Types

Tables

Attributes (columns)

Items (rows)

Cell containing value

Networks

Link

Node (item)

Trees

Fields (Continuous)

Geometry (Spatial)

Attributes (columns)

Value in cell

Cell

Multidimensional Table

Value in cell

Items Attributes Links Positions Grids

Attribute Types

Ordering Direction

Categorical

OrderedOrdinal

Quantitative

Sequential

Diverging

Cyclic

Tables Networks amp Trees

Fields Geometry Clusters Sets Lists

Items

Attributes

Items (nodes)

Links

Attributes

Grids

Positions

Attributes

Items

Positions

Items

Grid of positions

Position

Trends

Actions

Analyze

Search

Query

Why

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Network Data

Spatial Data

Topology

Paths

Extremes

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

Identify Compare Summarize

tag

Target known Target unknown

Location knownLocation unknown

Lookup

Locate

Browse

Explore

Targets

Why

What

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

algorithm

idiom

abstraction

domain

More Informationbull this talk

httpwwwcsubcca~tmmtalkshtmlvad15d3

bull book page (including tutorial lecture slides)httpwwwcsubcca~tmmvadbook

ndash 20 promo code for book+ebook combo HVN17

ndash httpwwwcrcpresscomproductisbn9781466508910

ndash illustrations Eamonn Maguire

bull papers videos software talks full courses httpwwwcsubccagroupinfovis httpwwwcsubcca~tmm

47Munzner A K Peters Visualization Series CRC Press Visualization Series 2014

Visualization Analysis and Design

tamaramunzner

Page 25: Visualization Analysis & Designii.uib.no/vis_old/publications/pdfs/2016-01-17--vad15d3unconf--edited-by-HH.pdf · Visualization is suitable when there is a need to augment human capabilities

Encoding visually

bull analyze idiom structure

21

22

Definitions Marks and channelsbull marks

ndash geometric primitives

bull channelsndash control appearance of marks

Horizontal

Position

Vertical Both

Color

Shape Tilt

Size

Length Area Volume

Points Lines Areas

Encoding visually with marks and channels

bull analyze idiom structurendash as combination of marks and channels

23

1 vertical position

mark line

2 vertical positionhorizontal position

mark point

3 vertical positionhorizontal positioncolor hue

mark point

4 vertical positionhorizontal positioncolor huesize (area)

mark point

24

Channels Expressiveness types and effectiveness rankingsMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

25

Channels Matching TypesMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

bull expressiveness principlendash match channel and data characteristics

26

Channels RankingsMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

bull expressiveness principlendash match channel and data characteristics

bull effectiveness principlendash encode most important attributes with

highest ranked channels

27

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

How to handle complexity 3 more strategies

28

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull change view over timebull facet across multiple

viewsbull reduce itemsattributes

within single viewbull derive new data to

show within view

How to handle complexity 3 more strategies

29

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull change over time- most obvious amp flexible

of the 4 strategies

Idiom Animated transitionsbull smooth transition from one state to another

ndash alternative to jump cutsndash support for item tracking when amount of change is limited

bull example multilevel matrix viewsndash scope of what is shown narrows down

bull middle block stretches to fill space additional structure appears withinbull other blocks squish down to increasingly aggregated representations

30[Using Multilevel Call Matrices in Large Software Projects van Ham Proc IEEE Symp Information Visualization (InfoVis) pp 227ndash232 2003]

How to handle complexity 3 more strategies

31

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull facet data across multiple views

Facet

32

Juxtapose

Partition

Superimpose

Coordinate Multiple Side By Side Views

Share Encoding SameDierent

Share Data AllSubsetNone

Share Navigation

Linked Highlighting

Idiom Linked highlighting

33

System EDVbull see how regions

contiguous in one view are distributed within anotherndash powerful and pervasive

interaction idiom

bull encoding differentndashmultiform

bull data all shared

[Visual Exploration of Large Structured Datasets Wills Proc New Techniques and Trends in Statistics (NTTS) pp 237ndash246 IOS Press 1995]

Idiom birdrsquos-eye maps

34

bull encoding samebull data subset sharedbull navigation shared

ndash bidirectional linking

bull differencesndash viewpointndash (size)

bull overview-detail

System Google Maps

[A Review of Overview+Detail Zooming and Focus+Context Interfaces Cockburn Karlson and Bederson ACM Computing Surveys 411 (2008) 1ndash31]

Idiom Small multiplesbull encoding samebull data none shared

ndash different attributes for node colors

ndash (same network layout)

bull navigation shared

35

System Cerebral

[Cerebral Visualizing Multiple Experimental Conditions on a Graph with Biological Context Barsky Munzner Gardy and Kincaid IEEE Trans Visualization and Computer Graphics (Proc InfoVis 2008) 146 (2008) 1253ndash1260]

Coordinate views Design choice interaction

36

All Subset

Same

Multiform

Multiform Overview

Detail

None

Redundant

No Linkage

Small Multiples

OverviewDetail

bull why juxtapose viewsndash benefits eyes vs memory

bull lower cognitive load to move eyes between 2 views than remembering previous state with single changing view

ndash costs display area 2 views side by side each have only half the area of one view

Partition into views

37

bull how to divide data between viewsndash encodes association between items

using spatial proximity ndash major implications for what patterns

are visiblendash split according to attributes

bull design choicesndash how many splits

bull all the way down one mark per regionbull stop earlier for more complex structure

within region

ndash order in which attribs used to splitndash how many views

Partition into Side-by-Side Views

Partitioning List alignmentbull single bar chart with grouped bars

ndash split by state into regionsbull complex glyph within each region showing all ages

ndash compare easy within state hard across ages

bull small-multiple bar chartsndash split by age into regions

bull one chart per region

ndash compare easy within age harder across states

38

110

100

90

80

70

60

50

40

30

20

10

00 CA TK NY FL IL PA

65 Years and Over45 to 64 Years25 to 44 Years18 to 24 Years14 to 17 Years5 to 13 YearsUnder 5 Years

CA TK NY FL IL PA

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

httpblocksorgmbostock3887051 httpblocksorgmbostock4679202

Partitioning Recursive subdivision

bull split by neighborhoodbull then by type bull then time

ndash years as rowsndash months as columns

bull color by price

bull neighborhood patternsndash where itrsquos expensivendash where you pay much more

for detached type

39[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull switch order of splitsndash type then neighborhood

bull switch colorndash by price variation

bull type patternsndash within specific type which

neighborhoods inconsistent

40[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull different encoding for second-level regionsndash choropleth maps

41[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

How to handle complexity 3 more strategies

42

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull reduce what is shown within single view

Reduce items and attributes

43

bull reduceincrease inversesbull filter

ndash pro straightforward and intuitivebull to understand and compute

ndash con out of sight out of mind

bull aggregationndash pro inform about whole setndash con difficult to avoid losing signal

bull not mutually exclusivendash combine filter aggregatendash combine reduce facet change derive

Reduce

Filter

Aggregate

Embed

Reducing Items and Attributes

FilterItems

Attributes

Aggregate

Items

Attributes

Idiom boxplotbull static item aggregationbull task find distributionbull data tablebull derived data

ndash 5 quant attribsbull median central linebull lower and upper quartile boxesbull lower upper fences whiskers

ndash values beyond which items are outliers

ndash outliers beyond fence cutoffs explicitly shown

44

pod and the rug plot looks like the seeds within Kampstra (2008) also suggests a way of comparing two

groups more easily use the left and right sides of the bean to display different distributions A related idea

is the raindrop plot (Barrowman and Myers 2003) but its focus is on the display of error distributions from

complex models

Figure 4 demonstrates these density boxplots applied to 100 numbers drawn from each of four distribu-

tions with mean 0 and standard deviation 1 a standard normal a skew-right distribution (Johnson distri-

bution with skewness 22 and kurtosis 13) a leptikurtic distribution (Johnson distribution with skewness 0

and kurtosis 20) and a bimodal distribution (two normals with mean -095 and 095 and standard devia-

tion 031) Richer displays of density make it much easier to see important variations in the distribution

multi-modality is particularly important and yet completely invisible with the boxplot

n s k mm

2

02

4

n s k mm

2

02

4

n s k mm

4

2

02

4

4

2

02

4

n s k mm

Figure 4 From left to right box plot vase plot violin plot and bean plot Within each plot the distributions from left to

right are standard normal (n) right-skewed (s) leptikurtic (k) and bimodal (mm) A normal kernel and bandwidth of

02 are used in all plots for all groups

A more sophisticated display is the sectioned density plot (Cohen and Cohen 2006) which uses both

colour and space to stack a density estimate into a smaller area hopefully without losing any information

(not formally verified with a perceptual study) The sectioned density plot is similar in spirit to horizon

graphs for time series (Reijner 2008) which have been found to be just as readable as regular line graphs

despite taking up much less space (Heer et al 2009) The density strips of Jackson (2008) provide a similar

compact display that uses colour instead of width to display density These methods are shown in Figure 5

6

[40 years of boxplots Wickham and Stryjewski 2012 hadconz]

Idiom Dimensionality reduction for documents

45

Task 1

InHD data

Out2D data

ProduceIn High- dimensional data

WhyWhat

Derive

In2D data

Task 2

Out 2D data

HowWhyWhat

EncodeNavigateSelect

DiscoverExploreIdentify

In 2D dataOut ScatterplotOut Clusters amp points

OutScatterplotClusters amp points

Task 3

InScatterplotClusters amp points

OutLabels for clusters

WhyWhat

ProduceAnnotate

In ScatterplotIn Clusters amp pointsOut Labels for clusters

wombat

bull attribute aggregationndash derive low-dimensional target space from high-dimensional measured space

46

Datasets

WhatAttributes

Dataset Types

Data Types

Data and Dataset Types

Tables

Attributes (columns)

Items (rows)

Cell containing value

Networks

Link

Node (item)

Trees

Fields (Continuous)

Geometry (Spatial)

Attributes (columns)

Value in cell

Cell

Multidimensional Table

Value in cell

Items Attributes Links Positions Grids

Attribute Types

Ordering Direction

Categorical

OrderedOrdinal

Quantitative

Sequential

Diverging

Cyclic

Tables Networks amp Trees

Fields Geometry Clusters Sets Lists

Items

Attributes

Items (nodes)

Links

Attributes

Grids

Positions

Attributes

Items

Positions

Items

Grid of positions

Position

Trends

Actions

Analyze

Search

Query

Why

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Network Data

Spatial Data

Topology

Paths

Extremes

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

Identify Compare Summarize

tag

Target known Target unknown

Location knownLocation unknown

Lookup

Locate

Browse

Explore

Targets

Why

What

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

algorithm

idiom

abstraction

domain

More Informationbull this talk

httpwwwcsubcca~tmmtalkshtmlvad15d3

bull book page (including tutorial lecture slides)httpwwwcsubcca~tmmvadbook

ndash 20 promo code for book+ebook combo HVN17

ndash httpwwwcrcpresscomproductisbn9781466508910

ndash illustrations Eamonn Maguire

bull papers videos software talks full courses httpwwwcsubccagroupinfovis httpwwwcsubcca~tmm

47Munzner A K Peters Visualization Series CRC Press Visualization Series 2014

Visualization Analysis and Design

tamaramunzner

Page 26: Visualization Analysis & Designii.uib.no/vis_old/publications/pdfs/2016-01-17--vad15d3unconf--edited-by-HH.pdf · Visualization is suitable when there is a need to augment human capabilities

22

Definitions Marks and channelsbull marks

ndash geometric primitives

bull channelsndash control appearance of marks

Horizontal

Position

Vertical Both

Color

Shape Tilt

Size

Length Area Volume

Points Lines Areas

Encoding visually with marks and channels

bull analyze idiom structurendash as combination of marks and channels

23

1 vertical position

mark line

2 vertical positionhorizontal position

mark point

3 vertical positionhorizontal positioncolor hue

mark point

4 vertical positionhorizontal positioncolor huesize (area)

mark point

24

Channels Expressiveness types and effectiveness rankingsMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

25

Channels Matching TypesMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

bull expressiveness principlendash match channel and data characteristics

26

Channels RankingsMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

bull expressiveness principlendash match channel and data characteristics

bull effectiveness principlendash encode most important attributes with

highest ranked channels

27

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

How to handle complexity 3 more strategies

28

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull change view over timebull facet across multiple

viewsbull reduce itemsattributes

within single viewbull derive new data to

show within view

How to handle complexity 3 more strategies

29

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull change over time- most obvious amp flexible

of the 4 strategies

Idiom Animated transitionsbull smooth transition from one state to another

ndash alternative to jump cutsndash support for item tracking when amount of change is limited

bull example multilevel matrix viewsndash scope of what is shown narrows down

bull middle block stretches to fill space additional structure appears withinbull other blocks squish down to increasingly aggregated representations

30[Using Multilevel Call Matrices in Large Software Projects van Ham Proc IEEE Symp Information Visualization (InfoVis) pp 227ndash232 2003]

How to handle complexity 3 more strategies

31

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull facet data across multiple views

Facet

32

Juxtapose

Partition

Superimpose

Coordinate Multiple Side By Side Views

Share Encoding SameDierent

Share Data AllSubsetNone

Share Navigation

Linked Highlighting

Idiom Linked highlighting

33

System EDVbull see how regions

contiguous in one view are distributed within anotherndash powerful and pervasive

interaction idiom

bull encoding differentndashmultiform

bull data all shared

[Visual Exploration of Large Structured Datasets Wills Proc New Techniques and Trends in Statistics (NTTS) pp 237ndash246 IOS Press 1995]

Idiom birdrsquos-eye maps

34

bull encoding samebull data subset sharedbull navigation shared

ndash bidirectional linking

bull differencesndash viewpointndash (size)

bull overview-detail

System Google Maps

[A Review of Overview+Detail Zooming and Focus+Context Interfaces Cockburn Karlson and Bederson ACM Computing Surveys 411 (2008) 1ndash31]

Idiom Small multiplesbull encoding samebull data none shared

ndash different attributes for node colors

ndash (same network layout)

bull navigation shared

35

System Cerebral

[Cerebral Visualizing Multiple Experimental Conditions on a Graph with Biological Context Barsky Munzner Gardy and Kincaid IEEE Trans Visualization and Computer Graphics (Proc InfoVis 2008) 146 (2008) 1253ndash1260]

Coordinate views Design choice interaction

36

All Subset

Same

Multiform

Multiform Overview

Detail

None

Redundant

No Linkage

Small Multiples

OverviewDetail

bull why juxtapose viewsndash benefits eyes vs memory

bull lower cognitive load to move eyes between 2 views than remembering previous state with single changing view

ndash costs display area 2 views side by side each have only half the area of one view

Partition into views

37

bull how to divide data between viewsndash encodes association between items

using spatial proximity ndash major implications for what patterns

are visiblendash split according to attributes

bull design choicesndash how many splits

bull all the way down one mark per regionbull stop earlier for more complex structure

within region

ndash order in which attribs used to splitndash how many views

Partition into Side-by-Side Views

Partitioning List alignmentbull single bar chart with grouped bars

ndash split by state into regionsbull complex glyph within each region showing all ages

ndash compare easy within state hard across ages

bull small-multiple bar chartsndash split by age into regions

bull one chart per region

ndash compare easy within age harder across states

38

110

100

90

80

70

60

50

40

30

20

10

00 CA TK NY FL IL PA

65 Years and Over45 to 64 Years25 to 44 Years18 to 24 Years14 to 17 Years5 to 13 YearsUnder 5 Years

CA TK NY FL IL PA

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

httpblocksorgmbostock3887051 httpblocksorgmbostock4679202

Partitioning Recursive subdivision

bull split by neighborhoodbull then by type bull then time

ndash years as rowsndash months as columns

bull color by price

bull neighborhood patternsndash where itrsquos expensivendash where you pay much more

for detached type

39[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull switch order of splitsndash type then neighborhood

bull switch colorndash by price variation

bull type patternsndash within specific type which

neighborhoods inconsistent

40[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull different encoding for second-level regionsndash choropleth maps

41[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

How to handle complexity 3 more strategies

42

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull reduce what is shown within single view

Reduce items and attributes

43

bull reduceincrease inversesbull filter

ndash pro straightforward and intuitivebull to understand and compute

ndash con out of sight out of mind

bull aggregationndash pro inform about whole setndash con difficult to avoid losing signal

bull not mutually exclusivendash combine filter aggregatendash combine reduce facet change derive

Reduce

Filter

Aggregate

Embed

Reducing Items and Attributes

FilterItems

Attributes

Aggregate

Items

Attributes

Idiom boxplotbull static item aggregationbull task find distributionbull data tablebull derived data

ndash 5 quant attribsbull median central linebull lower and upper quartile boxesbull lower upper fences whiskers

ndash values beyond which items are outliers

ndash outliers beyond fence cutoffs explicitly shown

44

pod and the rug plot looks like the seeds within Kampstra (2008) also suggests a way of comparing two

groups more easily use the left and right sides of the bean to display different distributions A related idea

is the raindrop plot (Barrowman and Myers 2003) but its focus is on the display of error distributions from

complex models

Figure 4 demonstrates these density boxplots applied to 100 numbers drawn from each of four distribu-

tions with mean 0 and standard deviation 1 a standard normal a skew-right distribution (Johnson distri-

bution with skewness 22 and kurtosis 13) a leptikurtic distribution (Johnson distribution with skewness 0

and kurtosis 20) and a bimodal distribution (two normals with mean -095 and 095 and standard devia-

tion 031) Richer displays of density make it much easier to see important variations in the distribution

multi-modality is particularly important and yet completely invisible with the boxplot

n s k mm

2

02

4

n s k mm

2

02

4

n s k mm

4

2

02

4

4

2

02

4

n s k mm

Figure 4 From left to right box plot vase plot violin plot and bean plot Within each plot the distributions from left to

right are standard normal (n) right-skewed (s) leptikurtic (k) and bimodal (mm) A normal kernel and bandwidth of

02 are used in all plots for all groups

A more sophisticated display is the sectioned density plot (Cohen and Cohen 2006) which uses both

colour and space to stack a density estimate into a smaller area hopefully without losing any information

(not formally verified with a perceptual study) The sectioned density plot is similar in spirit to horizon

graphs for time series (Reijner 2008) which have been found to be just as readable as regular line graphs

despite taking up much less space (Heer et al 2009) The density strips of Jackson (2008) provide a similar

compact display that uses colour instead of width to display density These methods are shown in Figure 5

6

[40 years of boxplots Wickham and Stryjewski 2012 hadconz]

Idiom Dimensionality reduction for documents

45

Task 1

InHD data

Out2D data

ProduceIn High- dimensional data

WhyWhat

Derive

In2D data

Task 2

Out 2D data

HowWhyWhat

EncodeNavigateSelect

DiscoverExploreIdentify

In 2D dataOut ScatterplotOut Clusters amp points

OutScatterplotClusters amp points

Task 3

InScatterplotClusters amp points

OutLabels for clusters

WhyWhat

ProduceAnnotate

In ScatterplotIn Clusters amp pointsOut Labels for clusters

wombat

bull attribute aggregationndash derive low-dimensional target space from high-dimensional measured space

46

Datasets

WhatAttributes

Dataset Types

Data Types

Data and Dataset Types

Tables

Attributes (columns)

Items (rows)

Cell containing value

Networks

Link

Node (item)

Trees

Fields (Continuous)

Geometry (Spatial)

Attributes (columns)

Value in cell

Cell

Multidimensional Table

Value in cell

Items Attributes Links Positions Grids

Attribute Types

Ordering Direction

Categorical

OrderedOrdinal

Quantitative

Sequential

Diverging

Cyclic

Tables Networks amp Trees

Fields Geometry Clusters Sets Lists

Items

Attributes

Items (nodes)

Links

Attributes

Grids

Positions

Attributes

Items

Positions

Items

Grid of positions

Position

Trends

Actions

Analyze

Search

Query

Why

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Network Data

Spatial Data

Topology

Paths

Extremes

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

Identify Compare Summarize

tag

Target known Target unknown

Location knownLocation unknown

Lookup

Locate

Browse

Explore

Targets

Why

What

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

algorithm

idiom

abstraction

domain

More Informationbull this talk

httpwwwcsubcca~tmmtalkshtmlvad15d3

bull book page (including tutorial lecture slides)httpwwwcsubcca~tmmvadbook

ndash 20 promo code for book+ebook combo HVN17

ndash httpwwwcrcpresscomproductisbn9781466508910

ndash illustrations Eamonn Maguire

bull papers videos software talks full courses httpwwwcsubccagroupinfovis httpwwwcsubcca~tmm

47Munzner A K Peters Visualization Series CRC Press Visualization Series 2014

Visualization Analysis and Design

tamaramunzner

Page 27: Visualization Analysis & Designii.uib.no/vis_old/publications/pdfs/2016-01-17--vad15d3unconf--edited-by-HH.pdf · Visualization is suitable when there is a need to augment human capabilities

Encoding visually with marks and channels

bull analyze idiom structurendash as combination of marks and channels

23

1 vertical position

mark line

2 vertical positionhorizontal position

mark point

3 vertical positionhorizontal positioncolor hue

mark point

4 vertical positionhorizontal positioncolor huesize (area)

mark point

24

Channels Expressiveness types and effectiveness rankingsMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

25

Channels Matching TypesMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

bull expressiveness principlendash match channel and data characteristics

26

Channels RankingsMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

bull expressiveness principlendash match channel and data characteristics

bull effectiveness principlendash encode most important attributes with

highest ranked channels

27

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

How to handle complexity 3 more strategies

28

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull change view over timebull facet across multiple

viewsbull reduce itemsattributes

within single viewbull derive new data to

show within view

How to handle complexity 3 more strategies

29

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull change over time- most obvious amp flexible

of the 4 strategies

Idiom Animated transitionsbull smooth transition from one state to another

ndash alternative to jump cutsndash support for item tracking when amount of change is limited

bull example multilevel matrix viewsndash scope of what is shown narrows down

bull middle block stretches to fill space additional structure appears withinbull other blocks squish down to increasingly aggregated representations

30[Using Multilevel Call Matrices in Large Software Projects van Ham Proc IEEE Symp Information Visualization (InfoVis) pp 227ndash232 2003]

How to handle complexity 3 more strategies

31

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull facet data across multiple views

Facet

32

Juxtapose

Partition

Superimpose

Coordinate Multiple Side By Side Views

Share Encoding SameDierent

Share Data AllSubsetNone

Share Navigation

Linked Highlighting

Idiom Linked highlighting

33

System EDVbull see how regions

contiguous in one view are distributed within anotherndash powerful and pervasive

interaction idiom

bull encoding differentndashmultiform

bull data all shared

[Visual Exploration of Large Structured Datasets Wills Proc New Techniques and Trends in Statistics (NTTS) pp 237ndash246 IOS Press 1995]

Idiom birdrsquos-eye maps

34

bull encoding samebull data subset sharedbull navigation shared

ndash bidirectional linking

bull differencesndash viewpointndash (size)

bull overview-detail

System Google Maps

[A Review of Overview+Detail Zooming and Focus+Context Interfaces Cockburn Karlson and Bederson ACM Computing Surveys 411 (2008) 1ndash31]

Idiom Small multiplesbull encoding samebull data none shared

ndash different attributes for node colors

ndash (same network layout)

bull navigation shared

35

System Cerebral

[Cerebral Visualizing Multiple Experimental Conditions on a Graph with Biological Context Barsky Munzner Gardy and Kincaid IEEE Trans Visualization and Computer Graphics (Proc InfoVis 2008) 146 (2008) 1253ndash1260]

Coordinate views Design choice interaction

36

All Subset

Same

Multiform

Multiform Overview

Detail

None

Redundant

No Linkage

Small Multiples

OverviewDetail

bull why juxtapose viewsndash benefits eyes vs memory

bull lower cognitive load to move eyes between 2 views than remembering previous state with single changing view

ndash costs display area 2 views side by side each have only half the area of one view

Partition into views

37

bull how to divide data between viewsndash encodes association between items

using spatial proximity ndash major implications for what patterns

are visiblendash split according to attributes

bull design choicesndash how many splits

bull all the way down one mark per regionbull stop earlier for more complex structure

within region

ndash order in which attribs used to splitndash how many views

Partition into Side-by-Side Views

Partitioning List alignmentbull single bar chart with grouped bars

ndash split by state into regionsbull complex glyph within each region showing all ages

ndash compare easy within state hard across ages

bull small-multiple bar chartsndash split by age into regions

bull one chart per region

ndash compare easy within age harder across states

38

110

100

90

80

70

60

50

40

30

20

10

00 CA TK NY FL IL PA

65 Years and Over45 to 64 Years25 to 44 Years18 to 24 Years14 to 17 Years5 to 13 YearsUnder 5 Years

CA TK NY FL IL PA

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

httpblocksorgmbostock3887051 httpblocksorgmbostock4679202

Partitioning Recursive subdivision

bull split by neighborhoodbull then by type bull then time

ndash years as rowsndash months as columns

bull color by price

bull neighborhood patternsndash where itrsquos expensivendash where you pay much more

for detached type

39[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull switch order of splitsndash type then neighborhood

bull switch colorndash by price variation

bull type patternsndash within specific type which

neighborhoods inconsistent

40[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull different encoding for second-level regionsndash choropleth maps

41[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

How to handle complexity 3 more strategies

42

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull reduce what is shown within single view

Reduce items and attributes

43

bull reduceincrease inversesbull filter

ndash pro straightforward and intuitivebull to understand and compute

ndash con out of sight out of mind

bull aggregationndash pro inform about whole setndash con difficult to avoid losing signal

bull not mutually exclusivendash combine filter aggregatendash combine reduce facet change derive

Reduce

Filter

Aggregate

Embed

Reducing Items and Attributes

FilterItems

Attributes

Aggregate

Items

Attributes

Idiom boxplotbull static item aggregationbull task find distributionbull data tablebull derived data

ndash 5 quant attribsbull median central linebull lower and upper quartile boxesbull lower upper fences whiskers

ndash values beyond which items are outliers

ndash outliers beyond fence cutoffs explicitly shown

44

pod and the rug plot looks like the seeds within Kampstra (2008) also suggests a way of comparing two

groups more easily use the left and right sides of the bean to display different distributions A related idea

is the raindrop plot (Barrowman and Myers 2003) but its focus is on the display of error distributions from

complex models

Figure 4 demonstrates these density boxplots applied to 100 numbers drawn from each of four distribu-

tions with mean 0 and standard deviation 1 a standard normal a skew-right distribution (Johnson distri-

bution with skewness 22 and kurtosis 13) a leptikurtic distribution (Johnson distribution with skewness 0

and kurtosis 20) and a bimodal distribution (two normals with mean -095 and 095 and standard devia-

tion 031) Richer displays of density make it much easier to see important variations in the distribution

multi-modality is particularly important and yet completely invisible with the boxplot

n s k mm

2

02

4

n s k mm

2

02

4

n s k mm

4

2

02

4

4

2

02

4

n s k mm

Figure 4 From left to right box plot vase plot violin plot and bean plot Within each plot the distributions from left to

right are standard normal (n) right-skewed (s) leptikurtic (k) and bimodal (mm) A normal kernel and bandwidth of

02 are used in all plots for all groups

A more sophisticated display is the sectioned density plot (Cohen and Cohen 2006) which uses both

colour and space to stack a density estimate into a smaller area hopefully without losing any information

(not formally verified with a perceptual study) The sectioned density plot is similar in spirit to horizon

graphs for time series (Reijner 2008) which have been found to be just as readable as regular line graphs

despite taking up much less space (Heer et al 2009) The density strips of Jackson (2008) provide a similar

compact display that uses colour instead of width to display density These methods are shown in Figure 5

6

[40 years of boxplots Wickham and Stryjewski 2012 hadconz]

Idiom Dimensionality reduction for documents

45

Task 1

InHD data

Out2D data

ProduceIn High- dimensional data

WhyWhat

Derive

In2D data

Task 2

Out 2D data

HowWhyWhat

EncodeNavigateSelect

DiscoverExploreIdentify

In 2D dataOut ScatterplotOut Clusters amp points

OutScatterplotClusters amp points

Task 3

InScatterplotClusters amp points

OutLabels for clusters

WhyWhat

ProduceAnnotate

In ScatterplotIn Clusters amp pointsOut Labels for clusters

wombat

bull attribute aggregationndash derive low-dimensional target space from high-dimensional measured space

46

Datasets

WhatAttributes

Dataset Types

Data Types

Data and Dataset Types

Tables

Attributes (columns)

Items (rows)

Cell containing value

Networks

Link

Node (item)

Trees

Fields (Continuous)

Geometry (Spatial)

Attributes (columns)

Value in cell

Cell

Multidimensional Table

Value in cell

Items Attributes Links Positions Grids

Attribute Types

Ordering Direction

Categorical

OrderedOrdinal

Quantitative

Sequential

Diverging

Cyclic

Tables Networks amp Trees

Fields Geometry Clusters Sets Lists

Items

Attributes

Items (nodes)

Links

Attributes

Grids

Positions

Attributes

Items

Positions

Items

Grid of positions

Position

Trends

Actions

Analyze

Search

Query

Why

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Network Data

Spatial Data

Topology

Paths

Extremes

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

Identify Compare Summarize

tag

Target known Target unknown

Location knownLocation unknown

Lookup

Locate

Browse

Explore

Targets

Why

What

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

algorithm

idiom

abstraction

domain

More Informationbull this talk

httpwwwcsubcca~tmmtalkshtmlvad15d3

bull book page (including tutorial lecture slides)httpwwwcsubcca~tmmvadbook

ndash 20 promo code for book+ebook combo HVN17

ndash httpwwwcrcpresscomproductisbn9781466508910

ndash illustrations Eamonn Maguire

bull papers videos software talks full courses httpwwwcsubccagroupinfovis httpwwwcsubcca~tmm

47Munzner A K Peters Visualization Series CRC Press Visualization Series 2014

Visualization Analysis and Design

tamaramunzner

Page 28: Visualization Analysis & Designii.uib.no/vis_old/publications/pdfs/2016-01-17--vad15d3unconf--edited-by-HH.pdf · Visualization is suitable when there is a need to augment human capabilities

24

Channels Expressiveness types and effectiveness rankingsMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

25

Channels Matching TypesMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

bull expressiveness principlendash match channel and data characteristics

26

Channels RankingsMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

bull expressiveness principlendash match channel and data characteristics

bull effectiveness principlendash encode most important attributes with

highest ranked channels

27

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

How to handle complexity 3 more strategies

28

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull change view over timebull facet across multiple

viewsbull reduce itemsattributes

within single viewbull derive new data to

show within view

How to handle complexity 3 more strategies

29

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull change over time- most obvious amp flexible

of the 4 strategies

Idiom Animated transitionsbull smooth transition from one state to another

ndash alternative to jump cutsndash support for item tracking when amount of change is limited

bull example multilevel matrix viewsndash scope of what is shown narrows down

bull middle block stretches to fill space additional structure appears withinbull other blocks squish down to increasingly aggregated representations

30[Using Multilevel Call Matrices in Large Software Projects van Ham Proc IEEE Symp Information Visualization (InfoVis) pp 227ndash232 2003]

How to handle complexity 3 more strategies

31

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull facet data across multiple views

Facet

32

Juxtapose

Partition

Superimpose

Coordinate Multiple Side By Side Views

Share Encoding SameDierent

Share Data AllSubsetNone

Share Navigation

Linked Highlighting

Idiom Linked highlighting

33

System EDVbull see how regions

contiguous in one view are distributed within anotherndash powerful and pervasive

interaction idiom

bull encoding differentndashmultiform

bull data all shared

[Visual Exploration of Large Structured Datasets Wills Proc New Techniques and Trends in Statistics (NTTS) pp 237ndash246 IOS Press 1995]

Idiom birdrsquos-eye maps

34

bull encoding samebull data subset sharedbull navigation shared

ndash bidirectional linking

bull differencesndash viewpointndash (size)

bull overview-detail

System Google Maps

[A Review of Overview+Detail Zooming and Focus+Context Interfaces Cockburn Karlson and Bederson ACM Computing Surveys 411 (2008) 1ndash31]

Idiom Small multiplesbull encoding samebull data none shared

ndash different attributes for node colors

ndash (same network layout)

bull navigation shared

35

System Cerebral

[Cerebral Visualizing Multiple Experimental Conditions on a Graph with Biological Context Barsky Munzner Gardy and Kincaid IEEE Trans Visualization and Computer Graphics (Proc InfoVis 2008) 146 (2008) 1253ndash1260]

Coordinate views Design choice interaction

36

All Subset

Same

Multiform

Multiform Overview

Detail

None

Redundant

No Linkage

Small Multiples

OverviewDetail

bull why juxtapose viewsndash benefits eyes vs memory

bull lower cognitive load to move eyes between 2 views than remembering previous state with single changing view

ndash costs display area 2 views side by side each have only half the area of one view

Partition into views

37

bull how to divide data between viewsndash encodes association between items

using spatial proximity ndash major implications for what patterns

are visiblendash split according to attributes

bull design choicesndash how many splits

bull all the way down one mark per regionbull stop earlier for more complex structure

within region

ndash order in which attribs used to splitndash how many views

Partition into Side-by-Side Views

Partitioning List alignmentbull single bar chart with grouped bars

ndash split by state into regionsbull complex glyph within each region showing all ages

ndash compare easy within state hard across ages

bull small-multiple bar chartsndash split by age into regions

bull one chart per region

ndash compare easy within age harder across states

38

110

100

90

80

70

60

50

40

30

20

10

00 CA TK NY FL IL PA

65 Years and Over45 to 64 Years25 to 44 Years18 to 24 Years14 to 17 Years5 to 13 YearsUnder 5 Years

CA TK NY FL IL PA

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

httpblocksorgmbostock3887051 httpblocksorgmbostock4679202

Partitioning Recursive subdivision

bull split by neighborhoodbull then by type bull then time

ndash years as rowsndash months as columns

bull color by price

bull neighborhood patternsndash where itrsquos expensivendash where you pay much more

for detached type

39[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull switch order of splitsndash type then neighborhood

bull switch colorndash by price variation

bull type patternsndash within specific type which

neighborhoods inconsistent

40[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull different encoding for second-level regionsndash choropleth maps

41[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

How to handle complexity 3 more strategies

42

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull reduce what is shown within single view

Reduce items and attributes

43

bull reduceincrease inversesbull filter

ndash pro straightforward and intuitivebull to understand and compute

ndash con out of sight out of mind

bull aggregationndash pro inform about whole setndash con difficult to avoid losing signal

bull not mutually exclusivendash combine filter aggregatendash combine reduce facet change derive

Reduce

Filter

Aggregate

Embed

Reducing Items and Attributes

FilterItems

Attributes

Aggregate

Items

Attributes

Idiom boxplotbull static item aggregationbull task find distributionbull data tablebull derived data

ndash 5 quant attribsbull median central linebull lower and upper quartile boxesbull lower upper fences whiskers

ndash values beyond which items are outliers

ndash outliers beyond fence cutoffs explicitly shown

44

pod and the rug plot looks like the seeds within Kampstra (2008) also suggests a way of comparing two

groups more easily use the left and right sides of the bean to display different distributions A related idea

is the raindrop plot (Barrowman and Myers 2003) but its focus is on the display of error distributions from

complex models

Figure 4 demonstrates these density boxplots applied to 100 numbers drawn from each of four distribu-

tions with mean 0 and standard deviation 1 a standard normal a skew-right distribution (Johnson distri-

bution with skewness 22 and kurtosis 13) a leptikurtic distribution (Johnson distribution with skewness 0

and kurtosis 20) and a bimodal distribution (two normals with mean -095 and 095 and standard devia-

tion 031) Richer displays of density make it much easier to see important variations in the distribution

multi-modality is particularly important and yet completely invisible with the boxplot

n s k mm

2

02

4

n s k mm

2

02

4

n s k mm

4

2

02

4

4

2

02

4

n s k mm

Figure 4 From left to right box plot vase plot violin plot and bean plot Within each plot the distributions from left to

right are standard normal (n) right-skewed (s) leptikurtic (k) and bimodal (mm) A normal kernel and bandwidth of

02 are used in all plots for all groups

A more sophisticated display is the sectioned density plot (Cohen and Cohen 2006) which uses both

colour and space to stack a density estimate into a smaller area hopefully without losing any information

(not formally verified with a perceptual study) The sectioned density plot is similar in spirit to horizon

graphs for time series (Reijner 2008) which have been found to be just as readable as regular line graphs

despite taking up much less space (Heer et al 2009) The density strips of Jackson (2008) provide a similar

compact display that uses colour instead of width to display density These methods are shown in Figure 5

6

[40 years of boxplots Wickham and Stryjewski 2012 hadconz]

Idiom Dimensionality reduction for documents

45

Task 1

InHD data

Out2D data

ProduceIn High- dimensional data

WhyWhat

Derive

In2D data

Task 2

Out 2D data

HowWhyWhat

EncodeNavigateSelect

DiscoverExploreIdentify

In 2D dataOut ScatterplotOut Clusters amp points

OutScatterplotClusters amp points

Task 3

InScatterplotClusters amp points

OutLabels for clusters

WhyWhat

ProduceAnnotate

In ScatterplotIn Clusters amp pointsOut Labels for clusters

wombat

bull attribute aggregationndash derive low-dimensional target space from high-dimensional measured space

46

Datasets

WhatAttributes

Dataset Types

Data Types

Data and Dataset Types

Tables

Attributes (columns)

Items (rows)

Cell containing value

Networks

Link

Node (item)

Trees

Fields (Continuous)

Geometry (Spatial)

Attributes (columns)

Value in cell

Cell

Multidimensional Table

Value in cell

Items Attributes Links Positions Grids

Attribute Types

Ordering Direction

Categorical

OrderedOrdinal

Quantitative

Sequential

Diverging

Cyclic

Tables Networks amp Trees

Fields Geometry Clusters Sets Lists

Items

Attributes

Items (nodes)

Links

Attributes

Grids

Positions

Attributes

Items

Positions

Items

Grid of positions

Position

Trends

Actions

Analyze

Search

Query

Why

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Network Data

Spatial Data

Topology

Paths

Extremes

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

Identify Compare Summarize

tag

Target known Target unknown

Location knownLocation unknown

Lookup

Locate

Browse

Explore

Targets

Why

What

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

algorithm

idiom

abstraction

domain

More Informationbull this talk

httpwwwcsubcca~tmmtalkshtmlvad15d3

bull book page (including tutorial lecture slides)httpwwwcsubcca~tmmvadbook

ndash 20 promo code for book+ebook combo HVN17

ndash httpwwwcrcpresscomproductisbn9781466508910

ndash illustrations Eamonn Maguire

bull papers videos software talks full courses httpwwwcsubccagroupinfovis httpwwwcsubcca~tmm

47Munzner A K Peters Visualization Series CRC Press Visualization Series 2014

Visualization Analysis and Design

tamaramunzner

Page 29: Visualization Analysis & Designii.uib.no/vis_old/publications/pdfs/2016-01-17--vad15d3unconf--edited-by-HH.pdf · Visualization is suitable when there is a need to augment human capabilities

25

Channels Matching TypesMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

bull expressiveness principlendash match channel and data characteristics

26

Channels RankingsMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

bull expressiveness principlendash match channel and data characteristics

bull effectiveness principlendash encode most important attributes with

highest ranked channels

27

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

How to handle complexity 3 more strategies

28

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull change view over timebull facet across multiple

viewsbull reduce itemsattributes

within single viewbull derive new data to

show within view

How to handle complexity 3 more strategies

29

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull change over time- most obvious amp flexible

of the 4 strategies

Idiom Animated transitionsbull smooth transition from one state to another

ndash alternative to jump cutsndash support for item tracking when amount of change is limited

bull example multilevel matrix viewsndash scope of what is shown narrows down

bull middle block stretches to fill space additional structure appears withinbull other blocks squish down to increasingly aggregated representations

30[Using Multilevel Call Matrices in Large Software Projects van Ham Proc IEEE Symp Information Visualization (InfoVis) pp 227ndash232 2003]

How to handle complexity 3 more strategies

31

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull facet data across multiple views

Facet

32

Juxtapose

Partition

Superimpose

Coordinate Multiple Side By Side Views

Share Encoding SameDierent

Share Data AllSubsetNone

Share Navigation

Linked Highlighting

Idiom Linked highlighting

33

System EDVbull see how regions

contiguous in one view are distributed within anotherndash powerful and pervasive

interaction idiom

bull encoding differentndashmultiform

bull data all shared

[Visual Exploration of Large Structured Datasets Wills Proc New Techniques and Trends in Statistics (NTTS) pp 237ndash246 IOS Press 1995]

Idiom birdrsquos-eye maps

34

bull encoding samebull data subset sharedbull navigation shared

ndash bidirectional linking

bull differencesndash viewpointndash (size)

bull overview-detail

System Google Maps

[A Review of Overview+Detail Zooming and Focus+Context Interfaces Cockburn Karlson and Bederson ACM Computing Surveys 411 (2008) 1ndash31]

Idiom Small multiplesbull encoding samebull data none shared

ndash different attributes for node colors

ndash (same network layout)

bull navigation shared

35

System Cerebral

[Cerebral Visualizing Multiple Experimental Conditions on a Graph with Biological Context Barsky Munzner Gardy and Kincaid IEEE Trans Visualization and Computer Graphics (Proc InfoVis 2008) 146 (2008) 1253ndash1260]

Coordinate views Design choice interaction

36

All Subset

Same

Multiform

Multiform Overview

Detail

None

Redundant

No Linkage

Small Multiples

OverviewDetail

bull why juxtapose viewsndash benefits eyes vs memory

bull lower cognitive load to move eyes between 2 views than remembering previous state with single changing view

ndash costs display area 2 views side by side each have only half the area of one view

Partition into views

37

bull how to divide data between viewsndash encodes association between items

using spatial proximity ndash major implications for what patterns

are visiblendash split according to attributes

bull design choicesndash how many splits

bull all the way down one mark per regionbull stop earlier for more complex structure

within region

ndash order in which attribs used to splitndash how many views

Partition into Side-by-Side Views

Partitioning List alignmentbull single bar chart with grouped bars

ndash split by state into regionsbull complex glyph within each region showing all ages

ndash compare easy within state hard across ages

bull small-multiple bar chartsndash split by age into regions

bull one chart per region

ndash compare easy within age harder across states

38

110

100

90

80

70

60

50

40

30

20

10

00 CA TK NY FL IL PA

65 Years and Over45 to 64 Years25 to 44 Years18 to 24 Years14 to 17 Years5 to 13 YearsUnder 5 Years

CA TK NY FL IL PA

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

httpblocksorgmbostock3887051 httpblocksorgmbostock4679202

Partitioning Recursive subdivision

bull split by neighborhoodbull then by type bull then time

ndash years as rowsndash months as columns

bull color by price

bull neighborhood patternsndash where itrsquos expensivendash where you pay much more

for detached type

39[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull switch order of splitsndash type then neighborhood

bull switch colorndash by price variation

bull type patternsndash within specific type which

neighborhoods inconsistent

40[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull different encoding for second-level regionsndash choropleth maps

41[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

How to handle complexity 3 more strategies

42

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull reduce what is shown within single view

Reduce items and attributes

43

bull reduceincrease inversesbull filter

ndash pro straightforward and intuitivebull to understand and compute

ndash con out of sight out of mind

bull aggregationndash pro inform about whole setndash con difficult to avoid losing signal

bull not mutually exclusivendash combine filter aggregatendash combine reduce facet change derive

Reduce

Filter

Aggregate

Embed

Reducing Items and Attributes

FilterItems

Attributes

Aggregate

Items

Attributes

Idiom boxplotbull static item aggregationbull task find distributionbull data tablebull derived data

ndash 5 quant attribsbull median central linebull lower and upper quartile boxesbull lower upper fences whiskers

ndash values beyond which items are outliers

ndash outliers beyond fence cutoffs explicitly shown

44

pod and the rug plot looks like the seeds within Kampstra (2008) also suggests a way of comparing two

groups more easily use the left and right sides of the bean to display different distributions A related idea

is the raindrop plot (Barrowman and Myers 2003) but its focus is on the display of error distributions from

complex models

Figure 4 demonstrates these density boxplots applied to 100 numbers drawn from each of four distribu-

tions with mean 0 and standard deviation 1 a standard normal a skew-right distribution (Johnson distri-

bution with skewness 22 and kurtosis 13) a leptikurtic distribution (Johnson distribution with skewness 0

and kurtosis 20) and a bimodal distribution (two normals with mean -095 and 095 and standard devia-

tion 031) Richer displays of density make it much easier to see important variations in the distribution

multi-modality is particularly important and yet completely invisible with the boxplot

n s k mm

2

02

4

n s k mm

2

02

4

n s k mm

4

2

02

4

4

2

02

4

n s k mm

Figure 4 From left to right box plot vase plot violin plot and bean plot Within each plot the distributions from left to

right are standard normal (n) right-skewed (s) leptikurtic (k) and bimodal (mm) A normal kernel and bandwidth of

02 are used in all plots for all groups

A more sophisticated display is the sectioned density plot (Cohen and Cohen 2006) which uses both

colour and space to stack a density estimate into a smaller area hopefully without losing any information

(not formally verified with a perceptual study) The sectioned density plot is similar in spirit to horizon

graphs for time series (Reijner 2008) which have been found to be just as readable as regular line graphs

despite taking up much less space (Heer et al 2009) The density strips of Jackson (2008) provide a similar

compact display that uses colour instead of width to display density These methods are shown in Figure 5

6

[40 years of boxplots Wickham and Stryjewski 2012 hadconz]

Idiom Dimensionality reduction for documents

45

Task 1

InHD data

Out2D data

ProduceIn High- dimensional data

WhyWhat

Derive

In2D data

Task 2

Out 2D data

HowWhyWhat

EncodeNavigateSelect

DiscoverExploreIdentify

In 2D dataOut ScatterplotOut Clusters amp points

OutScatterplotClusters amp points

Task 3

InScatterplotClusters amp points

OutLabels for clusters

WhyWhat

ProduceAnnotate

In ScatterplotIn Clusters amp pointsOut Labels for clusters

wombat

bull attribute aggregationndash derive low-dimensional target space from high-dimensional measured space

46

Datasets

WhatAttributes

Dataset Types

Data Types

Data and Dataset Types

Tables

Attributes (columns)

Items (rows)

Cell containing value

Networks

Link

Node (item)

Trees

Fields (Continuous)

Geometry (Spatial)

Attributes (columns)

Value in cell

Cell

Multidimensional Table

Value in cell

Items Attributes Links Positions Grids

Attribute Types

Ordering Direction

Categorical

OrderedOrdinal

Quantitative

Sequential

Diverging

Cyclic

Tables Networks amp Trees

Fields Geometry Clusters Sets Lists

Items

Attributes

Items (nodes)

Links

Attributes

Grids

Positions

Attributes

Items

Positions

Items

Grid of positions

Position

Trends

Actions

Analyze

Search

Query

Why

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Network Data

Spatial Data

Topology

Paths

Extremes

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

Identify Compare Summarize

tag

Target known Target unknown

Location knownLocation unknown

Lookup

Locate

Browse

Explore

Targets

Why

What

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

algorithm

idiom

abstraction

domain

More Informationbull this talk

httpwwwcsubcca~tmmtalkshtmlvad15d3

bull book page (including tutorial lecture slides)httpwwwcsubcca~tmmvadbook

ndash 20 promo code for book+ebook combo HVN17

ndash httpwwwcrcpresscomproductisbn9781466508910

ndash illustrations Eamonn Maguire

bull papers videos software talks full courses httpwwwcsubccagroupinfovis httpwwwcsubcca~tmm

47Munzner A K Peters Visualization Series CRC Press Visualization Series 2014

Visualization Analysis and Design

tamaramunzner

Page 30: Visualization Analysis & Designii.uib.no/vis_old/publications/pdfs/2016-01-17--vad15d3unconf--edited-by-HH.pdf · Visualization is suitable when there is a need to augment human capabilities

26

Channels RankingsMagnitude Channels Ordered Attributes Identity Channels Categorical Attributes

Spatial region

Color hue

Motion

Shape

Position on common scale

Position on unaligned scale

Length (1D size)

Tiltangle

Area (2D size)

Depth (3D position)

Color luminance

Color saturation

Curvature

Volume (3D size)

bull expressiveness principlendash match channel and data characteristics

bull effectiveness principlendash encode most important attributes with

highest ranked channels

27

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

How to handle complexity 3 more strategies

28

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull change view over timebull facet across multiple

viewsbull reduce itemsattributes

within single viewbull derive new data to

show within view

How to handle complexity 3 more strategies

29

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull change over time- most obvious amp flexible

of the 4 strategies

Idiom Animated transitionsbull smooth transition from one state to another

ndash alternative to jump cutsndash support for item tracking when amount of change is limited

bull example multilevel matrix viewsndash scope of what is shown narrows down

bull middle block stretches to fill space additional structure appears withinbull other blocks squish down to increasingly aggregated representations

30[Using Multilevel Call Matrices in Large Software Projects van Ham Proc IEEE Symp Information Visualization (InfoVis) pp 227ndash232 2003]

How to handle complexity 3 more strategies

31

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull facet data across multiple views

Facet

32

Juxtapose

Partition

Superimpose

Coordinate Multiple Side By Side Views

Share Encoding SameDierent

Share Data AllSubsetNone

Share Navigation

Linked Highlighting

Idiom Linked highlighting

33

System EDVbull see how regions

contiguous in one view are distributed within anotherndash powerful and pervasive

interaction idiom

bull encoding differentndashmultiform

bull data all shared

[Visual Exploration of Large Structured Datasets Wills Proc New Techniques and Trends in Statistics (NTTS) pp 237ndash246 IOS Press 1995]

Idiom birdrsquos-eye maps

34

bull encoding samebull data subset sharedbull navigation shared

ndash bidirectional linking

bull differencesndash viewpointndash (size)

bull overview-detail

System Google Maps

[A Review of Overview+Detail Zooming and Focus+Context Interfaces Cockburn Karlson and Bederson ACM Computing Surveys 411 (2008) 1ndash31]

Idiom Small multiplesbull encoding samebull data none shared

ndash different attributes for node colors

ndash (same network layout)

bull navigation shared

35

System Cerebral

[Cerebral Visualizing Multiple Experimental Conditions on a Graph with Biological Context Barsky Munzner Gardy and Kincaid IEEE Trans Visualization and Computer Graphics (Proc InfoVis 2008) 146 (2008) 1253ndash1260]

Coordinate views Design choice interaction

36

All Subset

Same

Multiform

Multiform Overview

Detail

None

Redundant

No Linkage

Small Multiples

OverviewDetail

bull why juxtapose viewsndash benefits eyes vs memory

bull lower cognitive load to move eyes between 2 views than remembering previous state with single changing view

ndash costs display area 2 views side by side each have only half the area of one view

Partition into views

37

bull how to divide data between viewsndash encodes association between items

using spatial proximity ndash major implications for what patterns

are visiblendash split according to attributes

bull design choicesndash how many splits

bull all the way down one mark per regionbull stop earlier for more complex structure

within region

ndash order in which attribs used to splitndash how many views

Partition into Side-by-Side Views

Partitioning List alignmentbull single bar chart with grouped bars

ndash split by state into regionsbull complex glyph within each region showing all ages

ndash compare easy within state hard across ages

bull small-multiple bar chartsndash split by age into regions

bull one chart per region

ndash compare easy within age harder across states

38

110

100

90

80

70

60

50

40

30

20

10

00 CA TK NY FL IL PA

65 Years and Over45 to 64 Years25 to 44 Years18 to 24 Years14 to 17 Years5 to 13 YearsUnder 5 Years

CA TK NY FL IL PA

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

httpblocksorgmbostock3887051 httpblocksorgmbostock4679202

Partitioning Recursive subdivision

bull split by neighborhoodbull then by type bull then time

ndash years as rowsndash months as columns

bull color by price

bull neighborhood patternsndash where itrsquos expensivendash where you pay much more

for detached type

39[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull switch order of splitsndash type then neighborhood

bull switch colorndash by price variation

bull type patternsndash within specific type which

neighborhoods inconsistent

40[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull different encoding for second-level regionsndash choropleth maps

41[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

How to handle complexity 3 more strategies

42

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull reduce what is shown within single view

Reduce items and attributes

43

bull reduceincrease inversesbull filter

ndash pro straightforward and intuitivebull to understand and compute

ndash con out of sight out of mind

bull aggregationndash pro inform about whole setndash con difficult to avoid losing signal

bull not mutually exclusivendash combine filter aggregatendash combine reduce facet change derive

Reduce

Filter

Aggregate

Embed

Reducing Items and Attributes

FilterItems

Attributes

Aggregate

Items

Attributes

Idiom boxplotbull static item aggregationbull task find distributionbull data tablebull derived data

ndash 5 quant attribsbull median central linebull lower and upper quartile boxesbull lower upper fences whiskers

ndash values beyond which items are outliers

ndash outliers beyond fence cutoffs explicitly shown

44

pod and the rug plot looks like the seeds within Kampstra (2008) also suggests a way of comparing two

groups more easily use the left and right sides of the bean to display different distributions A related idea

is the raindrop plot (Barrowman and Myers 2003) but its focus is on the display of error distributions from

complex models

Figure 4 demonstrates these density boxplots applied to 100 numbers drawn from each of four distribu-

tions with mean 0 and standard deviation 1 a standard normal a skew-right distribution (Johnson distri-

bution with skewness 22 and kurtosis 13) a leptikurtic distribution (Johnson distribution with skewness 0

and kurtosis 20) and a bimodal distribution (two normals with mean -095 and 095 and standard devia-

tion 031) Richer displays of density make it much easier to see important variations in the distribution

multi-modality is particularly important and yet completely invisible with the boxplot

n s k mm

2

02

4

n s k mm

2

02

4

n s k mm

4

2

02

4

4

2

02

4

n s k mm

Figure 4 From left to right box plot vase plot violin plot and bean plot Within each plot the distributions from left to

right are standard normal (n) right-skewed (s) leptikurtic (k) and bimodal (mm) A normal kernel and bandwidth of

02 are used in all plots for all groups

A more sophisticated display is the sectioned density plot (Cohen and Cohen 2006) which uses both

colour and space to stack a density estimate into a smaller area hopefully without losing any information

(not formally verified with a perceptual study) The sectioned density plot is similar in spirit to horizon

graphs for time series (Reijner 2008) which have been found to be just as readable as regular line graphs

despite taking up much less space (Heer et al 2009) The density strips of Jackson (2008) provide a similar

compact display that uses colour instead of width to display density These methods are shown in Figure 5

6

[40 years of boxplots Wickham and Stryjewski 2012 hadconz]

Idiom Dimensionality reduction for documents

45

Task 1

InHD data

Out2D data

ProduceIn High- dimensional data

WhyWhat

Derive

In2D data

Task 2

Out 2D data

HowWhyWhat

EncodeNavigateSelect

DiscoverExploreIdentify

In 2D dataOut ScatterplotOut Clusters amp points

OutScatterplotClusters amp points

Task 3

InScatterplotClusters amp points

OutLabels for clusters

WhyWhat

ProduceAnnotate

In ScatterplotIn Clusters amp pointsOut Labels for clusters

wombat

bull attribute aggregationndash derive low-dimensional target space from high-dimensional measured space

46

Datasets

WhatAttributes

Dataset Types

Data Types

Data and Dataset Types

Tables

Attributes (columns)

Items (rows)

Cell containing value

Networks

Link

Node (item)

Trees

Fields (Continuous)

Geometry (Spatial)

Attributes (columns)

Value in cell

Cell

Multidimensional Table

Value in cell

Items Attributes Links Positions Grids

Attribute Types

Ordering Direction

Categorical

OrderedOrdinal

Quantitative

Sequential

Diverging

Cyclic

Tables Networks amp Trees

Fields Geometry Clusters Sets Lists

Items

Attributes

Items (nodes)

Links

Attributes

Grids

Positions

Attributes

Items

Positions

Items

Grid of positions

Position

Trends

Actions

Analyze

Search

Query

Why

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Network Data

Spatial Data

Topology

Paths

Extremes

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

Identify Compare Summarize

tag

Target known Target unknown

Location knownLocation unknown

Lookup

Locate

Browse

Explore

Targets

Why

What

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

algorithm

idiom

abstraction

domain

More Informationbull this talk

httpwwwcsubcca~tmmtalkshtmlvad15d3

bull book page (including tutorial lecture slides)httpwwwcsubcca~tmmvadbook

ndash 20 promo code for book+ebook combo HVN17

ndash httpwwwcrcpresscomproductisbn9781466508910

ndash illustrations Eamonn Maguire

bull papers videos software talks full courses httpwwwcsubccagroupinfovis httpwwwcsubcca~tmm

47Munzner A K Peters Visualization Series CRC Press Visualization Series 2014

Visualization Analysis and Design

tamaramunzner

Page 31: Visualization Analysis & Designii.uib.no/vis_old/publications/pdfs/2016-01-17--vad15d3unconf--edited-by-HH.pdf · Visualization is suitable when there is a need to augment human capabilities

27

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

How to handle complexity 3 more strategies

28

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull change view over timebull facet across multiple

viewsbull reduce itemsattributes

within single viewbull derive new data to

show within view

How to handle complexity 3 more strategies

29

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull change over time- most obvious amp flexible

of the 4 strategies

Idiom Animated transitionsbull smooth transition from one state to another

ndash alternative to jump cutsndash support for item tracking when amount of change is limited

bull example multilevel matrix viewsndash scope of what is shown narrows down

bull middle block stretches to fill space additional structure appears withinbull other blocks squish down to increasingly aggregated representations

30[Using Multilevel Call Matrices in Large Software Projects van Ham Proc IEEE Symp Information Visualization (InfoVis) pp 227ndash232 2003]

How to handle complexity 3 more strategies

31

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull facet data across multiple views

Facet

32

Juxtapose

Partition

Superimpose

Coordinate Multiple Side By Side Views

Share Encoding SameDierent

Share Data AllSubsetNone

Share Navigation

Linked Highlighting

Idiom Linked highlighting

33

System EDVbull see how regions

contiguous in one view are distributed within anotherndash powerful and pervasive

interaction idiom

bull encoding differentndashmultiform

bull data all shared

[Visual Exploration of Large Structured Datasets Wills Proc New Techniques and Trends in Statistics (NTTS) pp 237ndash246 IOS Press 1995]

Idiom birdrsquos-eye maps

34

bull encoding samebull data subset sharedbull navigation shared

ndash bidirectional linking

bull differencesndash viewpointndash (size)

bull overview-detail

System Google Maps

[A Review of Overview+Detail Zooming and Focus+Context Interfaces Cockburn Karlson and Bederson ACM Computing Surveys 411 (2008) 1ndash31]

Idiom Small multiplesbull encoding samebull data none shared

ndash different attributes for node colors

ndash (same network layout)

bull navigation shared

35

System Cerebral

[Cerebral Visualizing Multiple Experimental Conditions on a Graph with Biological Context Barsky Munzner Gardy and Kincaid IEEE Trans Visualization and Computer Graphics (Proc InfoVis 2008) 146 (2008) 1253ndash1260]

Coordinate views Design choice interaction

36

All Subset

Same

Multiform

Multiform Overview

Detail

None

Redundant

No Linkage

Small Multiples

OverviewDetail

bull why juxtapose viewsndash benefits eyes vs memory

bull lower cognitive load to move eyes between 2 views than remembering previous state with single changing view

ndash costs display area 2 views side by side each have only half the area of one view

Partition into views

37

bull how to divide data between viewsndash encodes association between items

using spatial proximity ndash major implications for what patterns

are visiblendash split according to attributes

bull design choicesndash how many splits

bull all the way down one mark per regionbull stop earlier for more complex structure

within region

ndash order in which attribs used to splitndash how many views

Partition into Side-by-Side Views

Partitioning List alignmentbull single bar chart with grouped bars

ndash split by state into regionsbull complex glyph within each region showing all ages

ndash compare easy within state hard across ages

bull small-multiple bar chartsndash split by age into regions

bull one chart per region

ndash compare easy within age harder across states

38

110

100

90

80

70

60

50

40

30

20

10

00 CA TK NY FL IL PA

65 Years and Over45 to 64 Years25 to 44 Years18 to 24 Years14 to 17 Years5 to 13 YearsUnder 5 Years

CA TK NY FL IL PA

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

httpblocksorgmbostock3887051 httpblocksorgmbostock4679202

Partitioning Recursive subdivision

bull split by neighborhoodbull then by type bull then time

ndash years as rowsndash months as columns

bull color by price

bull neighborhood patternsndash where itrsquos expensivendash where you pay much more

for detached type

39[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull switch order of splitsndash type then neighborhood

bull switch colorndash by price variation

bull type patternsndash within specific type which

neighborhoods inconsistent

40[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull different encoding for second-level regionsndash choropleth maps

41[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

How to handle complexity 3 more strategies

42

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull reduce what is shown within single view

Reduce items and attributes

43

bull reduceincrease inversesbull filter

ndash pro straightforward and intuitivebull to understand and compute

ndash con out of sight out of mind

bull aggregationndash pro inform about whole setndash con difficult to avoid losing signal

bull not mutually exclusivendash combine filter aggregatendash combine reduce facet change derive

Reduce

Filter

Aggregate

Embed

Reducing Items and Attributes

FilterItems

Attributes

Aggregate

Items

Attributes

Idiom boxplotbull static item aggregationbull task find distributionbull data tablebull derived data

ndash 5 quant attribsbull median central linebull lower and upper quartile boxesbull lower upper fences whiskers

ndash values beyond which items are outliers

ndash outliers beyond fence cutoffs explicitly shown

44

pod and the rug plot looks like the seeds within Kampstra (2008) also suggests a way of comparing two

groups more easily use the left and right sides of the bean to display different distributions A related idea

is the raindrop plot (Barrowman and Myers 2003) but its focus is on the display of error distributions from

complex models

Figure 4 demonstrates these density boxplots applied to 100 numbers drawn from each of four distribu-

tions with mean 0 and standard deviation 1 a standard normal a skew-right distribution (Johnson distri-

bution with skewness 22 and kurtosis 13) a leptikurtic distribution (Johnson distribution with skewness 0

and kurtosis 20) and a bimodal distribution (two normals with mean -095 and 095 and standard devia-

tion 031) Richer displays of density make it much easier to see important variations in the distribution

multi-modality is particularly important and yet completely invisible with the boxplot

n s k mm

2

02

4

n s k mm

2

02

4

n s k mm

4

2

02

4

4

2

02

4

n s k mm

Figure 4 From left to right box plot vase plot violin plot and bean plot Within each plot the distributions from left to

right are standard normal (n) right-skewed (s) leptikurtic (k) and bimodal (mm) A normal kernel and bandwidth of

02 are used in all plots for all groups

A more sophisticated display is the sectioned density plot (Cohen and Cohen 2006) which uses both

colour and space to stack a density estimate into a smaller area hopefully without losing any information

(not formally verified with a perceptual study) The sectioned density plot is similar in spirit to horizon

graphs for time series (Reijner 2008) which have been found to be just as readable as regular line graphs

despite taking up much less space (Heer et al 2009) The density strips of Jackson (2008) provide a similar

compact display that uses colour instead of width to display density These methods are shown in Figure 5

6

[40 years of boxplots Wickham and Stryjewski 2012 hadconz]

Idiom Dimensionality reduction for documents

45

Task 1

InHD data

Out2D data

ProduceIn High- dimensional data

WhyWhat

Derive

In2D data

Task 2

Out 2D data

HowWhyWhat

EncodeNavigateSelect

DiscoverExploreIdentify

In 2D dataOut ScatterplotOut Clusters amp points

OutScatterplotClusters amp points

Task 3

InScatterplotClusters amp points

OutLabels for clusters

WhyWhat

ProduceAnnotate

In ScatterplotIn Clusters amp pointsOut Labels for clusters

wombat

bull attribute aggregationndash derive low-dimensional target space from high-dimensional measured space

46

Datasets

WhatAttributes

Dataset Types

Data Types

Data and Dataset Types

Tables

Attributes (columns)

Items (rows)

Cell containing value

Networks

Link

Node (item)

Trees

Fields (Continuous)

Geometry (Spatial)

Attributes (columns)

Value in cell

Cell

Multidimensional Table

Value in cell

Items Attributes Links Positions Grids

Attribute Types

Ordering Direction

Categorical

OrderedOrdinal

Quantitative

Sequential

Diverging

Cyclic

Tables Networks amp Trees

Fields Geometry Clusters Sets Lists

Items

Attributes

Items (nodes)

Links

Attributes

Grids

Positions

Attributes

Items

Positions

Items

Grid of positions

Position

Trends

Actions

Analyze

Search

Query

Why

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Network Data

Spatial Data

Topology

Paths

Extremes

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

Identify Compare Summarize

tag

Target known Target unknown

Location knownLocation unknown

Lookup

Locate

Browse

Explore

Targets

Why

What

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

algorithm

idiom

abstraction

domain

More Informationbull this talk

httpwwwcsubcca~tmmtalkshtmlvad15d3

bull book page (including tutorial lecture slides)httpwwwcsubcca~tmmvadbook

ndash 20 promo code for book+ebook combo HVN17

ndash httpwwwcrcpresscomproductisbn9781466508910

ndash illustrations Eamonn Maguire

bull papers videos software talks full courses httpwwwcsubccagroupinfovis httpwwwcsubcca~tmm

47Munzner A K Peters Visualization Series CRC Press Visualization Series 2014

Visualization Analysis and Design

tamaramunzner

Page 32: Visualization Analysis & Designii.uib.no/vis_old/publications/pdfs/2016-01-17--vad15d3unconf--edited-by-HH.pdf · Visualization is suitable when there is a need to augment human capabilities

How to handle complexity 3 more strategies

28

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull change view over timebull facet across multiple

viewsbull reduce itemsattributes

within single viewbull derive new data to

show within view

How to handle complexity 3 more strategies

29

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull change over time- most obvious amp flexible

of the 4 strategies

Idiom Animated transitionsbull smooth transition from one state to another

ndash alternative to jump cutsndash support for item tracking when amount of change is limited

bull example multilevel matrix viewsndash scope of what is shown narrows down

bull middle block stretches to fill space additional structure appears withinbull other blocks squish down to increasingly aggregated representations

30[Using Multilevel Call Matrices in Large Software Projects van Ham Proc IEEE Symp Information Visualization (InfoVis) pp 227ndash232 2003]

How to handle complexity 3 more strategies

31

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull facet data across multiple views

Facet

32

Juxtapose

Partition

Superimpose

Coordinate Multiple Side By Side Views

Share Encoding SameDierent

Share Data AllSubsetNone

Share Navigation

Linked Highlighting

Idiom Linked highlighting

33

System EDVbull see how regions

contiguous in one view are distributed within anotherndash powerful and pervasive

interaction idiom

bull encoding differentndashmultiform

bull data all shared

[Visual Exploration of Large Structured Datasets Wills Proc New Techniques and Trends in Statistics (NTTS) pp 237ndash246 IOS Press 1995]

Idiom birdrsquos-eye maps

34

bull encoding samebull data subset sharedbull navigation shared

ndash bidirectional linking

bull differencesndash viewpointndash (size)

bull overview-detail

System Google Maps

[A Review of Overview+Detail Zooming and Focus+Context Interfaces Cockburn Karlson and Bederson ACM Computing Surveys 411 (2008) 1ndash31]

Idiom Small multiplesbull encoding samebull data none shared

ndash different attributes for node colors

ndash (same network layout)

bull navigation shared

35

System Cerebral

[Cerebral Visualizing Multiple Experimental Conditions on a Graph with Biological Context Barsky Munzner Gardy and Kincaid IEEE Trans Visualization and Computer Graphics (Proc InfoVis 2008) 146 (2008) 1253ndash1260]

Coordinate views Design choice interaction

36

All Subset

Same

Multiform

Multiform Overview

Detail

None

Redundant

No Linkage

Small Multiples

OverviewDetail

bull why juxtapose viewsndash benefits eyes vs memory

bull lower cognitive load to move eyes between 2 views than remembering previous state with single changing view

ndash costs display area 2 views side by side each have only half the area of one view

Partition into views

37

bull how to divide data between viewsndash encodes association between items

using spatial proximity ndash major implications for what patterns

are visiblendash split according to attributes

bull design choicesndash how many splits

bull all the way down one mark per regionbull stop earlier for more complex structure

within region

ndash order in which attribs used to splitndash how many views

Partition into Side-by-Side Views

Partitioning List alignmentbull single bar chart with grouped bars

ndash split by state into regionsbull complex glyph within each region showing all ages

ndash compare easy within state hard across ages

bull small-multiple bar chartsndash split by age into regions

bull one chart per region

ndash compare easy within age harder across states

38

110

100

90

80

70

60

50

40

30

20

10

00 CA TK NY FL IL PA

65 Years and Over45 to 64 Years25 to 44 Years18 to 24 Years14 to 17 Years5 to 13 YearsUnder 5 Years

CA TK NY FL IL PA

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

httpblocksorgmbostock3887051 httpblocksorgmbostock4679202

Partitioning Recursive subdivision

bull split by neighborhoodbull then by type bull then time

ndash years as rowsndash months as columns

bull color by price

bull neighborhood patternsndash where itrsquos expensivendash where you pay much more

for detached type

39[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull switch order of splitsndash type then neighborhood

bull switch colorndash by price variation

bull type patternsndash within specific type which

neighborhoods inconsistent

40[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull different encoding for second-level regionsndash choropleth maps

41[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

How to handle complexity 3 more strategies

42

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull reduce what is shown within single view

Reduce items and attributes

43

bull reduceincrease inversesbull filter

ndash pro straightforward and intuitivebull to understand and compute

ndash con out of sight out of mind

bull aggregationndash pro inform about whole setndash con difficult to avoid losing signal

bull not mutually exclusivendash combine filter aggregatendash combine reduce facet change derive

Reduce

Filter

Aggregate

Embed

Reducing Items and Attributes

FilterItems

Attributes

Aggregate

Items

Attributes

Idiom boxplotbull static item aggregationbull task find distributionbull data tablebull derived data

ndash 5 quant attribsbull median central linebull lower and upper quartile boxesbull lower upper fences whiskers

ndash values beyond which items are outliers

ndash outliers beyond fence cutoffs explicitly shown

44

pod and the rug plot looks like the seeds within Kampstra (2008) also suggests a way of comparing two

groups more easily use the left and right sides of the bean to display different distributions A related idea

is the raindrop plot (Barrowman and Myers 2003) but its focus is on the display of error distributions from

complex models

Figure 4 demonstrates these density boxplots applied to 100 numbers drawn from each of four distribu-

tions with mean 0 and standard deviation 1 a standard normal a skew-right distribution (Johnson distri-

bution with skewness 22 and kurtosis 13) a leptikurtic distribution (Johnson distribution with skewness 0

and kurtosis 20) and a bimodal distribution (two normals with mean -095 and 095 and standard devia-

tion 031) Richer displays of density make it much easier to see important variations in the distribution

multi-modality is particularly important and yet completely invisible with the boxplot

n s k mm

2

02

4

n s k mm

2

02

4

n s k mm

4

2

02

4

4

2

02

4

n s k mm

Figure 4 From left to right box plot vase plot violin plot and bean plot Within each plot the distributions from left to

right are standard normal (n) right-skewed (s) leptikurtic (k) and bimodal (mm) A normal kernel and bandwidth of

02 are used in all plots for all groups

A more sophisticated display is the sectioned density plot (Cohen and Cohen 2006) which uses both

colour and space to stack a density estimate into a smaller area hopefully without losing any information

(not formally verified with a perceptual study) The sectioned density plot is similar in spirit to horizon

graphs for time series (Reijner 2008) which have been found to be just as readable as regular line graphs

despite taking up much less space (Heer et al 2009) The density strips of Jackson (2008) provide a similar

compact display that uses colour instead of width to display density These methods are shown in Figure 5

6

[40 years of boxplots Wickham and Stryjewski 2012 hadconz]

Idiom Dimensionality reduction for documents

45

Task 1

InHD data

Out2D data

ProduceIn High- dimensional data

WhyWhat

Derive

In2D data

Task 2

Out 2D data

HowWhyWhat

EncodeNavigateSelect

DiscoverExploreIdentify

In 2D dataOut ScatterplotOut Clusters amp points

OutScatterplotClusters amp points

Task 3

InScatterplotClusters amp points

OutLabels for clusters

WhyWhat

ProduceAnnotate

In ScatterplotIn Clusters amp pointsOut Labels for clusters

wombat

bull attribute aggregationndash derive low-dimensional target space from high-dimensional measured space

46

Datasets

WhatAttributes

Dataset Types

Data Types

Data and Dataset Types

Tables

Attributes (columns)

Items (rows)

Cell containing value

Networks

Link

Node (item)

Trees

Fields (Continuous)

Geometry (Spatial)

Attributes (columns)

Value in cell

Cell

Multidimensional Table

Value in cell

Items Attributes Links Positions Grids

Attribute Types

Ordering Direction

Categorical

OrderedOrdinal

Quantitative

Sequential

Diverging

Cyclic

Tables Networks amp Trees

Fields Geometry Clusters Sets Lists

Items

Attributes

Items (nodes)

Links

Attributes

Grids

Positions

Attributes

Items

Positions

Items

Grid of positions

Position

Trends

Actions

Analyze

Search

Query

Why

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Network Data

Spatial Data

Topology

Paths

Extremes

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

Identify Compare Summarize

tag

Target known Target unknown

Location knownLocation unknown

Lookup

Locate

Browse

Explore

Targets

Why

What

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

algorithm

idiom

abstraction

domain

More Informationbull this talk

httpwwwcsubcca~tmmtalkshtmlvad15d3

bull book page (including tutorial lecture slides)httpwwwcsubcca~tmmvadbook

ndash 20 promo code for book+ebook combo HVN17

ndash httpwwwcrcpresscomproductisbn9781466508910

ndash illustrations Eamonn Maguire

bull papers videos software talks full courses httpwwwcsubccagroupinfovis httpwwwcsubcca~tmm

47Munzner A K Peters Visualization Series CRC Press Visualization Series 2014

Visualization Analysis and Design

tamaramunzner

Page 33: Visualization Analysis & Designii.uib.no/vis_old/publications/pdfs/2016-01-17--vad15d3unconf--edited-by-HH.pdf · Visualization is suitable when there is a need to augment human capabilities

How to handle complexity 3 more strategies

29

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull change over time- most obvious amp flexible

of the 4 strategies

Idiom Animated transitionsbull smooth transition from one state to another

ndash alternative to jump cutsndash support for item tracking when amount of change is limited

bull example multilevel matrix viewsndash scope of what is shown narrows down

bull middle block stretches to fill space additional structure appears withinbull other blocks squish down to increasingly aggregated representations

30[Using Multilevel Call Matrices in Large Software Projects van Ham Proc IEEE Symp Information Visualization (InfoVis) pp 227ndash232 2003]

How to handle complexity 3 more strategies

31

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull facet data across multiple views

Facet

32

Juxtapose

Partition

Superimpose

Coordinate Multiple Side By Side Views

Share Encoding SameDierent

Share Data AllSubsetNone

Share Navigation

Linked Highlighting

Idiom Linked highlighting

33

System EDVbull see how regions

contiguous in one view are distributed within anotherndash powerful and pervasive

interaction idiom

bull encoding differentndashmultiform

bull data all shared

[Visual Exploration of Large Structured Datasets Wills Proc New Techniques and Trends in Statistics (NTTS) pp 237ndash246 IOS Press 1995]

Idiom birdrsquos-eye maps

34

bull encoding samebull data subset sharedbull navigation shared

ndash bidirectional linking

bull differencesndash viewpointndash (size)

bull overview-detail

System Google Maps

[A Review of Overview+Detail Zooming and Focus+Context Interfaces Cockburn Karlson and Bederson ACM Computing Surveys 411 (2008) 1ndash31]

Idiom Small multiplesbull encoding samebull data none shared

ndash different attributes for node colors

ndash (same network layout)

bull navigation shared

35

System Cerebral

[Cerebral Visualizing Multiple Experimental Conditions on a Graph with Biological Context Barsky Munzner Gardy and Kincaid IEEE Trans Visualization and Computer Graphics (Proc InfoVis 2008) 146 (2008) 1253ndash1260]

Coordinate views Design choice interaction

36

All Subset

Same

Multiform

Multiform Overview

Detail

None

Redundant

No Linkage

Small Multiples

OverviewDetail

bull why juxtapose viewsndash benefits eyes vs memory

bull lower cognitive load to move eyes between 2 views than remembering previous state with single changing view

ndash costs display area 2 views side by side each have only half the area of one view

Partition into views

37

bull how to divide data between viewsndash encodes association between items

using spatial proximity ndash major implications for what patterns

are visiblendash split according to attributes

bull design choicesndash how many splits

bull all the way down one mark per regionbull stop earlier for more complex structure

within region

ndash order in which attribs used to splitndash how many views

Partition into Side-by-Side Views

Partitioning List alignmentbull single bar chart with grouped bars

ndash split by state into regionsbull complex glyph within each region showing all ages

ndash compare easy within state hard across ages

bull small-multiple bar chartsndash split by age into regions

bull one chart per region

ndash compare easy within age harder across states

38

110

100

90

80

70

60

50

40

30

20

10

00 CA TK NY FL IL PA

65 Years and Over45 to 64 Years25 to 44 Years18 to 24 Years14 to 17 Years5 to 13 YearsUnder 5 Years

CA TK NY FL IL PA

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

httpblocksorgmbostock3887051 httpblocksorgmbostock4679202

Partitioning Recursive subdivision

bull split by neighborhoodbull then by type bull then time

ndash years as rowsndash months as columns

bull color by price

bull neighborhood patternsndash where itrsquos expensivendash where you pay much more

for detached type

39[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull switch order of splitsndash type then neighborhood

bull switch colorndash by price variation

bull type patternsndash within specific type which

neighborhoods inconsistent

40[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull different encoding for second-level regionsndash choropleth maps

41[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

How to handle complexity 3 more strategies

42

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull reduce what is shown within single view

Reduce items and attributes

43

bull reduceincrease inversesbull filter

ndash pro straightforward and intuitivebull to understand and compute

ndash con out of sight out of mind

bull aggregationndash pro inform about whole setndash con difficult to avoid losing signal

bull not mutually exclusivendash combine filter aggregatendash combine reduce facet change derive

Reduce

Filter

Aggregate

Embed

Reducing Items and Attributes

FilterItems

Attributes

Aggregate

Items

Attributes

Idiom boxplotbull static item aggregationbull task find distributionbull data tablebull derived data

ndash 5 quant attribsbull median central linebull lower and upper quartile boxesbull lower upper fences whiskers

ndash values beyond which items are outliers

ndash outliers beyond fence cutoffs explicitly shown

44

pod and the rug plot looks like the seeds within Kampstra (2008) also suggests a way of comparing two

groups more easily use the left and right sides of the bean to display different distributions A related idea

is the raindrop plot (Barrowman and Myers 2003) but its focus is on the display of error distributions from

complex models

Figure 4 demonstrates these density boxplots applied to 100 numbers drawn from each of four distribu-

tions with mean 0 and standard deviation 1 a standard normal a skew-right distribution (Johnson distri-

bution with skewness 22 and kurtosis 13) a leptikurtic distribution (Johnson distribution with skewness 0

and kurtosis 20) and a bimodal distribution (two normals with mean -095 and 095 and standard devia-

tion 031) Richer displays of density make it much easier to see important variations in the distribution

multi-modality is particularly important and yet completely invisible with the boxplot

n s k mm

2

02

4

n s k mm

2

02

4

n s k mm

4

2

02

4

4

2

02

4

n s k mm

Figure 4 From left to right box plot vase plot violin plot and bean plot Within each plot the distributions from left to

right are standard normal (n) right-skewed (s) leptikurtic (k) and bimodal (mm) A normal kernel and bandwidth of

02 are used in all plots for all groups

A more sophisticated display is the sectioned density plot (Cohen and Cohen 2006) which uses both

colour and space to stack a density estimate into a smaller area hopefully without losing any information

(not formally verified with a perceptual study) The sectioned density plot is similar in spirit to horizon

graphs for time series (Reijner 2008) which have been found to be just as readable as regular line graphs

despite taking up much less space (Heer et al 2009) The density strips of Jackson (2008) provide a similar

compact display that uses colour instead of width to display density These methods are shown in Figure 5

6

[40 years of boxplots Wickham and Stryjewski 2012 hadconz]

Idiom Dimensionality reduction for documents

45

Task 1

InHD data

Out2D data

ProduceIn High- dimensional data

WhyWhat

Derive

In2D data

Task 2

Out 2D data

HowWhyWhat

EncodeNavigateSelect

DiscoverExploreIdentify

In 2D dataOut ScatterplotOut Clusters amp points

OutScatterplotClusters amp points

Task 3

InScatterplotClusters amp points

OutLabels for clusters

WhyWhat

ProduceAnnotate

In ScatterplotIn Clusters amp pointsOut Labels for clusters

wombat

bull attribute aggregationndash derive low-dimensional target space from high-dimensional measured space

46

Datasets

WhatAttributes

Dataset Types

Data Types

Data and Dataset Types

Tables

Attributes (columns)

Items (rows)

Cell containing value

Networks

Link

Node (item)

Trees

Fields (Continuous)

Geometry (Spatial)

Attributes (columns)

Value in cell

Cell

Multidimensional Table

Value in cell

Items Attributes Links Positions Grids

Attribute Types

Ordering Direction

Categorical

OrderedOrdinal

Quantitative

Sequential

Diverging

Cyclic

Tables Networks amp Trees

Fields Geometry Clusters Sets Lists

Items

Attributes

Items (nodes)

Links

Attributes

Grids

Positions

Attributes

Items

Positions

Items

Grid of positions

Position

Trends

Actions

Analyze

Search

Query

Why

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Network Data

Spatial Data

Topology

Paths

Extremes

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

Identify Compare Summarize

tag

Target known Target unknown

Location knownLocation unknown

Lookup

Locate

Browse

Explore

Targets

Why

What

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

algorithm

idiom

abstraction

domain

More Informationbull this talk

httpwwwcsubcca~tmmtalkshtmlvad15d3

bull book page (including tutorial lecture slides)httpwwwcsubcca~tmmvadbook

ndash 20 promo code for book+ebook combo HVN17

ndash httpwwwcrcpresscomproductisbn9781466508910

ndash illustrations Eamonn Maguire

bull papers videos software talks full courses httpwwwcsubccagroupinfovis httpwwwcsubcca~tmm

47Munzner A K Peters Visualization Series CRC Press Visualization Series 2014

Visualization Analysis and Design

tamaramunzner

Page 34: Visualization Analysis & Designii.uib.no/vis_old/publications/pdfs/2016-01-17--vad15d3unconf--edited-by-HH.pdf · Visualization is suitable when there is a need to augment human capabilities

Idiom Animated transitionsbull smooth transition from one state to another

ndash alternative to jump cutsndash support for item tracking when amount of change is limited

bull example multilevel matrix viewsndash scope of what is shown narrows down

bull middle block stretches to fill space additional structure appears withinbull other blocks squish down to increasingly aggregated representations

30[Using Multilevel Call Matrices in Large Software Projects van Ham Proc IEEE Symp Information Visualization (InfoVis) pp 227ndash232 2003]

How to handle complexity 3 more strategies

31

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull facet data across multiple views

Facet

32

Juxtapose

Partition

Superimpose

Coordinate Multiple Side By Side Views

Share Encoding SameDierent

Share Data AllSubsetNone

Share Navigation

Linked Highlighting

Idiom Linked highlighting

33

System EDVbull see how regions

contiguous in one view are distributed within anotherndash powerful and pervasive

interaction idiom

bull encoding differentndashmultiform

bull data all shared

[Visual Exploration of Large Structured Datasets Wills Proc New Techniques and Trends in Statistics (NTTS) pp 237ndash246 IOS Press 1995]

Idiom birdrsquos-eye maps

34

bull encoding samebull data subset sharedbull navigation shared

ndash bidirectional linking

bull differencesndash viewpointndash (size)

bull overview-detail

System Google Maps

[A Review of Overview+Detail Zooming and Focus+Context Interfaces Cockburn Karlson and Bederson ACM Computing Surveys 411 (2008) 1ndash31]

Idiom Small multiplesbull encoding samebull data none shared

ndash different attributes for node colors

ndash (same network layout)

bull navigation shared

35

System Cerebral

[Cerebral Visualizing Multiple Experimental Conditions on a Graph with Biological Context Barsky Munzner Gardy and Kincaid IEEE Trans Visualization and Computer Graphics (Proc InfoVis 2008) 146 (2008) 1253ndash1260]

Coordinate views Design choice interaction

36

All Subset

Same

Multiform

Multiform Overview

Detail

None

Redundant

No Linkage

Small Multiples

OverviewDetail

bull why juxtapose viewsndash benefits eyes vs memory

bull lower cognitive load to move eyes between 2 views than remembering previous state with single changing view

ndash costs display area 2 views side by side each have only half the area of one view

Partition into views

37

bull how to divide data between viewsndash encodes association between items

using spatial proximity ndash major implications for what patterns

are visiblendash split according to attributes

bull design choicesndash how many splits

bull all the way down one mark per regionbull stop earlier for more complex structure

within region

ndash order in which attribs used to splitndash how many views

Partition into Side-by-Side Views

Partitioning List alignmentbull single bar chart with grouped bars

ndash split by state into regionsbull complex glyph within each region showing all ages

ndash compare easy within state hard across ages

bull small-multiple bar chartsndash split by age into regions

bull one chart per region

ndash compare easy within age harder across states

38

110

100

90

80

70

60

50

40

30

20

10

00 CA TK NY FL IL PA

65 Years and Over45 to 64 Years25 to 44 Years18 to 24 Years14 to 17 Years5 to 13 YearsUnder 5 Years

CA TK NY FL IL PA

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

httpblocksorgmbostock3887051 httpblocksorgmbostock4679202

Partitioning Recursive subdivision

bull split by neighborhoodbull then by type bull then time

ndash years as rowsndash months as columns

bull color by price

bull neighborhood patternsndash where itrsquos expensivendash where you pay much more

for detached type

39[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull switch order of splitsndash type then neighborhood

bull switch colorndash by price variation

bull type patternsndash within specific type which

neighborhoods inconsistent

40[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull different encoding for second-level regionsndash choropleth maps

41[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

How to handle complexity 3 more strategies

42

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull reduce what is shown within single view

Reduce items and attributes

43

bull reduceincrease inversesbull filter

ndash pro straightforward and intuitivebull to understand and compute

ndash con out of sight out of mind

bull aggregationndash pro inform about whole setndash con difficult to avoid losing signal

bull not mutually exclusivendash combine filter aggregatendash combine reduce facet change derive

Reduce

Filter

Aggregate

Embed

Reducing Items and Attributes

FilterItems

Attributes

Aggregate

Items

Attributes

Idiom boxplotbull static item aggregationbull task find distributionbull data tablebull derived data

ndash 5 quant attribsbull median central linebull lower and upper quartile boxesbull lower upper fences whiskers

ndash values beyond which items are outliers

ndash outliers beyond fence cutoffs explicitly shown

44

pod and the rug plot looks like the seeds within Kampstra (2008) also suggests a way of comparing two

groups more easily use the left and right sides of the bean to display different distributions A related idea

is the raindrop plot (Barrowman and Myers 2003) but its focus is on the display of error distributions from

complex models

Figure 4 demonstrates these density boxplots applied to 100 numbers drawn from each of four distribu-

tions with mean 0 and standard deviation 1 a standard normal a skew-right distribution (Johnson distri-

bution with skewness 22 and kurtosis 13) a leptikurtic distribution (Johnson distribution with skewness 0

and kurtosis 20) and a bimodal distribution (two normals with mean -095 and 095 and standard devia-

tion 031) Richer displays of density make it much easier to see important variations in the distribution

multi-modality is particularly important and yet completely invisible with the boxplot

n s k mm

2

02

4

n s k mm

2

02

4

n s k mm

4

2

02

4

4

2

02

4

n s k mm

Figure 4 From left to right box plot vase plot violin plot and bean plot Within each plot the distributions from left to

right are standard normal (n) right-skewed (s) leptikurtic (k) and bimodal (mm) A normal kernel and bandwidth of

02 are used in all plots for all groups

A more sophisticated display is the sectioned density plot (Cohen and Cohen 2006) which uses both

colour and space to stack a density estimate into a smaller area hopefully without losing any information

(not formally verified with a perceptual study) The sectioned density plot is similar in spirit to horizon

graphs for time series (Reijner 2008) which have been found to be just as readable as regular line graphs

despite taking up much less space (Heer et al 2009) The density strips of Jackson (2008) provide a similar

compact display that uses colour instead of width to display density These methods are shown in Figure 5

6

[40 years of boxplots Wickham and Stryjewski 2012 hadconz]

Idiom Dimensionality reduction for documents

45

Task 1

InHD data

Out2D data

ProduceIn High- dimensional data

WhyWhat

Derive

In2D data

Task 2

Out 2D data

HowWhyWhat

EncodeNavigateSelect

DiscoverExploreIdentify

In 2D dataOut ScatterplotOut Clusters amp points

OutScatterplotClusters amp points

Task 3

InScatterplotClusters amp points

OutLabels for clusters

WhyWhat

ProduceAnnotate

In ScatterplotIn Clusters amp pointsOut Labels for clusters

wombat

bull attribute aggregationndash derive low-dimensional target space from high-dimensional measured space

46

Datasets

WhatAttributes

Dataset Types

Data Types

Data and Dataset Types

Tables

Attributes (columns)

Items (rows)

Cell containing value

Networks

Link

Node (item)

Trees

Fields (Continuous)

Geometry (Spatial)

Attributes (columns)

Value in cell

Cell

Multidimensional Table

Value in cell

Items Attributes Links Positions Grids

Attribute Types

Ordering Direction

Categorical

OrderedOrdinal

Quantitative

Sequential

Diverging

Cyclic

Tables Networks amp Trees

Fields Geometry Clusters Sets Lists

Items

Attributes

Items (nodes)

Links

Attributes

Grids

Positions

Attributes

Items

Positions

Items

Grid of positions

Position

Trends

Actions

Analyze

Search

Query

Why

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Network Data

Spatial Data

Topology

Paths

Extremes

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

Identify Compare Summarize

tag

Target known Target unknown

Location knownLocation unknown

Lookup

Locate

Browse

Explore

Targets

Why

What

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

algorithm

idiom

abstraction

domain

More Informationbull this talk

httpwwwcsubcca~tmmtalkshtmlvad15d3

bull book page (including tutorial lecture slides)httpwwwcsubcca~tmmvadbook

ndash 20 promo code for book+ebook combo HVN17

ndash httpwwwcrcpresscomproductisbn9781466508910

ndash illustrations Eamonn Maguire

bull papers videos software talks full courses httpwwwcsubccagroupinfovis httpwwwcsubcca~tmm

47Munzner A K Peters Visualization Series CRC Press Visualization Series 2014

Visualization Analysis and Design

tamaramunzner

Page 35: Visualization Analysis & Designii.uib.no/vis_old/publications/pdfs/2016-01-17--vad15d3unconf--edited-by-HH.pdf · Visualization is suitable when there is a need to augment human capabilities

How to handle complexity 3 more strategies

31

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull facet data across multiple views

Facet

32

Juxtapose

Partition

Superimpose

Coordinate Multiple Side By Side Views

Share Encoding SameDierent

Share Data AllSubsetNone

Share Navigation

Linked Highlighting

Idiom Linked highlighting

33

System EDVbull see how regions

contiguous in one view are distributed within anotherndash powerful and pervasive

interaction idiom

bull encoding differentndashmultiform

bull data all shared

[Visual Exploration of Large Structured Datasets Wills Proc New Techniques and Trends in Statistics (NTTS) pp 237ndash246 IOS Press 1995]

Idiom birdrsquos-eye maps

34

bull encoding samebull data subset sharedbull navigation shared

ndash bidirectional linking

bull differencesndash viewpointndash (size)

bull overview-detail

System Google Maps

[A Review of Overview+Detail Zooming and Focus+Context Interfaces Cockburn Karlson and Bederson ACM Computing Surveys 411 (2008) 1ndash31]

Idiom Small multiplesbull encoding samebull data none shared

ndash different attributes for node colors

ndash (same network layout)

bull navigation shared

35

System Cerebral

[Cerebral Visualizing Multiple Experimental Conditions on a Graph with Biological Context Barsky Munzner Gardy and Kincaid IEEE Trans Visualization and Computer Graphics (Proc InfoVis 2008) 146 (2008) 1253ndash1260]

Coordinate views Design choice interaction

36

All Subset

Same

Multiform

Multiform Overview

Detail

None

Redundant

No Linkage

Small Multiples

OverviewDetail

bull why juxtapose viewsndash benefits eyes vs memory

bull lower cognitive load to move eyes between 2 views than remembering previous state with single changing view

ndash costs display area 2 views side by side each have only half the area of one view

Partition into views

37

bull how to divide data between viewsndash encodes association between items

using spatial proximity ndash major implications for what patterns

are visiblendash split according to attributes

bull design choicesndash how many splits

bull all the way down one mark per regionbull stop earlier for more complex structure

within region

ndash order in which attribs used to splitndash how many views

Partition into Side-by-Side Views

Partitioning List alignmentbull single bar chart with grouped bars

ndash split by state into regionsbull complex glyph within each region showing all ages

ndash compare easy within state hard across ages

bull small-multiple bar chartsndash split by age into regions

bull one chart per region

ndash compare easy within age harder across states

38

110

100

90

80

70

60

50

40

30

20

10

00 CA TK NY FL IL PA

65 Years and Over45 to 64 Years25 to 44 Years18 to 24 Years14 to 17 Years5 to 13 YearsUnder 5 Years

CA TK NY FL IL PA

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

httpblocksorgmbostock3887051 httpblocksorgmbostock4679202

Partitioning Recursive subdivision

bull split by neighborhoodbull then by type bull then time

ndash years as rowsndash months as columns

bull color by price

bull neighborhood patternsndash where itrsquos expensivendash where you pay much more

for detached type

39[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull switch order of splitsndash type then neighborhood

bull switch colorndash by price variation

bull type patternsndash within specific type which

neighborhoods inconsistent

40[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull different encoding for second-level regionsndash choropleth maps

41[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

How to handle complexity 3 more strategies

42

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull reduce what is shown within single view

Reduce items and attributes

43

bull reduceincrease inversesbull filter

ndash pro straightforward and intuitivebull to understand and compute

ndash con out of sight out of mind

bull aggregationndash pro inform about whole setndash con difficult to avoid losing signal

bull not mutually exclusivendash combine filter aggregatendash combine reduce facet change derive

Reduce

Filter

Aggregate

Embed

Reducing Items and Attributes

FilterItems

Attributes

Aggregate

Items

Attributes

Idiom boxplotbull static item aggregationbull task find distributionbull data tablebull derived data

ndash 5 quant attribsbull median central linebull lower and upper quartile boxesbull lower upper fences whiskers

ndash values beyond which items are outliers

ndash outliers beyond fence cutoffs explicitly shown

44

pod and the rug plot looks like the seeds within Kampstra (2008) also suggests a way of comparing two

groups more easily use the left and right sides of the bean to display different distributions A related idea

is the raindrop plot (Barrowman and Myers 2003) but its focus is on the display of error distributions from

complex models

Figure 4 demonstrates these density boxplots applied to 100 numbers drawn from each of four distribu-

tions with mean 0 and standard deviation 1 a standard normal a skew-right distribution (Johnson distri-

bution with skewness 22 and kurtosis 13) a leptikurtic distribution (Johnson distribution with skewness 0

and kurtosis 20) and a bimodal distribution (two normals with mean -095 and 095 and standard devia-

tion 031) Richer displays of density make it much easier to see important variations in the distribution

multi-modality is particularly important and yet completely invisible with the boxplot

n s k mm

2

02

4

n s k mm

2

02

4

n s k mm

4

2

02

4

4

2

02

4

n s k mm

Figure 4 From left to right box plot vase plot violin plot and bean plot Within each plot the distributions from left to

right are standard normal (n) right-skewed (s) leptikurtic (k) and bimodal (mm) A normal kernel and bandwidth of

02 are used in all plots for all groups

A more sophisticated display is the sectioned density plot (Cohen and Cohen 2006) which uses both

colour and space to stack a density estimate into a smaller area hopefully without losing any information

(not formally verified with a perceptual study) The sectioned density plot is similar in spirit to horizon

graphs for time series (Reijner 2008) which have been found to be just as readable as regular line graphs

despite taking up much less space (Heer et al 2009) The density strips of Jackson (2008) provide a similar

compact display that uses colour instead of width to display density These methods are shown in Figure 5

6

[40 years of boxplots Wickham and Stryjewski 2012 hadconz]

Idiom Dimensionality reduction for documents

45

Task 1

InHD data

Out2D data

ProduceIn High- dimensional data

WhyWhat

Derive

In2D data

Task 2

Out 2D data

HowWhyWhat

EncodeNavigateSelect

DiscoverExploreIdentify

In 2D dataOut ScatterplotOut Clusters amp points

OutScatterplotClusters amp points

Task 3

InScatterplotClusters amp points

OutLabels for clusters

WhyWhat

ProduceAnnotate

In ScatterplotIn Clusters amp pointsOut Labels for clusters

wombat

bull attribute aggregationndash derive low-dimensional target space from high-dimensional measured space

46

Datasets

WhatAttributes

Dataset Types

Data Types

Data and Dataset Types

Tables

Attributes (columns)

Items (rows)

Cell containing value

Networks

Link

Node (item)

Trees

Fields (Continuous)

Geometry (Spatial)

Attributes (columns)

Value in cell

Cell

Multidimensional Table

Value in cell

Items Attributes Links Positions Grids

Attribute Types

Ordering Direction

Categorical

OrderedOrdinal

Quantitative

Sequential

Diverging

Cyclic

Tables Networks amp Trees

Fields Geometry Clusters Sets Lists

Items

Attributes

Items (nodes)

Links

Attributes

Grids

Positions

Attributes

Items

Positions

Items

Grid of positions

Position

Trends

Actions

Analyze

Search

Query

Why

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Network Data

Spatial Data

Topology

Paths

Extremes

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

Identify Compare Summarize

tag

Target known Target unknown

Location knownLocation unknown

Lookup

Locate

Browse

Explore

Targets

Why

What

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

algorithm

idiom

abstraction

domain

More Informationbull this talk

httpwwwcsubcca~tmmtalkshtmlvad15d3

bull book page (including tutorial lecture slides)httpwwwcsubcca~tmmvadbook

ndash 20 promo code for book+ebook combo HVN17

ndash httpwwwcrcpresscomproductisbn9781466508910

ndash illustrations Eamonn Maguire

bull papers videos software talks full courses httpwwwcsubccagroupinfovis httpwwwcsubcca~tmm

47Munzner A K Peters Visualization Series CRC Press Visualization Series 2014

Visualization Analysis and Design

tamaramunzner

Page 36: Visualization Analysis & Designii.uib.no/vis_old/publications/pdfs/2016-01-17--vad15d3unconf--edited-by-HH.pdf · Visualization is suitable when there is a need to augment human capabilities

Facet

32

Juxtapose

Partition

Superimpose

Coordinate Multiple Side By Side Views

Share Encoding SameDierent

Share Data AllSubsetNone

Share Navigation

Linked Highlighting

Idiom Linked highlighting

33

System EDVbull see how regions

contiguous in one view are distributed within anotherndash powerful and pervasive

interaction idiom

bull encoding differentndashmultiform

bull data all shared

[Visual Exploration of Large Structured Datasets Wills Proc New Techniques and Trends in Statistics (NTTS) pp 237ndash246 IOS Press 1995]

Idiom birdrsquos-eye maps

34

bull encoding samebull data subset sharedbull navigation shared

ndash bidirectional linking

bull differencesndash viewpointndash (size)

bull overview-detail

System Google Maps

[A Review of Overview+Detail Zooming and Focus+Context Interfaces Cockburn Karlson and Bederson ACM Computing Surveys 411 (2008) 1ndash31]

Idiom Small multiplesbull encoding samebull data none shared

ndash different attributes for node colors

ndash (same network layout)

bull navigation shared

35

System Cerebral

[Cerebral Visualizing Multiple Experimental Conditions on a Graph with Biological Context Barsky Munzner Gardy and Kincaid IEEE Trans Visualization and Computer Graphics (Proc InfoVis 2008) 146 (2008) 1253ndash1260]

Coordinate views Design choice interaction

36

All Subset

Same

Multiform

Multiform Overview

Detail

None

Redundant

No Linkage

Small Multiples

OverviewDetail

bull why juxtapose viewsndash benefits eyes vs memory

bull lower cognitive load to move eyes between 2 views than remembering previous state with single changing view

ndash costs display area 2 views side by side each have only half the area of one view

Partition into views

37

bull how to divide data between viewsndash encodes association between items

using spatial proximity ndash major implications for what patterns

are visiblendash split according to attributes

bull design choicesndash how many splits

bull all the way down one mark per regionbull stop earlier for more complex structure

within region

ndash order in which attribs used to splitndash how many views

Partition into Side-by-Side Views

Partitioning List alignmentbull single bar chart with grouped bars

ndash split by state into regionsbull complex glyph within each region showing all ages

ndash compare easy within state hard across ages

bull small-multiple bar chartsndash split by age into regions

bull one chart per region

ndash compare easy within age harder across states

38

110

100

90

80

70

60

50

40

30

20

10

00 CA TK NY FL IL PA

65 Years and Over45 to 64 Years25 to 44 Years18 to 24 Years14 to 17 Years5 to 13 YearsUnder 5 Years

CA TK NY FL IL PA

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

httpblocksorgmbostock3887051 httpblocksorgmbostock4679202

Partitioning Recursive subdivision

bull split by neighborhoodbull then by type bull then time

ndash years as rowsndash months as columns

bull color by price

bull neighborhood patternsndash where itrsquos expensivendash where you pay much more

for detached type

39[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull switch order of splitsndash type then neighborhood

bull switch colorndash by price variation

bull type patternsndash within specific type which

neighborhoods inconsistent

40[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull different encoding for second-level regionsndash choropleth maps

41[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

How to handle complexity 3 more strategies

42

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull reduce what is shown within single view

Reduce items and attributes

43

bull reduceincrease inversesbull filter

ndash pro straightforward and intuitivebull to understand and compute

ndash con out of sight out of mind

bull aggregationndash pro inform about whole setndash con difficult to avoid losing signal

bull not mutually exclusivendash combine filter aggregatendash combine reduce facet change derive

Reduce

Filter

Aggregate

Embed

Reducing Items and Attributes

FilterItems

Attributes

Aggregate

Items

Attributes

Idiom boxplotbull static item aggregationbull task find distributionbull data tablebull derived data

ndash 5 quant attribsbull median central linebull lower and upper quartile boxesbull lower upper fences whiskers

ndash values beyond which items are outliers

ndash outliers beyond fence cutoffs explicitly shown

44

pod and the rug plot looks like the seeds within Kampstra (2008) also suggests a way of comparing two

groups more easily use the left and right sides of the bean to display different distributions A related idea

is the raindrop plot (Barrowman and Myers 2003) but its focus is on the display of error distributions from

complex models

Figure 4 demonstrates these density boxplots applied to 100 numbers drawn from each of four distribu-

tions with mean 0 and standard deviation 1 a standard normal a skew-right distribution (Johnson distri-

bution with skewness 22 and kurtosis 13) a leptikurtic distribution (Johnson distribution with skewness 0

and kurtosis 20) and a bimodal distribution (two normals with mean -095 and 095 and standard devia-

tion 031) Richer displays of density make it much easier to see important variations in the distribution

multi-modality is particularly important and yet completely invisible with the boxplot

n s k mm

2

02

4

n s k mm

2

02

4

n s k mm

4

2

02

4

4

2

02

4

n s k mm

Figure 4 From left to right box plot vase plot violin plot and bean plot Within each plot the distributions from left to

right are standard normal (n) right-skewed (s) leptikurtic (k) and bimodal (mm) A normal kernel and bandwidth of

02 are used in all plots for all groups

A more sophisticated display is the sectioned density plot (Cohen and Cohen 2006) which uses both

colour and space to stack a density estimate into a smaller area hopefully without losing any information

(not formally verified with a perceptual study) The sectioned density plot is similar in spirit to horizon

graphs for time series (Reijner 2008) which have been found to be just as readable as regular line graphs

despite taking up much less space (Heer et al 2009) The density strips of Jackson (2008) provide a similar

compact display that uses colour instead of width to display density These methods are shown in Figure 5

6

[40 years of boxplots Wickham and Stryjewski 2012 hadconz]

Idiom Dimensionality reduction for documents

45

Task 1

InHD data

Out2D data

ProduceIn High- dimensional data

WhyWhat

Derive

In2D data

Task 2

Out 2D data

HowWhyWhat

EncodeNavigateSelect

DiscoverExploreIdentify

In 2D dataOut ScatterplotOut Clusters amp points

OutScatterplotClusters amp points

Task 3

InScatterplotClusters amp points

OutLabels for clusters

WhyWhat

ProduceAnnotate

In ScatterplotIn Clusters amp pointsOut Labels for clusters

wombat

bull attribute aggregationndash derive low-dimensional target space from high-dimensional measured space

46

Datasets

WhatAttributes

Dataset Types

Data Types

Data and Dataset Types

Tables

Attributes (columns)

Items (rows)

Cell containing value

Networks

Link

Node (item)

Trees

Fields (Continuous)

Geometry (Spatial)

Attributes (columns)

Value in cell

Cell

Multidimensional Table

Value in cell

Items Attributes Links Positions Grids

Attribute Types

Ordering Direction

Categorical

OrderedOrdinal

Quantitative

Sequential

Diverging

Cyclic

Tables Networks amp Trees

Fields Geometry Clusters Sets Lists

Items

Attributes

Items (nodes)

Links

Attributes

Grids

Positions

Attributes

Items

Positions

Items

Grid of positions

Position

Trends

Actions

Analyze

Search

Query

Why

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Network Data

Spatial Data

Topology

Paths

Extremes

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

Identify Compare Summarize

tag

Target known Target unknown

Location knownLocation unknown

Lookup

Locate

Browse

Explore

Targets

Why

What

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

algorithm

idiom

abstraction

domain

More Informationbull this talk

httpwwwcsubcca~tmmtalkshtmlvad15d3

bull book page (including tutorial lecture slides)httpwwwcsubcca~tmmvadbook

ndash 20 promo code for book+ebook combo HVN17

ndash httpwwwcrcpresscomproductisbn9781466508910

ndash illustrations Eamonn Maguire

bull papers videos software talks full courses httpwwwcsubccagroupinfovis httpwwwcsubcca~tmm

47Munzner A K Peters Visualization Series CRC Press Visualization Series 2014

Visualization Analysis and Design

tamaramunzner

Page 37: Visualization Analysis & Designii.uib.no/vis_old/publications/pdfs/2016-01-17--vad15d3unconf--edited-by-HH.pdf · Visualization is suitable when there is a need to augment human capabilities

Idiom Linked highlighting

33

System EDVbull see how regions

contiguous in one view are distributed within anotherndash powerful and pervasive

interaction idiom

bull encoding differentndashmultiform

bull data all shared

[Visual Exploration of Large Structured Datasets Wills Proc New Techniques and Trends in Statistics (NTTS) pp 237ndash246 IOS Press 1995]

Idiom birdrsquos-eye maps

34

bull encoding samebull data subset sharedbull navigation shared

ndash bidirectional linking

bull differencesndash viewpointndash (size)

bull overview-detail

System Google Maps

[A Review of Overview+Detail Zooming and Focus+Context Interfaces Cockburn Karlson and Bederson ACM Computing Surveys 411 (2008) 1ndash31]

Idiom Small multiplesbull encoding samebull data none shared

ndash different attributes for node colors

ndash (same network layout)

bull navigation shared

35

System Cerebral

[Cerebral Visualizing Multiple Experimental Conditions on a Graph with Biological Context Barsky Munzner Gardy and Kincaid IEEE Trans Visualization and Computer Graphics (Proc InfoVis 2008) 146 (2008) 1253ndash1260]

Coordinate views Design choice interaction

36

All Subset

Same

Multiform

Multiform Overview

Detail

None

Redundant

No Linkage

Small Multiples

OverviewDetail

bull why juxtapose viewsndash benefits eyes vs memory

bull lower cognitive load to move eyes between 2 views than remembering previous state with single changing view

ndash costs display area 2 views side by side each have only half the area of one view

Partition into views

37

bull how to divide data between viewsndash encodes association between items

using spatial proximity ndash major implications for what patterns

are visiblendash split according to attributes

bull design choicesndash how many splits

bull all the way down one mark per regionbull stop earlier for more complex structure

within region

ndash order in which attribs used to splitndash how many views

Partition into Side-by-Side Views

Partitioning List alignmentbull single bar chart with grouped bars

ndash split by state into regionsbull complex glyph within each region showing all ages

ndash compare easy within state hard across ages

bull small-multiple bar chartsndash split by age into regions

bull one chart per region

ndash compare easy within age harder across states

38

110

100

90

80

70

60

50

40

30

20

10

00 CA TK NY FL IL PA

65 Years and Over45 to 64 Years25 to 44 Years18 to 24 Years14 to 17 Years5 to 13 YearsUnder 5 Years

CA TK NY FL IL PA

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

httpblocksorgmbostock3887051 httpblocksorgmbostock4679202

Partitioning Recursive subdivision

bull split by neighborhoodbull then by type bull then time

ndash years as rowsndash months as columns

bull color by price

bull neighborhood patternsndash where itrsquos expensivendash where you pay much more

for detached type

39[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull switch order of splitsndash type then neighborhood

bull switch colorndash by price variation

bull type patternsndash within specific type which

neighborhoods inconsistent

40[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull different encoding for second-level regionsndash choropleth maps

41[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

How to handle complexity 3 more strategies

42

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull reduce what is shown within single view

Reduce items and attributes

43

bull reduceincrease inversesbull filter

ndash pro straightforward and intuitivebull to understand and compute

ndash con out of sight out of mind

bull aggregationndash pro inform about whole setndash con difficult to avoid losing signal

bull not mutually exclusivendash combine filter aggregatendash combine reduce facet change derive

Reduce

Filter

Aggregate

Embed

Reducing Items and Attributes

FilterItems

Attributes

Aggregate

Items

Attributes

Idiom boxplotbull static item aggregationbull task find distributionbull data tablebull derived data

ndash 5 quant attribsbull median central linebull lower and upper quartile boxesbull lower upper fences whiskers

ndash values beyond which items are outliers

ndash outliers beyond fence cutoffs explicitly shown

44

pod and the rug plot looks like the seeds within Kampstra (2008) also suggests a way of comparing two

groups more easily use the left and right sides of the bean to display different distributions A related idea

is the raindrop plot (Barrowman and Myers 2003) but its focus is on the display of error distributions from

complex models

Figure 4 demonstrates these density boxplots applied to 100 numbers drawn from each of four distribu-

tions with mean 0 and standard deviation 1 a standard normal a skew-right distribution (Johnson distri-

bution with skewness 22 and kurtosis 13) a leptikurtic distribution (Johnson distribution with skewness 0

and kurtosis 20) and a bimodal distribution (two normals with mean -095 and 095 and standard devia-

tion 031) Richer displays of density make it much easier to see important variations in the distribution

multi-modality is particularly important and yet completely invisible with the boxplot

n s k mm

2

02

4

n s k mm

2

02

4

n s k mm

4

2

02

4

4

2

02

4

n s k mm

Figure 4 From left to right box plot vase plot violin plot and bean plot Within each plot the distributions from left to

right are standard normal (n) right-skewed (s) leptikurtic (k) and bimodal (mm) A normal kernel and bandwidth of

02 are used in all plots for all groups

A more sophisticated display is the sectioned density plot (Cohen and Cohen 2006) which uses both

colour and space to stack a density estimate into a smaller area hopefully without losing any information

(not formally verified with a perceptual study) The sectioned density plot is similar in spirit to horizon

graphs for time series (Reijner 2008) which have been found to be just as readable as regular line graphs

despite taking up much less space (Heer et al 2009) The density strips of Jackson (2008) provide a similar

compact display that uses colour instead of width to display density These methods are shown in Figure 5

6

[40 years of boxplots Wickham and Stryjewski 2012 hadconz]

Idiom Dimensionality reduction for documents

45

Task 1

InHD data

Out2D data

ProduceIn High- dimensional data

WhyWhat

Derive

In2D data

Task 2

Out 2D data

HowWhyWhat

EncodeNavigateSelect

DiscoverExploreIdentify

In 2D dataOut ScatterplotOut Clusters amp points

OutScatterplotClusters amp points

Task 3

InScatterplotClusters amp points

OutLabels for clusters

WhyWhat

ProduceAnnotate

In ScatterplotIn Clusters amp pointsOut Labels for clusters

wombat

bull attribute aggregationndash derive low-dimensional target space from high-dimensional measured space

46

Datasets

WhatAttributes

Dataset Types

Data Types

Data and Dataset Types

Tables

Attributes (columns)

Items (rows)

Cell containing value

Networks

Link

Node (item)

Trees

Fields (Continuous)

Geometry (Spatial)

Attributes (columns)

Value in cell

Cell

Multidimensional Table

Value in cell

Items Attributes Links Positions Grids

Attribute Types

Ordering Direction

Categorical

OrderedOrdinal

Quantitative

Sequential

Diverging

Cyclic

Tables Networks amp Trees

Fields Geometry Clusters Sets Lists

Items

Attributes

Items (nodes)

Links

Attributes

Grids

Positions

Attributes

Items

Positions

Items

Grid of positions

Position

Trends

Actions

Analyze

Search

Query

Why

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Network Data

Spatial Data

Topology

Paths

Extremes

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

Identify Compare Summarize

tag

Target known Target unknown

Location knownLocation unknown

Lookup

Locate

Browse

Explore

Targets

Why

What

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

algorithm

idiom

abstraction

domain

More Informationbull this talk

httpwwwcsubcca~tmmtalkshtmlvad15d3

bull book page (including tutorial lecture slides)httpwwwcsubcca~tmmvadbook

ndash 20 promo code for book+ebook combo HVN17

ndash httpwwwcrcpresscomproductisbn9781466508910

ndash illustrations Eamonn Maguire

bull papers videos software talks full courses httpwwwcsubccagroupinfovis httpwwwcsubcca~tmm

47Munzner A K Peters Visualization Series CRC Press Visualization Series 2014

Visualization Analysis and Design

tamaramunzner

Page 38: Visualization Analysis & Designii.uib.no/vis_old/publications/pdfs/2016-01-17--vad15d3unconf--edited-by-HH.pdf · Visualization is suitable when there is a need to augment human capabilities

Idiom birdrsquos-eye maps

34

bull encoding samebull data subset sharedbull navigation shared

ndash bidirectional linking

bull differencesndash viewpointndash (size)

bull overview-detail

System Google Maps

[A Review of Overview+Detail Zooming and Focus+Context Interfaces Cockburn Karlson and Bederson ACM Computing Surveys 411 (2008) 1ndash31]

Idiom Small multiplesbull encoding samebull data none shared

ndash different attributes for node colors

ndash (same network layout)

bull navigation shared

35

System Cerebral

[Cerebral Visualizing Multiple Experimental Conditions on a Graph with Biological Context Barsky Munzner Gardy and Kincaid IEEE Trans Visualization and Computer Graphics (Proc InfoVis 2008) 146 (2008) 1253ndash1260]

Coordinate views Design choice interaction

36

All Subset

Same

Multiform

Multiform Overview

Detail

None

Redundant

No Linkage

Small Multiples

OverviewDetail

bull why juxtapose viewsndash benefits eyes vs memory

bull lower cognitive load to move eyes between 2 views than remembering previous state with single changing view

ndash costs display area 2 views side by side each have only half the area of one view

Partition into views

37

bull how to divide data between viewsndash encodes association between items

using spatial proximity ndash major implications for what patterns

are visiblendash split according to attributes

bull design choicesndash how many splits

bull all the way down one mark per regionbull stop earlier for more complex structure

within region

ndash order in which attribs used to splitndash how many views

Partition into Side-by-Side Views

Partitioning List alignmentbull single bar chart with grouped bars

ndash split by state into regionsbull complex glyph within each region showing all ages

ndash compare easy within state hard across ages

bull small-multiple bar chartsndash split by age into regions

bull one chart per region

ndash compare easy within age harder across states

38

110

100

90

80

70

60

50

40

30

20

10

00 CA TK NY FL IL PA

65 Years and Over45 to 64 Years25 to 44 Years18 to 24 Years14 to 17 Years5 to 13 YearsUnder 5 Years

CA TK NY FL IL PA

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

httpblocksorgmbostock3887051 httpblocksorgmbostock4679202

Partitioning Recursive subdivision

bull split by neighborhoodbull then by type bull then time

ndash years as rowsndash months as columns

bull color by price

bull neighborhood patternsndash where itrsquos expensivendash where you pay much more

for detached type

39[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull switch order of splitsndash type then neighborhood

bull switch colorndash by price variation

bull type patternsndash within specific type which

neighborhoods inconsistent

40[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull different encoding for second-level regionsndash choropleth maps

41[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

How to handle complexity 3 more strategies

42

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull reduce what is shown within single view

Reduce items and attributes

43

bull reduceincrease inversesbull filter

ndash pro straightforward and intuitivebull to understand and compute

ndash con out of sight out of mind

bull aggregationndash pro inform about whole setndash con difficult to avoid losing signal

bull not mutually exclusivendash combine filter aggregatendash combine reduce facet change derive

Reduce

Filter

Aggregate

Embed

Reducing Items and Attributes

FilterItems

Attributes

Aggregate

Items

Attributes

Idiom boxplotbull static item aggregationbull task find distributionbull data tablebull derived data

ndash 5 quant attribsbull median central linebull lower and upper quartile boxesbull lower upper fences whiskers

ndash values beyond which items are outliers

ndash outliers beyond fence cutoffs explicitly shown

44

pod and the rug plot looks like the seeds within Kampstra (2008) also suggests a way of comparing two

groups more easily use the left and right sides of the bean to display different distributions A related idea

is the raindrop plot (Barrowman and Myers 2003) but its focus is on the display of error distributions from

complex models

Figure 4 demonstrates these density boxplots applied to 100 numbers drawn from each of four distribu-

tions with mean 0 and standard deviation 1 a standard normal a skew-right distribution (Johnson distri-

bution with skewness 22 and kurtosis 13) a leptikurtic distribution (Johnson distribution with skewness 0

and kurtosis 20) and a bimodal distribution (two normals with mean -095 and 095 and standard devia-

tion 031) Richer displays of density make it much easier to see important variations in the distribution

multi-modality is particularly important and yet completely invisible with the boxplot

n s k mm

2

02

4

n s k mm

2

02

4

n s k mm

4

2

02

4

4

2

02

4

n s k mm

Figure 4 From left to right box plot vase plot violin plot and bean plot Within each plot the distributions from left to

right are standard normal (n) right-skewed (s) leptikurtic (k) and bimodal (mm) A normal kernel and bandwidth of

02 are used in all plots for all groups

A more sophisticated display is the sectioned density plot (Cohen and Cohen 2006) which uses both

colour and space to stack a density estimate into a smaller area hopefully without losing any information

(not formally verified with a perceptual study) The sectioned density plot is similar in spirit to horizon

graphs for time series (Reijner 2008) which have been found to be just as readable as regular line graphs

despite taking up much less space (Heer et al 2009) The density strips of Jackson (2008) provide a similar

compact display that uses colour instead of width to display density These methods are shown in Figure 5

6

[40 years of boxplots Wickham and Stryjewski 2012 hadconz]

Idiom Dimensionality reduction for documents

45

Task 1

InHD data

Out2D data

ProduceIn High- dimensional data

WhyWhat

Derive

In2D data

Task 2

Out 2D data

HowWhyWhat

EncodeNavigateSelect

DiscoverExploreIdentify

In 2D dataOut ScatterplotOut Clusters amp points

OutScatterplotClusters amp points

Task 3

InScatterplotClusters amp points

OutLabels for clusters

WhyWhat

ProduceAnnotate

In ScatterplotIn Clusters amp pointsOut Labels for clusters

wombat

bull attribute aggregationndash derive low-dimensional target space from high-dimensional measured space

46

Datasets

WhatAttributes

Dataset Types

Data Types

Data and Dataset Types

Tables

Attributes (columns)

Items (rows)

Cell containing value

Networks

Link

Node (item)

Trees

Fields (Continuous)

Geometry (Spatial)

Attributes (columns)

Value in cell

Cell

Multidimensional Table

Value in cell

Items Attributes Links Positions Grids

Attribute Types

Ordering Direction

Categorical

OrderedOrdinal

Quantitative

Sequential

Diverging

Cyclic

Tables Networks amp Trees

Fields Geometry Clusters Sets Lists

Items

Attributes

Items (nodes)

Links

Attributes

Grids

Positions

Attributes

Items

Positions

Items

Grid of positions

Position

Trends

Actions

Analyze

Search

Query

Why

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Network Data

Spatial Data

Topology

Paths

Extremes

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

Identify Compare Summarize

tag

Target known Target unknown

Location knownLocation unknown

Lookup

Locate

Browse

Explore

Targets

Why

What

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

algorithm

idiom

abstraction

domain

More Informationbull this talk

httpwwwcsubcca~tmmtalkshtmlvad15d3

bull book page (including tutorial lecture slides)httpwwwcsubcca~tmmvadbook

ndash 20 promo code for book+ebook combo HVN17

ndash httpwwwcrcpresscomproductisbn9781466508910

ndash illustrations Eamonn Maguire

bull papers videos software talks full courses httpwwwcsubccagroupinfovis httpwwwcsubcca~tmm

47Munzner A K Peters Visualization Series CRC Press Visualization Series 2014

Visualization Analysis and Design

tamaramunzner

Page 39: Visualization Analysis & Designii.uib.no/vis_old/publications/pdfs/2016-01-17--vad15d3unconf--edited-by-HH.pdf · Visualization is suitable when there is a need to augment human capabilities

Idiom Small multiplesbull encoding samebull data none shared

ndash different attributes for node colors

ndash (same network layout)

bull navigation shared

35

System Cerebral

[Cerebral Visualizing Multiple Experimental Conditions on a Graph with Biological Context Barsky Munzner Gardy and Kincaid IEEE Trans Visualization and Computer Graphics (Proc InfoVis 2008) 146 (2008) 1253ndash1260]

Coordinate views Design choice interaction

36

All Subset

Same

Multiform

Multiform Overview

Detail

None

Redundant

No Linkage

Small Multiples

OverviewDetail

bull why juxtapose viewsndash benefits eyes vs memory

bull lower cognitive load to move eyes between 2 views than remembering previous state with single changing view

ndash costs display area 2 views side by side each have only half the area of one view

Partition into views

37

bull how to divide data between viewsndash encodes association between items

using spatial proximity ndash major implications for what patterns

are visiblendash split according to attributes

bull design choicesndash how many splits

bull all the way down one mark per regionbull stop earlier for more complex structure

within region

ndash order in which attribs used to splitndash how many views

Partition into Side-by-Side Views

Partitioning List alignmentbull single bar chart with grouped bars

ndash split by state into regionsbull complex glyph within each region showing all ages

ndash compare easy within state hard across ages

bull small-multiple bar chartsndash split by age into regions

bull one chart per region

ndash compare easy within age harder across states

38

110

100

90

80

70

60

50

40

30

20

10

00 CA TK NY FL IL PA

65 Years and Over45 to 64 Years25 to 44 Years18 to 24 Years14 to 17 Years5 to 13 YearsUnder 5 Years

CA TK NY FL IL PA

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

httpblocksorgmbostock3887051 httpblocksorgmbostock4679202

Partitioning Recursive subdivision

bull split by neighborhoodbull then by type bull then time

ndash years as rowsndash months as columns

bull color by price

bull neighborhood patternsndash where itrsquos expensivendash where you pay much more

for detached type

39[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull switch order of splitsndash type then neighborhood

bull switch colorndash by price variation

bull type patternsndash within specific type which

neighborhoods inconsistent

40[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull different encoding for second-level regionsndash choropleth maps

41[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

How to handle complexity 3 more strategies

42

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull reduce what is shown within single view

Reduce items and attributes

43

bull reduceincrease inversesbull filter

ndash pro straightforward and intuitivebull to understand and compute

ndash con out of sight out of mind

bull aggregationndash pro inform about whole setndash con difficult to avoid losing signal

bull not mutually exclusivendash combine filter aggregatendash combine reduce facet change derive

Reduce

Filter

Aggregate

Embed

Reducing Items and Attributes

FilterItems

Attributes

Aggregate

Items

Attributes

Idiom boxplotbull static item aggregationbull task find distributionbull data tablebull derived data

ndash 5 quant attribsbull median central linebull lower and upper quartile boxesbull lower upper fences whiskers

ndash values beyond which items are outliers

ndash outliers beyond fence cutoffs explicitly shown

44

pod and the rug plot looks like the seeds within Kampstra (2008) also suggests a way of comparing two

groups more easily use the left and right sides of the bean to display different distributions A related idea

is the raindrop plot (Barrowman and Myers 2003) but its focus is on the display of error distributions from

complex models

Figure 4 demonstrates these density boxplots applied to 100 numbers drawn from each of four distribu-

tions with mean 0 and standard deviation 1 a standard normal a skew-right distribution (Johnson distri-

bution with skewness 22 and kurtosis 13) a leptikurtic distribution (Johnson distribution with skewness 0

and kurtosis 20) and a bimodal distribution (two normals with mean -095 and 095 and standard devia-

tion 031) Richer displays of density make it much easier to see important variations in the distribution

multi-modality is particularly important and yet completely invisible with the boxplot

n s k mm

2

02

4

n s k mm

2

02

4

n s k mm

4

2

02

4

4

2

02

4

n s k mm

Figure 4 From left to right box plot vase plot violin plot and bean plot Within each plot the distributions from left to

right are standard normal (n) right-skewed (s) leptikurtic (k) and bimodal (mm) A normal kernel and bandwidth of

02 are used in all plots for all groups

A more sophisticated display is the sectioned density plot (Cohen and Cohen 2006) which uses both

colour and space to stack a density estimate into a smaller area hopefully without losing any information

(not formally verified with a perceptual study) The sectioned density plot is similar in spirit to horizon

graphs for time series (Reijner 2008) which have been found to be just as readable as regular line graphs

despite taking up much less space (Heer et al 2009) The density strips of Jackson (2008) provide a similar

compact display that uses colour instead of width to display density These methods are shown in Figure 5

6

[40 years of boxplots Wickham and Stryjewski 2012 hadconz]

Idiom Dimensionality reduction for documents

45

Task 1

InHD data

Out2D data

ProduceIn High- dimensional data

WhyWhat

Derive

In2D data

Task 2

Out 2D data

HowWhyWhat

EncodeNavigateSelect

DiscoverExploreIdentify

In 2D dataOut ScatterplotOut Clusters amp points

OutScatterplotClusters amp points

Task 3

InScatterplotClusters amp points

OutLabels for clusters

WhyWhat

ProduceAnnotate

In ScatterplotIn Clusters amp pointsOut Labels for clusters

wombat

bull attribute aggregationndash derive low-dimensional target space from high-dimensional measured space

46

Datasets

WhatAttributes

Dataset Types

Data Types

Data and Dataset Types

Tables

Attributes (columns)

Items (rows)

Cell containing value

Networks

Link

Node (item)

Trees

Fields (Continuous)

Geometry (Spatial)

Attributes (columns)

Value in cell

Cell

Multidimensional Table

Value in cell

Items Attributes Links Positions Grids

Attribute Types

Ordering Direction

Categorical

OrderedOrdinal

Quantitative

Sequential

Diverging

Cyclic

Tables Networks amp Trees

Fields Geometry Clusters Sets Lists

Items

Attributes

Items (nodes)

Links

Attributes

Grids

Positions

Attributes

Items

Positions

Items

Grid of positions

Position

Trends

Actions

Analyze

Search

Query

Why

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Network Data

Spatial Data

Topology

Paths

Extremes

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

Identify Compare Summarize

tag

Target known Target unknown

Location knownLocation unknown

Lookup

Locate

Browse

Explore

Targets

Why

What

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

algorithm

idiom

abstraction

domain

More Informationbull this talk

httpwwwcsubcca~tmmtalkshtmlvad15d3

bull book page (including tutorial lecture slides)httpwwwcsubcca~tmmvadbook

ndash 20 promo code for book+ebook combo HVN17

ndash httpwwwcrcpresscomproductisbn9781466508910

ndash illustrations Eamonn Maguire

bull papers videos software talks full courses httpwwwcsubccagroupinfovis httpwwwcsubcca~tmm

47Munzner A K Peters Visualization Series CRC Press Visualization Series 2014

Visualization Analysis and Design

tamaramunzner

Page 40: Visualization Analysis & Designii.uib.no/vis_old/publications/pdfs/2016-01-17--vad15d3unconf--edited-by-HH.pdf · Visualization is suitable when there is a need to augment human capabilities

Coordinate views Design choice interaction

36

All Subset

Same

Multiform

Multiform Overview

Detail

None

Redundant

No Linkage

Small Multiples

OverviewDetail

bull why juxtapose viewsndash benefits eyes vs memory

bull lower cognitive load to move eyes between 2 views than remembering previous state with single changing view

ndash costs display area 2 views side by side each have only half the area of one view

Partition into views

37

bull how to divide data between viewsndash encodes association between items

using spatial proximity ndash major implications for what patterns

are visiblendash split according to attributes

bull design choicesndash how many splits

bull all the way down one mark per regionbull stop earlier for more complex structure

within region

ndash order in which attribs used to splitndash how many views

Partition into Side-by-Side Views

Partitioning List alignmentbull single bar chart with grouped bars

ndash split by state into regionsbull complex glyph within each region showing all ages

ndash compare easy within state hard across ages

bull small-multiple bar chartsndash split by age into regions

bull one chart per region

ndash compare easy within age harder across states

38

110

100

90

80

70

60

50

40

30

20

10

00 CA TK NY FL IL PA

65 Years and Over45 to 64 Years25 to 44 Years18 to 24 Years14 to 17 Years5 to 13 YearsUnder 5 Years

CA TK NY FL IL PA

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

httpblocksorgmbostock3887051 httpblocksorgmbostock4679202

Partitioning Recursive subdivision

bull split by neighborhoodbull then by type bull then time

ndash years as rowsndash months as columns

bull color by price

bull neighborhood patternsndash where itrsquos expensivendash where you pay much more

for detached type

39[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull switch order of splitsndash type then neighborhood

bull switch colorndash by price variation

bull type patternsndash within specific type which

neighborhoods inconsistent

40[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull different encoding for second-level regionsndash choropleth maps

41[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

How to handle complexity 3 more strategies

42

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull reduce what is shown within single view

Reduce items and attributes

43

bull reduceincrease inversesbull filter

ndash pro straightforward and intuitivebull to understand and compute

ndash con out of sight out of mind

bull aggregationndash pro inform about whole setndash con difficult to avoid losing signal

bull not mutually exclusivendash combine filter aggregatendash combine reduce facet change derive

Reduce

Filter

Aggregate

Embed

Reducing Items and Attributes

FilterItems

Attributes

Aggregate

Items

Attributes

Idiom boxplotbull static item aggregationbull task find distributionbull data tablebull derived data

ndash 5 quant attribsbull median central linebull lower and upper quartile boxesbull lower upper fences whiskers

ndash values beyond which items are outliers

ndash outliers beyond fence cutoffs explicitly shown

44

pod and the rug plot looks like the seeds within Kampstra (2008) also suggests a way of comparing two

groups more easily use the left and right sides of the bean to display different distributions A related idea

is the raindrop plot (Barrowman and Myers 2003) but its focus is on the display of error distributions from

complex models

Figure 4 demonstrates these density boxplots applied to 100 numbers drawn from each of four distribu-

tions with mean 0 and standard deviation 1 a standard normal a skew-right distribution (Johnson distri-

bution with skewness 22 and kurtosis 13) a leptikurtic distribution (Johnson distribution with skewness 0

and kurtosis 20) and a bimodal distribution (two normals with mean -095 and 095 and standard devia-

tion 031) Richer displays of density make it much easier to see important variations in the distribution

multi-modality is particularly important and yet completely invisible with the boxplot

n s k mm

2

02

4

n s k mm

2

02

4

n s k mm

4

2

02

4

4

2

02

4

n s k mm

Figure 4 From left to right box plot vase plot violin plot and bean plot Within each plot the distributions from left to

right are standard normal (n) right-skewed (s) leptikurtic (k) and bimodal (mm) A normal kernel and bandwidth of

02 are used in all plots for all groups

A more sophisticated display is the sectioned density plot (Cohen and Cohen 2006) which uses both

colour and space to stack a density estimate into a smaller area hopefully without losing any information

(not formally verified with a perceptual study) The sectioned density plot is similar in spirit to horizon

graphs for time series (Reijner 2008) which have been found to be just as readable as regular line graphs

despite taking up much less space (Heer et al 2009) The density strips of Jackson (2008) provide a similar

compact display that uses colour instead of width to display density These methods are shown in Figure 5

6

[40 years of boxplots Wickham and Stryjewski 2012 hadconz]

Idiom Dimensionality reduction for documents

45

Task 1

InHD data

Out2D data

ProduceIn High- dimensional data

WhyWhat

Derive

In2D data

Task 2

Out 2D data

HowWhyWhat

EncodeNavigateSelect

DiscoverExploreIdentify

In 2D dataOut ScatterplotOut Clusters amp points

OutScatterplotClusters amp points

Task 3

InScatterplotClusters amp points

OutLabels for clusters

WhyWhat

ProduceAnnotate

In ScatterplotIn Clusters amp pointsOut Labels for clusters

wombat

bull attribute aggregationndash derive low-dimensional target space from high-dimensional measured space

46

Datasets

WhatAttributes

Dataset Types

Data Types

Data and Dataset Types

Tables

Attributes (columns)

Items (rows)

Cell containing value

Networks

Link

Node (item)

Trees

Fields (Continuous)

Geometry (Spatial)

Attributes (columns)

Value in cell

Cell

Multidimensional Table

Value in cell

Items Attributes Links Positions Grids

Attribute Types

Ordering Direction

Categorical

OrderedOrdinal

Quantitative

Sequential

Diverging

Cyclic

Tables Networks amp Trees

Fields Geometry Clusters Sets Lists

Items

Attributes

Items (nodes)

Links

Attributes

Grids

Positions

Attributes

Items

Positions

Items

Grid of positions

Position

Trends

Actions

Analyze

Search

Query

Why

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Network Data

Spatial Data

Topology

Paths

Extremes

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

Identify Compare Summarize

tag

Target known Target unknown

Location knownLocation unknown

Lookup

Locate

Browse

Explore

Targets

Why

What

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

algorithm

idiom

abstraction

domain

More Informationbull this talk

httpwwwcsubcca~tmmtalkshtmlvad15d3

bull book page (including tutorial lecture slides)httpwwwcsubcca~tmmvadbook

ndash 20 promo code for book+ebook combo HVN17

ndash httpwwwcrcpresscomproductisbn9781466508910

ndash illustrations Eamonn Maguire

bull papers videos software talks full courses httpwwwcsubccagroupinfovis httpwwwcsubcca~tmm

47Munzner A K Peters Visualization Series CRC Press Visualization Series 2014

Visualization Analysis and Design

tamaramunzner

Page 41: Visualization Analysis & Designii.uib.no/vis_old/publications/pdfs/2016-01-17--vad15d3unconf--edited-by-HH.pdf · Visualization is suitable when there is a need to augment human capabilities

Partition into views

37

bull how to divide data between viewsndash encodes association between items

using spatial proximity ndash major implications for what patterns

are visiblendash split according to attributes

bull design choicesndash how many splits

bull all the way down one mark per regionbull stop earlier for more complex structure

within region

ndash order in which attribs used to splitndash how many views

Partition into Side-by-Side Views

Partitioning List alignmentbull single bar chart with grouped bars

ndash split by state into regionsbull complex glyph within each region showing all ages

ndash compare easy within state hard across ages

bull small-multiple bar chartsndash split by age into regions

bull one chart per region

ndash compare easy within age harder across states

38

110

100

90

80

70

60

50

40

30

20

10

00 CA TK NY FL IL PA

65 Years and Over45 to 64 Years25 to 44 Years18 to 24 Years14 to 17 Years5 to 13 YearsUnder 5 Years

CA TK NY FL IL PA

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

httpblocksorgmbostock3887051 httpblocksorgmbostock4679202

Partitioning Recursive subdivision

bull split by neighborhoodbull then by type bull then time

ndash years as rowsndash months as columns

bull color by price

bull neighborhood patternsndash where itrsquos expensivendash where you pay much more

for detached type

39[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull switch order of splitsndash type then neighborhood

bull switch colorndash by price variation

bull type patternsndash within specific type which

neighborhoods inconsistent

40[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull different encoding for second-level regionsndash choropleth maps

41[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

How to handle complexity 3 more strategies

42

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull reduce what is shown within single view

Reduce items and attributes

43

bull reduceincrease inversesbull filter

ndash pro straightforward and intuitivebull to understand and compute

ndash con out of sight out of mind

bull aggregationndash pro inform about whole setndash con difficult to avoid losing signal

bull not mutually exclusivendash combine filter aggregatendash combine reduce facet change derive

Reduce

Filter

Aggregate

Embed

Reducing Items and Attributes

FilterItems

Attributes

Aggregate

Items

Attributes

Idiom boxplotbull static item aggregationbull task find distributionbull data tablebull derived data

ndash 5 quant attribsbull median central linebull lower and upper quartile boxesbull lower upper fences whiskers

ndash values beyond which items are outliers

ndash outliers beyond fence cutoffs explicitly shown

44

pod and the rug plot looks like the seeds within Kampstra (2008) also suggests a way of comparing two

groups more easily use the left and right sides of the bean to display different distributions A related idea

is the raindrop plot (Barrowman and Myers 2003) but its focus is on the display of error distributions from

complex models

Figure 4 demonstrates these density boxplots applied to 100 numbers drawn from each of four distribu-

tions with mean 0 and standard deviation 1 a standard normal a skew-right distribution (Johnson distri-

bution with skewness 22 and kurtosis 13) a leptikurtic distribution (Johnson distribution with skewness 0

and kurtosis 20) and a bimodal distribution (two normals with mean -095 and 095 and standard devia-

tion 031) Richer displays of density make it much easier to see important variations in the distribution

multi-modality is particularly important and yet completely invisible with the boxplot

n s k mm

2

02

4

n s k mm

2

02

4

n s k mm

4

2

02

4

4

2

02

4

n s k mm

Figure 4 From left to right box plot vase plot violin plot and bean plot Within each plot the distributions from left to

right are standard normal (n) right-skewed (s) leptikurtic (k) and bimodal (mm) A normal kernel and bandwidth of

02 are used in all plots for all groups

A more sophisticated display is the sectioned density plot (Cohen and Cohen 2006) which uses both

colour and space to stack a density estimate into a smaller area hopefully without losing any information

(not formally verified with a perceptual study) The sectioned density plot is similar in spirit to horizon

graphs for time series (Reijner 2008) which have been found to be just as readable as regular line graphs

despite taking up much less space (Heer et al 2009) The density strips of Jackson (2008) provide a similar

compact display that uses colour instead of width to display density These methods are shown in Figure 5

6

[40 years of boxplots Wickham and Stryjewski 2012 hadconz]

Idiom Dimensionality reduction for documents

45

Task 1

InHD data

Out2D data

ProduceIn High- dimensional data

WhyWhat

Derive

In2D data

Task 2

Out 2D data

HowWhyWhat

EncodeNavigateSelect

DiscoverExploreIdentify

In 2D dataOut ScatterplotOut Clusters amp points

OutScatterplotClusters amp points

Task 3

InScatterplotClusters amp points

OutLabels for clusters

WhyWhat

ProduceAnnotate

In ScatterplotIn Clusters amp pointsOut Labels for clusters

wombat

bull attribute aggregationndash derive low-dimensional target space from high-dimensional measured space

46

Datasets

WhatAttributes

Dataset Types

Data Types

Data and Dataset Types

Tables

Attributes (columns)

Items (rows)

Cell containing value

Networks

Link

Node (item)

Trees

Fields (Continuous)

Geometry (Spatial)

Attributes (columns)

Value in cell

Cell

Multidimensional Table

Value in cell

Items Attributes Links Positions Grids

Attribute Types

Ordering Direction

Categorical

OrderedOrdinal

Quantitative

Sequential

Diverging

Cyclic

Tables Networks amp Trees

Fields Geometry Clusters Sets Lists

Items

Attributes

Items (nodes)

Links

Attributes

Grids

Positions

Attributes

Items

Positions

Items

Grid of positions

Position

Trends

Actions

Analyze

Search

Query

Why

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Network Data

Spatial Data

Topology

Paths

Extremes

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

Identify Compare Summarize

tag

Target known Target unknown

Location knownLocation unknown

Lookup

Locate

Browse

Explore

Targets

Why

What

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

algorithm

idiom

abstraction

domain

More Informationbull this talk

httpwwwcsubcca~tmmtalkshtmlvad15d3

bull book page (including tutorial lecture slides)httpwwwcsubcca~tmmvadbook

ndash 20 promo code for book+ebook combo HVN17

ndash httpwwwcrcpresscomproductisbn9781466508910

ndash illustrations Eamonn Maguire

bull papers videos software talks full courses httpwwwcsubccagroupinfovis httpwwwcsubcca~tmm

47Munzner A K Peters Visualization Series CRC Press Visualization Series 2014

Visualization Analysis and Design

tamaramunzner

Page 42: Visualization Analysis & Designii.uib.no/vis_old/publications/pdfs/2016-01-17--vad15d3unconf--edited-by-HH.pdf · Visualization is suitable when there is a need to augment human capabilities

Partitioning List alignmentbull single bar chart with grouped bars

ndash split by state into regionsbull complex glyph within each region showing all ages

ndash compare easy within state hard across ages

bull small-multiple bar chartsndash split by age into regions

bull one chart per region

ndash compare easy within age harder across states

38

110

100

90

80

70

60

50

40

30

20

10

00 CA TK NY FL IL PA

65 Years and Over45 to 64 Years25 to 44 Years18 to 24 Years14 to 17 Years5 to 13 YearsUnder 5 Years

CA TK NY FL IL PA

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

0

5

11

httpblocksorgmbostock3887051 httpblocksorgmbostock4679202

Partitioning Recursive subdivision

bull split by neighborhoodbull then by type bull then time

ndash years as rowsndash months as columns

bull color by price

bull neighborhood patternsndash where itrsquos expensivendash where you pay much more

for detached type

39[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull switch order of splitsndash type then neighborhood

bull switch colorndash by price variation

bull type patternsndash within specific type which

neighborhoods inconsistent

40[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull different encoding for second-level regionsndash choropleth maps

41[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

How to handle complexity 3 more strategies

42

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull reduce what is shown within single view

Reduce items and attributes

43

bull reduceincrease inversesbull filter

ndash pro straightforward and intuitivebull to understand and compute

ndash con out of sight out of mind

bull aggregationndash pro inform about whole setndash con difficult to avoid losing signal

bull not mutually exclusivendash combine filter aggregatendash combine reduce facet change derive

Reduce

Filter

Aggregate

Embed

Reducing Items and Attributes

FilterItems

Attributes

Aggregate

Items

Attributes

Idiom boxplotbull static item aggregationbull task find distributionbull data tablebull derived data

ndash 5 quant attribsbull median central linebull lower and upper quartile boxesbull lower upper fences whiskers

ndash values beyond which items are outliers

ndash outliers beyond fence cutoffs explicitly shown

44

pod and the rug plot looks like the seeds within Kampstra (2008) also suggests a way of comparing two

groups more easily use the left and right sides of the bean to display different distributions A related idea

is the raindrop plot (Barrowman and Myers 2003) but its focus is on the display of error distributions from

complex models

Figure 4 demonstrates these density boxplots applied to 100 numbers drawn from each of four distribu-

tions with mean 0 and standard deviation 1 a standard normal a skew-right distribution (Johnson distri-

bution with skewness 22 and kurtosis 13) a leptikurtic distribution (Johnson distribution with skewness 0

and kurtosis 20) and a bimodal distribution (two normals with mean -095 and 095 and standard devia-

tion 031) Richer displays of density make it much easier to see important variations in the distribution

multi-modality is particularly important and yet completely invisible with the boxplot

n s k mm

2

02

4

n s k mm

2

02

4

n s k mm

4

2

02

4

4

2

02

4

n s k mm

Figure 4 From left to right box plot vase plot violin plot and bean plot Within each plot the distributions from left to

right are standard normal (n) right-skewed (s) leptikurtic (k) and bimodal (mm) A normal kernel and bandwidth of

02 are used in all plots for all groups

A more sophisticated display is the sectioned density plot (Cohen and Cohen 2006) which uses both

colour and space to stack a density estimate into a smaller area hopefully without losing any information

(not formally verified with a perceptual study) The sectioned density plot is similar in spirit to horizon

graphs for time series (Reijner 2008) which have been found to be just as readable as regular line graphs

despite taking up much less space (Heer et al 2009) The density strips of Jackson (2008) provide a similar

compact display that uses colour instead of width to display density These methods are shown in Figure 5

6

[40 years of boxplots Wickham and Stryjewski 2012 hadconz]

Idiom Dimensionality reduction for documents

45

Task 1

InHD data

Out2D data

ProduceIn High- dimensional data

WhyWhat

Derive

In2D data

Task 2

Out 2D data

HowWhyWhat

EncodeNavigateSelect

DiscoverExploreIdentify

In 2D dataOut ScatterplotOut Clusters amp points

OutScatterplotClusters amp points

Task 3

InScatterplotClusters amp points

OutLabels for clusters

WhyWhat

ProduceAnnotate

In ScatterplotIn Clusters amp pointsOut Labels for clusters

wombat

bull attribute aggregationndash derive low-dimensional target space from high-dimensional measured space

46

Datasets

WhatAttributes

Dataset Types

Data Types

Data and Dataset Types

Tables

Attributes (columns)

Items (rows)

Cell containing value

Networks

Link

Node (item)

Trees

Fields (Continuous)

Geometry (Spatial)

Attributes (columns)

Value in cell

Cell

Multidimensional Table

Value in cell

Items Attributes Links Positions Grids

Attribute Types

Ordering Direction

Categorical

OrderedOrdinal

Quantitative

Sequential

Diverging

Cyclic

Tables Networks amp Trees

Fields Geometry Clusters Sets Lists

Items

Attributes

Items (nodes)

Links

Attributes

Grids

Positions

Attributes

Items

Positions

Items

Grid of positions

Position

Trends

Actions

Analyze

Search

Query

Why

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Network Data

Spatial Data

Topology

Paths

Extremes

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

Identify Compare Summarize

tag

Target known Target unknown

Location knownLocation unknown

Lookup

Locate

Browse

Explore

Targets

Why

What

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

algorithm

idiom

abstraction

domain

More Informationbull this talk

httpwwwcsubcca~tmmtalkshtmlvad15d3

bull book page (including tutorial lecture slides)httpwwwcsubcca~tmmvadbook

ndash 20 promo code for book+ebook combo HVN17

ndash httpwwwcrcpresscomproductisbn9781466508910

ndash illustrations Eamonn Maguire

bull papers videos software talks full courses httpwwwcsubccagroupinfovis httpwwwcsubcca~tmm

47Munzner A K Peters Visualization Series CRC Press Visualization Series 2014

Visualization Analysis and Design

tamaramunzner

Page 43: Visualization Analysis & Designii.uib.no/vis_old/publications/pdfs/2016-01-17--vad15d3unconf--edited-by-HH.pdf · Visualization is suitable when there is a need to augment human capabilities

Partitioning Recursive subdivision

bull split by neighborhoodbull then by type bull then time

ndash years as rowsndash months as columns

bull color by price

bull neighborhood patternsndash where itrsquos expensivendash where you pay much more

for detached type

39[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull switch order of splitsndash type then neighborhood

bull switch colorndash by price variation

bull type patternsndash within specific type which

neighborhoods inconsistent

40[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull different encoding for second-level regionsndash choropleth maps

41[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

How to handle complexity 3 more strategies

42

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull reduce what is shown within single view

Reduce items and attributes

43

bull reduceincrease inversesbull filter

ndash pro straightforward and intuitivebull to understand and compute

ndash con out of sight out of mind

bull aggregationndash pro inform about whole setndash con difficult to avoid losing signal

bull not mutually exclusivendash combine filter aggregatendash combine reduce facet change derive

Reduce

Filter

Aggregate

Embed

Reducing Items and Attributes

FilterItems

Attributes

Aggregate

Items

Attributes

Idiom boxplotbull static item aggregationbull task find distributionbull data tablebull derived data

ndash 5 quant attribsbull median central linebull lower and upper quartile boxesbull lower upper fences whiskers

ndash values beyond which items are outliers

ndash outliers beyond fence cutoffs explicitly shown

44

pod and the rug plot looks like the seeds within Kampstra (2008) also suggests a way of comparing two

groups more easily use the left and right sides of the bean to display different distributions A related idea

is the raindrop plot (Barrowman and Myers 2003) but its focus is on the display of error distributions from

complex models

Figure 4 demonstrates these density boxplots applied to 100 numbers drawn from each of four distribu-

tions with mean 0 and standard deviation 1 a standard normal a skew-right distribution (Johnson distri-

bution with skewness 22 and kurtosis 13) a leptikurtic distribution (Johnson distribution with skewness 0

and kurtosis 20) and a bimodal distribution (two normals with mean -095 and 095 and standard devia-

tion 031) Richer displays of density make it much easier to see important variations in the distribution

multi-modality is particularly important and yet completely invisible with the boxplot

n s k mm

2

02

4

n s k mm

2

02

4

n s k mm

4

2

02

4

4

2

02

4

n s k mm

Figure 4 From left to right box plot vase plot violin plot and bean plot Within each plot the distributions from left to

right are standard normal (n) right-skewed (s) leptikurtic (k) and bimodal (mm) A normal kernel and bandwidth of

02 are used in all plots for all groups

A more sophisticated display is the sectioned density plot (Cohen and Cohen 2006) which uses both

colour and space to stack a density estimate into a smaller area hopefully without losing any information

(not formally verified with a perceptual study) The sectioned density plot is similar in spirit to horizon

graphs for time series (Reijner 2008) which have been found to be just as readable as regular line graphs

despite taking up much less space (Heer et al 2009) The density strips of Jackson (2008) provide a similar

compact display that uses colour instead of width to display density These methods are shown in Figure 5

6

[40 years of boxplots Wickham and Stryjewski 2012 hadconz]

Idiom Dimensionality reduction for documents

45

Task 1

InHD data

Out2D data

ProduceIn High- dimensional data

WhyWhat

Derive

In2D data

Task 2

Out 2D data

HowWhyWhat

EncodeNavigateSelect

DiscoverExploreIdentify

In 2D dataOut ScatterplotOut Clusters amp points

OutScatterplotClusters amp points

Task 3

InScatterplotClusters amp points

OutLabels for clusters

WhyWhat

ProduceAnnotate

In ScatterplotIn Clusters amp pointsOut Labels for clusters

wombat

bull attribute aggregationndash derive low-dimensional target space from high-dimensional measured space

46

Datasets

WhatAttributes

Dataset Types

Data Types

Data and Dataset Types

Tables

Attributes (columns)

Items (rows)

Cell containing value

Networks

Link

Node (item)

Trees

Fields (Continuous)

Geometry (Spatial)

Attributes (columns)

Value in cell

Cell

Multidimensional Table

Value in cell

Items Attributes Links Positions Grids

Attribute Types

Ordering Direction

Categorical

OrderedOrdinal

Quantitative

Sequential

Diverging

Cyclic

Tables Networks amp Trees

Fields Geometry Clusters Sets Lists

Items

Attributes

Items (nodes)

Links

Attributes

Grids

Positions

Attributes

Items

Positions

Items

Grid of positions

Position

Trends

Actions

Analyze

Search

Query

Why

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Network Data

Spatial Data

Topology

Paths

Extremes

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

Identify Compare Summarize

tag

Target known Target unknown

Location knownLocation unknown

Lookup

Locate

Browse

Explore

Targets

Why

What

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

algorithm

idiom

abstraction

domain

More Informationbull this talk

httpwwwcsubcca~tmmtalkshtmlvad15d3

bull book page (including tutorial lecture slides)httpwwwcsubcca~tmmvadbook

ndash 20 promo code for book+ebook combo HVN17

ndash httpwwwcrcpresscomproductisbn9781466508910

ndash illustrations Eamonn Maguire

bull papers videos software talks full courses httpwwwcsubccagroupinfovis httpwwwcsubcca~tmm

47Munzner A K Peters Visualization Series CRC Press Visualization Series 2014

Visualization Analysis and Design

tamaramunzner

Page 44: Visualization Analysis & Designii.uib.no/vis_old/publications/pdfs/2016-01-17--vad15d3unconf--edited-by-HH.pdf · Visualization is suitable when there is a need to augment human capabilities

Partitioning Recursive subdivision

bull switch order of splitsndash type then neighborhood

bull switch colorndash by price variation

bull type patternsndash within specific type which

neighborhoods inconsistent

40[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

Partitioning Recursive subdivision

bull different encoding for second-level regionsndash choropleth maps

41[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

How to handle complexity 3 more strategies

42

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull reduce what is shown within single view

Reduce items and attributes

43

bull reduceincrease inversesbull filter

ndash pro straightforward and intuitivebull to understand and compute

ndash con out of sight out of mind

bull aggregationndash pro inform about whole setndash con difficult to avoid losing signal

bull not mutually exclusivendash combine filter aggregatendash combine reduce facet change derive

Reduce

Filter

Aggregate

Embed

Reducing Items and Attributes

FilterItems

Attributes

Aggregate

Items

Attributes

Idiom boxplotbull static item aggregationbull task find distributionbull data tablebull derived data

ndash 5 quant attribsbull median central linebull lower and upper quartile boxesbull lower upper fences whiskers

ndash values beyond which items are outliers

ndash outliers beyond fence cutoffs explicitly shown

44

pod and the rug plot looks like the seeds within Kampstra (2008) also suggests a way of comparing two

groups more easily use the left and right sides of the bean to display different distributions A related idea

is the raindrop plot (Barrowman and Myers 2003) but its focus is on the display of error distributions from

complex models

Figure 4 demonstrates these density boxplots applied to 100 numbers drawn from each of four distribu-

tions with mean 0 and standard deviation 1 a standard normal a skew-right distribution (Johnson distri-

bution with skewness 22 and kurtosis 13) a leptikurtic distribution (Johnson distribution with skewness 0

and kurtosis 20) and a bimodal distribution (two normals with mean -095 and 095 and standard devia-

tion 031) Richer displays of density make it much easier to see important variations in the distribution

multi-modality is particularly important and yet completely invisible with the boxplot

n s k mm

2

02

4

n s k mm

2

02

4

n s k mm

4

2

02

4

4

2

02

4

n s k mm

Figure 4 From left to right box plot vase plot violin plot and bean plot Within each plot the distributions from left to

right are standard normal (n) right-skewed (s) leptikurtic (k) and bimodal (mm) A normal kernel and bandwidth of

02 are used in all plots for all groups

A more sophisticated display is the sectioned density plot (Cohen and Cohen 2006) which uses both

colour and space to stack a density estimate into a smaller area hopefully without losing any information

(not formally verified with a perceptual study) The sectioned density plot is similar in spirit to horizon

graphs for time series (Reijner 2008) which have been found to be just as readable as regular line graphs

despite taking up much less space (Heer et al 2009) The density strips of Jackson (2008) provide a similar

compact display that uses colour instead of width to display density These methods are shown in Figure 5

6

[40 years of boxplots Wickham and Stryjewski 2012 hadconz]

Idiom Dimensionality reduction for documents

45

Task 1

InHD data

Out2D data

ProduceIn High- dimensional data

WhyWhat

Derive

In2D data

Task 2

Out 2D data

HowWhyWhat

EncodeNavigateSelect

DiscoverExploreIdentify

In 2D dataOut ScatterplotOut Clusters amp points

OutScatterplotClusters amp points

Task 3

InScatterplotClusters amp points

OutLabels for clusters

WhyWhat

ProduceAnnotate

In ScatterplotIn Clusters amp pointsOut Labels for clusters

wombat

bull attribute aggregationndash derive low-dimensional target space from high-dimensional measured space

46

Datasets

WhatAttributes

Dataset Types

Data Types

Data and Dataset Types

Tables

Attributes (columns)

Items (rows)

Cell containing value

Networks

Link

Node (item)

Trees

Fields (Continuous)

Geometry (Spatial)

Attributes (columns)

Value in cell

Cell

Multidimensional Table

Value in cell

Items Attributes Links Positions Grids

Attribute Types

Ordering Direction

Categorical

OrderedOrdinal

Quantitative

Sequential

Diverging

Cyclic

Tables Networks amp Trees

Fields Geometry Clusters Sets Lists

Items

Attributes

Items (nodes)

Links

Attributes

Grids

Positions

Attributes

Items

Positions

Items

Grid of positions

Position

Trends

Actions

Analyze

Search

Query

Why

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Network Data

Spatial Data

Topology

Paths

Extremes

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

Identify Compare Summarize

tag

Target known Target unknown

Location knownLocation unknown

Lookup

Locate

Browse

Explore

Targets

Why

What

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

algorithm

idiom

abstraction

domain

More Informationbull this talk

httpwwwcsubcca~tmmtalkshtmlvad15d3

bull book page (including tutorial lecture slides)httpwwwcsubcca~tmmvadbook

ndash 20 promo code for book+ebook combo HVN17

ndash httpwwwcrcpresscomproductisbn9781466508910

ndash illustrations Eamonn Maguire

bull papers videos software talks full courses httpwwwcsubccagroupinfovis httpwwwcsubcca~tmm

47Munzner A K Peters Visualization Series CRC Press Visualization Series 2014

Visualization Analysis and Design

tamaramunzner

Page 45: Visualization Analysis & Designii.uib.no/vis_old/publications/pdfs/2016-01-17--vad15d3unconf--edited-by-HH.pdf · Visualization is suitable when there is a need to augment human capabilities

Partitioning Recursive subdivision

bull different encoding for second-level regionsndash choropleth maps

41[Configuring Hierarchical Layouts to Address Research Questions Slingsby Dykes and Wood IEEE Transactions on Visualization and Computer Graphics (Proc InfoVis 2009) 156 (2009) 977ndash984]

System HIVE

How to handle complexity 3 more strategies

42

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull reduce what is shown within single view

Reduce items and attributes

43

bull reduceincrease inversesbull filter

ndash pro straightforward and intuitivebull to understand and compute

ndash con out of sight out of mind

bull aggregationndash pro inform about whole setndash con difficult to avoid losing signal

bull not mutually exclusivendash combine filter aggregatendash combine reduce facet change derive

Reduce

Filter

Aggregate

Embed

Reducing Items and Attributes

FilterItems

Attributes

Aggregate

Items

Attributes

Idiom boxplotbull static item aggregationbull task find distributionbull data tablebull derived data

ndash 5 quant attribsbull median central linebull lower and upper quartile boxesbull lower upper fences whiskers

ndash values beyond which items are outliers

ndash outliers beyond fence cutoffs explicitly shown

44

pod and the rug plot looks like the seeds within Kampstra (2008) also suggests a way of comparing two

groups more easily use the left and right sides of the bean to display different distributions A related idea

is the raindrop plot (Barrowman and Myers 2003) but its focus is on the display of error distributions from

complex models

Figure 4 demonstrates these density boxplots applied to 100 numbers drawn from each of four distribu-

tions with mean 0 and standard deviation 1 a standard normal a skew-right distribution (Johnson distri-

bution with skewness 22 and kurtosis 13) a leptikurtic distribution (Johnson distribution with skewness 0

and kurtosis 20) and a bimodal distribution (two normals with mean -095 and 095 and standard devia-

tion 031) Richer displays of density make it much easier to see important variations in the distribution

multi-modality is particularly important and yet completely invisible with the boxplot

n s k mm

2

02

4

n s k mm

2

02

4

n s k mm

4

2

02

4

4

2

02

4

n s k mm

Figure 4 From left to right box plot vase plot violin plot and bean plot Within each plot the distributions from left to

right are standard normal (n) right-skewed (s) leptikurtic (k) and bimodal (mm) A normal kernel and bandwidth of

02 are used in all plots for all groups

A more sophisticated display is the sectioned density plot (Cohen and Cohen 2006) which uses both

colour and space to stack a density estimate into a smaller area hopefully without losing any information

(not formally verified with a perceptual study) The sectioned density plot is similar in spirit to horizon

graphs for time series (Reijner 2008) which have been found to be just as readable as regular line graphs

despite taking up much less space (Heer et al 2009) The density strips of Jackson (2008) provide a similar

compact display that uses colour instead of width to display density These methods are shown in Figure 5

6

[40 years of boxplots Wickham and Stryjewski 2012 hadconz]

Idiom Dimensionality reduction for documents

45

Task 1

InHD data

Out2D data

ProduceIn High- dimensional data

WhyWhat

Derive

In2D data

Task 2

Out 2D data

HowWhyWhat

EncodeNavigateSelect

DiscoverExploreIdentify

In 2D dataOut ScatterplotOut Clusters amp points

OutScatterplotClusters amp points

Task 3

InScatterplotClusters amp points

OutLabels for clusters

WhyWhat

ProduceAnnotate

In ScatterplotIn Clusters amp pointsOut Labels for clusters

wombat

bull attribute aggregationndash derive low-dimensional target space from high-dimensional measured space

46

Datasets

WhatAttributes

Dataset Types

Data Types

Data and Dataset Types

Tables

Attributes (columns)

Items (rows)

Cell containing value

Networks

Link

Node (item)

Trees

Fields (Continuous)

Geometry (Spatial)

Attributes (columns)

Value in cell

Cell

Multidimensional Table

Value in cell

Items Attributes Links Positions Grids

Attribute Types

Ordering Direction

Categorical

OrderedOrdinal

Quantitative

Sequential

Diverging

Cyclic

Tables Networks amp Trees

Fields Geometry Clusters Sets Lists

Items

Attributes

Items (nodes)

Links

Attributes

Grids

Positions

Attributes

Items

Positions

Items

Grid of positions

Position

Trends

Actions

Analyze

Search

Query

Why

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Network Data

Spatial Data

Topology

Paths

Extremes

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

Identify Compare Summarize

tag

Target known Target unknown

Location knownLocation unknown

Lookup

Locate

Browse

Explore

Targets

Why

What

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

algorithm

idiom

abstraction

domain

More Informationbull this talk

httpwwwcsubcca~tmmtalkshtmlvad15d3

bull book page (including tutorial lecture slides)httpwwwcsubcca~tmmvadbook

ndash 20 promo code for book+ebook combo HVN17

ndash httpwwwcrcpresscomproductisbn9781466508910

ndash illustrations Eamonn Maguire

bull papers videos software talks full courses httpwwwcsubccagroupinfovis httpwwwcsubcca~tmm

47Munzner A K Peters Visualization Series CRC Press Visualization Series 2014

Visualization Analysis and Design

tamaramunzner

Page 46: Visualization Analysis & Designii.uib.no/vis_old/publications/pdfs/2016-01-17--vad15d3unconf--edited-by-HH.pdf · Visualization is suitable when there is a need to augment human capabilities

How to handle complexity 3 more strategies

42

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

Derive

+ 1 previous

bull reduce what is shown within single view

Reduce items and attributes

43

bull reduceincrease inversesbull filter

ndash pro straightforward and intuitivebull to understand and compute

ndash con out of sight out of mind

bull aggregationndash pro inform about whole setndash con difficult to avoid losing signal

bull not mutually exclusivendash combine filter aggregatendash combine reduce facet change derive

Reduce

Filter

Aggregate

Embed

Reducing Items and Attributes

FilterItems

Attributes

Aggregate

Items

Attributes

Idiom boxplotbull static item aggregationbull task find distributionbull data tablebull derived data

ndash 5 quant attribsbull median central linebull lower and upper quartile boxesbull lower upper fences whiskers

ndash values beyond which items are outliers

ndash outliers beyond fence cutoffs explicitly shown

44

pod and the rug plot looks like the seeds within Kampstra (2008) also suggests a way of comparing two

groups more easily use the left and right sides of the bean to display different distributions A related idea

is the raindrop plot (Barrowman and Myers 2003) but its focus is on the display of error distributions from

complex models

Figure 4 demonstrates these density boxplots applied to 100 numbers drawn from each of four distribu-

tions with mean 0 and standard deviation 1 a standard normal a skew-right distribution (Johnson distri-

bution with skewness 22 and kurtosis 13) a leptikurtic distribution (Johnson distribution with skewness 0

and kurtosis 20) and a bimodal distribution (two normals with mean -095 and 095 and standard devia-

tion 031) Richer displays of density make it much easier to see important variations in the distribution

multi-modality is particularly important and yet completely invisible with the boxplot

n s k mm

2

02

4

n s k mm

2

02

4

n s k mm

4

2

02

4

4

2

02

4

n s k mm

Figure 4 From left to right box plot vase plot violin plot and bean plot Within each plot the distributions from left to

right are standard normal (n) right-skewed (s) leptikurtic (k) and bimodal (mm) A normal kernel and bandwidth of

02 are used in all plots for all groups

A more sophisticated display is the sectioned density plot (Cohen and Cohen 2006) which uses both

colour and space to stack a density estimate into a smaller area hopefully without losing any information

(not formally verified with a perceptual study) The sectioned density plot is similar in spirit to horizon

graphs for time series (Reijner 2008) which have been found to be just as readable as regular line graphs

despite taking up much less space (Heer et al 2009) The density strips of Jackson (2008) provide a similar

compact display that uses colour instead of width to display density These methods are shown in Figure 5

6

[40 years of boxplots Wickham and Stryjewski 2012 hadconz]

Idiom Dimensionality reduction for documents

45

Task 1

InHD data

Out2D data

ProduceIn High- dimensional data

WhyWhat

Derive

In2D data

Task 2

Out 2D data

HowWhyWhat

EncodeNavigateSelect

DiscoverExploreIdentify

In 2D dataOut ScatterplotOut Clusters amp points

OutScatterplotClusters amp points

Task 3

InScatterplotClusters amp points

OutLabels for clusters

WhyWhat

ProduceAnnotate

In ScatterplotIn Clusters amp pointsOut Labels for clusters

wombat

bull attribute aggregationndash derive low-dimensional target space from high-dimensional measured space

46

Datasets

WhatAttributes

Dataset Types

Data Types

Data and Dataset Types

Tables

Attributes (columns)

Items (rows)

Cell containing value

Networks

Link

Node (item)

Trees

Fields (Continuous)

Geometry (Spatial)

Attributes (columns)

Value in cell

Cell

Multidimensional Table

Value in cell

Items Attributes Links Positions Grids

Attribute Types

Ordering Direction

Categorical

OrderedOrdinal

Quantitative

Sequential

Diverging

Cyclic

Tables Networks amp Trees

Fields Geometry Clusters Sets Lists

Items

Attributes

Items (nodes)

Links

Attributes

Grids

Positions

Attributes

Items

Positions

Items

Grid of positions

Position

Trends

Actions

Analyze

Search

Query

Why

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Network Data

Spatial Data

Topology

Paths

Extremes

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

Identify Compare Summarize

tag

Target known Target unknown

Location knownLocation unknown

Lookup

Locate

Browse

Explore

Targets

Why

What

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

algorithm

idiom

abstraction

domain

More Informationbull this talk

httpwwwcsubcca~tmmtalkshtmlvad15d3

bull book page (including tutorial lecture slides)httpwwwcsubcca~tmmvadbook

ndash 20 promo code for book+ebook combo HVN17

ndash httpwwwcrcpresscomproductisbn9781466508910

ndash illustrations Eamonn Maguire

bull papers videos software talks full courses httpwwwcsubccagroupinfovis httpwwwcsubcca~tmm

47Munzner A K Peters Visualization Series CRC Press Visualization Series 2014

Visualization Analysis and Design

tamaramunzner

Page 47: Visualization Analysis & Designii.uib.no/vis_old/publications/pdfs/2016-01-17--vad15d3unconf--edited-by-HH.pdf · Visualization is suitable when there is a need to augment human capabilities

Reduce items and attributes

43

bull reduceincrease inversesbull filter

ndash pro straightforward and intuitivebull to understand and compute

ndash con out of sight out of mind

bull aggregationndash pro inform about whole setndash con difficult to avoid losing signal

bull not mutually exclusivendash combine filter aggregatendash combine reduce facet change derive

Reduce

Filter

Aggregate

Embed

Reducing Items and Attributes

FilterItems

Attributes

Aggregate

Items

Attributes

Idiom boxplotbull static item aggregationbull task find distributionbull data tablebull derived data

ndash 5 quant attribsbull median central linebull lower and upper quartile boxesbull lower upper fences whiskers

ndash values beyond which items are outliers

ndash outliers beyond fence cutoffs explicitly shown

44

pod and the rug plot looks like the seeds within Kampstra (2008) also suggests a way of comparing two

groups more easily use the left and right sides of the bean to display different distributions A related idea

is the raindrop plot (Barrowman and Myers 2003) but its focus is on the display of error distributions from

complex models

Figure 4 demonstrates these density boxplots applied to 100 numbers drawn from each of four distribu-

tions with mean 0 and standard deviation 1 a standard normal a skew-right distribution (Johnson distri-

bution with skewness 22 and kurtosis 13) a leptikurtic distribution (Johnson distribution with skewness 0

and kurtosis 20) and a bimodal distribution (two normals with mean -095 and 095 and standard devia-

tion 031) Richer displays of density make it much easier to see important variations in the distribution

multi-modality is particularly important and yet completely invisible with the boxplot

n s k mm

2

02

4

n s k mm

2

02

4

n s k mm

4

2

02

4

4

2

02

4

n s k mm

Figure 4 From left to right box plot vase plot violin plot and bean plot Within each plot the distributions from left to

right are standard normal (n) right-skewed (s) leptikurtic (k) and bimodal (mm) A normal kernel and bandwidth of

02 are used in all plots for all groups

A more sophisticated display is the sectioned density plot (Cohen and Cohen 2006) which uses both

colour and space to stack a density estimate into a smaller area hopefully without losing any information

(not formally verified with a perceptual study) The sectioned density plot is similar in spirit to horizon

graphs for time series (Reijner 2008) which have been found to be just as readable as regular line graphs

despite taking up much less space (Heer et al 2009) The density strips of Jackson (2008) provide a similar

compact display that uses colour instead of width to display density These methods are shown in Figure 5

6

[40 years of boxplots Wickham and Stryjewski 2012 hadconz]

Idiom Dimensionality reduction for documents

45

Task 1

InHD data

Out2D data

ProduceIn High- dimensional data

WhyWhat

Derive

In2D data

Task 2

Out 2D data

HowWhyWhat

EncodeNavigateSelect

DiscoverExploreIdentify

In 2D dataOut ScatterplotOut Clusters amp points

OutScatterplotClusters amp points

Task 3

InScatterplotClusters amp points

OutLabels for clusters

WhyWhat

ProduceAnnotate

In ScatterplotIn Clusters amp pointsOut Labels for clusters

wombat

bull attribute aggregationndash derive low-dimensional target space from high-dimensional measured space

46

Datasets

WhatAttributes

Dataset Types

Data Types

Data and Dataset Types

Tables

Attributes (columns)

Items (rows)

Cell containing value

Networks

Link

Node (item)

Trees

Fields (Continuous)

Geometry (Spatial)

Attributes (columns)

Value in cell

Cell

Multidimensional Table

Value in cell

Items Attributes Links Positions Grids

Attribute Types

Ordering Direction

Categorical

OrderedOrdinal

Quantitative

Sequential

Diverging

Cyclic

Tables Networks amp Trees

Fields Geometry Clusters Sets Lists

Items

Attributes

Items (nodes)

Links

Attributes

Grids

Positions

Attributes

Items

Positions

Items

Grid of positions

Position

Trends

Actions

Analyze

Search

Query

Why

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Network Data

Spatial Data

Topology

Paths

Extremes

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

Identify Compare Summarize

tag

Target known Target unknown

Location knownLocation unknown

Lookup

Locate

Browse

Explore

Targets

Why

What

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

algorithm

idiom

abstraction

domain

More Informationbull this talk

httpwwwcsubcca~tmmtalkshtmlvad15d3

bull book page (including tutorial lecture slides)httpwwwcsubcca~tmmvadbook

ndash 20 promo code for book+ebook combo HVN17

ndash httpwwwcrcpresscomproductisbn9781466508910

ndash illustrations Eamonn Maguire

bull papers videos software talks full courses httpwwwcsubccagroupinfovis httpwwwcsubcca~tmm

47Munzner A K Peters Visualization Series CRC Press Visualization Series 2014

Visualization Analysis and Design

tamaramunzner

Page 48: Visualization Analysis & Designii.uib.no/vis_old/publications/pdfs/2016-01-17--vad15d3unconf--edited-by-HH.pdf · Visualization is suitable when there is a need to augment human capabilities

Idiom boxplotbull static item aggregationbull task find distributionbull data tablebull derived data

ndash 5 quant attribsbull median central linebull lower and upper quartile boxesbull lower upper fences whiskers

ndash values beyond which items are outliers

ndash outliers beyond fence cutoffs explicitly shown

44

pod and the rug plot looks like the seeds within Kampstra (2008) also suggests a way of comparing two

groups more easily use the left and right sides of the bean to display different distributions A related idea

is the raindrop plot (Barrowman and Myers 2003) but its focus is on the display of error distributions from

complex models

Figure 4 demonstrates these density boxplots applied to 100 numbers drawn from each of four distribu-

tions with mean 0 and standard deviation 1 a standard normal a skew-right distribution (Johnson distri-

bution with skewness 22 and kurtosis 13) a leptikurtic distribution (Johnson distribution with skewness 0

and kurtosis 20) and a bimodal distribution (two normals with mean -095 and 095 and standard devia-

tion 031) Richer displays of density make it much easier to see important variations in the distribution

multi-modality is particularly important and yet completely invisible with the boxplot

n s k mm

2

02

4

n s k mm

2

02

4

n s k mm

4

2

02

4

4

2

02

4

n s k mm

Figure 4 From left to right box plot vase plot violin plot and bean plot Within each plot the distributions from left to

right are standard normal (n) right-skewed (s) leptikurtic (k) and bimodal (mm) A normal kernel and bandwidth of

02 are used in all plots for all groups

A more sophisticated display is the sectioned density plot (Cohen and Cohen 2006) which uses both

colour and space to stack a density estimate into a smaller area hopefully without losing any information

(not formally verified with a perceptual study) The sectioned density plot is similar in spirit to horizon

graphs for time series (Reijner 2008) which have been found to be just as readable as regular line graphs

despite taking up much less space (Heer et al 2009) The density strips of Jackson (2008) provide a similar

compact display that uses colour instead of width to display density These methods are shown in Figure 5

6

[40 years of boxplots Wickham and Stryjewski 2012 hadconz]

Idiom Dimensionality reduction for documents

45

Task 1

InHD data

Out2D data

ProduceIn High- dimensional data

WhyWhat

Derive

In2D data

Task 2

Out 2D data

HowWhyWhat

EncodeNavigateSelect

DiscoverExploreIdentify

In 2D dataOut ScatterplotOut Clusters amp points

OutScatterplotClusters amp points

Task 3

InScatterplotClusters amp points

OutLabels for clusters

WhyWhat

ProduceAnnotate

In ScatterplotIn Clusters amp pointsOut Labels for clusters

wombat

bull attribute aggregationndash derive low-dimensional target space from high-dimensional measured space

46

Datasets

WhatAttributes

Dataset Types

Data Types

Data and Dataset Types

Tables

Attributes (columns)

Items (rows)

Cell containing value

Networks

Link

Node (item)

Trees

Fields (Continuous)

Geometry (Spatial)

Attributes (columns)

Value in cell

Cell

Multidimensional Table

Value in cell

Items Attributes Links Positions Grids

Attribute Types

Ordering Direction

Categorical

OrderedOrdinal

Quantitative

Sequential

Diverging

Cyclic

Tables Networks amp Trees

Fields Geometry Clusters Sets Lists

Items

Attributes

Items (nodes)

Links

Attributes

Grids

Positions

Attributes

Items

Positions

Items

Grid of positions

Position

Trends

Actions

Analyze

Search

Query

Why

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Network Data

Spatial Data

Topology

Paths

Extremes

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

Identify Compare Summarize

tag

Target known Target unknown

Location knownLocation unknown

Lookup

Locate

Browse

Explore

Targets

Why

What

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

algorithm

idiom

abstraction

domain

More Informationbull this talk

httpwwwcsubcca~tmmtalkshtmlvad15d3

bull book page (including tutorial lecture slides)httpwwwcsubcca~tmmvadbook

ndash 20 promo code for book+ebook combo HVN17

ndash httpwwwcrcpresscomproductisbn9781466508910

ndash illustrations Eamonn Maguire

bull papers videos software talks full courses httpwwwcsubccagroupinfovis httpwwwcsubcca~tmm

47Munzner A K Peters Visualization Series CRC Press Visualization Series 2014

Visualization Analysis and Design

tamaramunzner

Page 49: Visualization Analysis & Designii.uib.no/vis_old/publications/pdfs/2016-01-17--vad15d3unconf--edited-by-HH.pdf · Visualization is suitable when there is a need to augment human capabilities

Idiom Dimensionality reduction for documents

45

Task 1

InHD data

Out2D data

ProduceIn High- dimensional data

WhyWhat

Derive

In2D data

Task 2

Out 2D data

HowWhyWhat

EncodeNavigateSelect

DiscoverExploreIdentify

In 2D dataOut ScatterplotOut Clusters amp points

OutScatterplotClusters amp points

Task 3

InScatterplotClusters amp points

OutLabels for clusters

WhyWhat

ProduceAnnotate

In ScatterplotIn Clusters amp pointsOut Labels for clusters

wombat

bull attribute aggregationndash derive low-dimensional target space from high-dimensional measured space

46

Datasets

WhatAttributes

Dataset Types

Data Types

Data and Dataset Types

Tables

Attributes (columns)

Items (rows)

Cell containing value

Networks

Link

Node (item)

Trees

Fields (Continuous)

Geometry (Spatial)

Attributes (columns)

Value in cell

Cell

Multidimensional Table

Value in cell

Items Attributes Links Positions Grids

Attribute Types

Ordering Direction

Categorical

OrderedOrdinal

Quantitative

Sequential

Diverging

Cyclic

Tables Networks amp Trees

Fields Geometry Clusters Sets Lists

Items

Attributes

Items (nodes)

Links

Attributes

Grids

Positions

Attributes

Items

Positions

Items

Grid of positions

Position

Trends

Actions

Analyze

Search

Query

Why

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Network Data

Spatial Data

Topology

Paths

Extremes

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

Identify Compare Summarize

tag

Target known Target unknown

Location knownLocation unknown

Lookup

Locate

Browse

Explore

Targets

Why

What

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

algorithm

idiom

abstraction

domain

More Informationbull this talk

httpwwwcsubcca~tmmtalkshtmlvad15d3

bull book page (including tutorial lecture slides)httpwwwcsubcca~tmmvadbook

ndash 20 promo code for book+ebook combo HVN17

ndash httpwwwcrcpresscomproductisbn9781466508910

ndash illustrations Eamonn Maguire

bull papers videos software talks full courses httpwwwcsubccagroupinfovis httpwwwcsubcca~tmm

47Munzner A K Peters Visualization Series CRC Press Visualization Series 2014

Visualization Analysis and Design

tamaramunzner

Page 50: Visualization Analysis & Designii.uib.no/vis_old/publications/pdfs/2016-01-17--vad15d3unconf--edited-by-HH.pdf · Visualization is suitable when there is a need to augment human capabilities

46

Datasets

WhatAttributes

Dataset Types

Data Types

Data and Dataset Types

Tables

Attributes (columns)

Items (rows)

Cell containing value

Networks

Link

Node (item)

Trees

Fields (Continuous)

Geometry (Spatial)

Attributes (columns)

Value in cell

Cell

Multidimensional Table

Value in cell

Items Attributes Links Positions Grids

Attribute Types

Ordering Direction

Categorical

OrderedOrdinal

Quantitative

Sequential

Diverging

Cyclic

Tables Networks amp Trees

Fields Geometry Clusters Sets Lists

Items

Attributes

Items (nodes)

Links

Attributes

Grids

Positions

Attributes

Items

Positions

Items

Grid of positions

Position

Trends

Actions

Analyze

Search

Query

Why

All Data

Outliers Features

Attributes

One ManyDistribution Dependency Correlation Similarity

Network Data

Spatial Data

Topology

Paths

Extremes

ConsumePresent EnjoyDiscover

ProduceAnnotate Record Derive

Identify Compare Summarize

tag

Target known Target unknown

Location knownLocation unknown

Lookup

Locate

Browse

Explore

Targets

Why

What

Encode

ArrangeExpress Separate

Order Align

Use

Manipulate Facet Reduce

Change

Select

Navigate

Juxtapose

Partition

Superimpose

Filter

Aggregate

Embed

How

Encode Manipulate Facet Reduce

Map

Color

Motion

Size Angle Curvature

Hue Saturation Luminance

Shape

Direction Rate Frequency

from categorical and ordered attributes

algorithm

idiom

abstraction

domain

More Informationbull this talk

httpwwwcsubcca~tmmtalkshtmlvad15d3

bull book page (including tutorial lecture slides)httpwwwcsubcca~tmmvadbook

ndash 20 promo code for book+ebook combo HVN17

ndash httpwwwcrcpresscomproductisbn9781466508910

ndash illustrations Eamonn Maguire

bull papers videos software talks full courses httpwwwcsubccagroupinfovis httpwwwcsubcca~tmm

47Munzner A K Peters Visualization Series CRC Press Visualization Series 2014

Visualization Analysis and Design

tamaramunzner

Page 51: Visualization Analysis & Designii.uib.no/vis_old/publications/pdfs/2016-01-17--vad15d3unconf--edited-by-HH.pdf · Visualization is suitable when there is a need to augment human capabilities

More Informationbull this talk

httpwwwcsubcca~tmmtalkshtmlvad15d3

bull book page (including tutorial lecture slides)httpwwwcsubcca~tmmvadbook

ndash 20 promo code for book+ebook combo HVN17

ndash httpwwwcrcpresscomproductisbn9781466508910

ndash illustrations Eamonn Maguire

bull papers videos software talks full courses httpwwwcsubccagroupinfovis httpwwwcsubcca~tmm

47Munzner A K Peters Visualization Series CRC Press Visualization Series 2014

Visualization Analysis and Design

tamaramunzner