information visualisation - lecture 2

59
@wassx #ILV Informationsvisualisierungen Information Visualisation Information Visualisation Lecture 2 - Data

Upload: stefan-wasserbauer

Post on 15-Apr-2017

265 views

Category:

Data & Analytics


3 download

TRANSCRIPT

@wassx#ILV Informationsvisualisierungen

Information Visualisation

Information Visualisation

Lecture 2 - Data

#ILV Informationsvisualisierungen 2

Types of Data

#ILV Informationsvisualisierungen 3

Types of Data

Our goal of visualisation research is to transform data into a perceptually efficient visual format.

Therefore we must be able to say something about types of data to visualise.

#ILV Informationsvisualisierungen 4

Types of Data

For example:

„Color coding is good for stock-market symbols, but texture coding is good for geological maps.“

#ILV Informationsvisualisierungen 5

Types of Data

Better?

„Color coding is good for category information.“

or

„Motion coding is good for highlighting selected data.“

#ILV Informationsvisualisierungen 6

Types of Data

https://en.wikipedia.org/wiki/Jacques_Bertin

Jacques Bertin„..was a French cartographer and theorist, known from his book Semiologie Graphique (Semiology of Graphics), published in 1967. This monumental work, … represents the first and widest intent to provide a theoretical foundation to Information Visualization.“

#ILV Informationsvisualisierungen 7

Types of Data

Jacques Bertin

… suggested that there are two fundamental forms of data:

1. Data values (Entities) 2. Data structures (Relationships)

#ILV Informationsvisualisierungen 8

Types of Data

Entities are the objects we wish to visualise, relations define structures and patterns that relate entities. Sometimes relations are provided explicitly, sometimes the discovery of relations is the main purpose of a visualisation.

Entity / Relation

#ILV Informationsvisualisierungen 9

Types of Data

Entities

... are generally objects of interest.

e.g. people, cars,... but groups too: traffic jams

http://www.shutterstock.com/video/clip-476470-stock-footage-stand-and-wait-people-silhouette.html http://www.iconsfind.com/20140406/transport-traffic-jam-icons/

#ILV Informationsvisualisierungen 10

Types of Data

Entities

#ILV Informationsvisualisierungen 11

Types of Data

Relationships

... form the structures that relate entities.

e.g. "Part-of" relationship, structural, physical, causal, temporal

#ILV Informationsvisualisierungen 12

Types of Data

Relationships

#ILV Informationsvisualisierungen 13

Types of Data

Part-of

#ILV Informationsvisualisierungen 14

Types of Data

Hierarchical

#ILV Informationsvisualisierungen 15

Types of Data

http://www.nytimes.com/interactive/2013/02/20/movies/among-the-oscar-contenders-a-host-of-connections.html?_r=0

#ILV Informationsvisualisierungen 16

Types of Data

Attributes of Entities or Relationships

... property of an entity and cannot be thought of independently.

e.g. color of apple, duration of journey

#ILV Informationsvisualisierungen 17

Types of Data

Attributes of Entities or Relationships

... property of an entity and cannot be thought of independently.

e.g. color of apple, duration of journey

How about the salary of an employee?

#ILV Informationsvisualisierungen 18

Types of Data

Data Dimensions: 1D, 2D, 3D,..

Attribute of an entity can have multiple dimensions.

Single scalar Weight of a person

#ILV Informationsvisualisierungen 19

Types of Data

Data Dimensions: 1D, 2D, 3D,..

Attribute of an entity can have multiple dimensions.

Single scalar Weight of a personVector quantity Direction of person walking

#ILV Informationsvisualisierungen 20

Types of Data

Data Dimensions: 1D, 2D, 3D,..

Attribute of an entity can have multiple dimensions.

Single scalar Weight of a personVector quantity Direction of person walkingTensors Direction and shear forces

#ILV Informationsvisualisierungen 21

https://www.windyty.com/?48.137,13.975,4

#ILV Informationsvisualisierungen 22

Types of Numbers

#ILV Informationsvisualisierungen 23

Types of Numbers

https://en.wikipedia.org/wiki/Stanley_Smith_Stevens

Stanley Smith StevensAmerican psychologist

„In 1946 he introduced a theory of levels of measurement widely used by scientists but criticized by statisticians.“

#ILV Informationsvisualisierungen 24

Types of Numbers

Taxonomy of number scales by statistician Stevens (1946)

• Nominal• Ordinal• Interval• Ratio

Stevens, S. S. (1946). On the theory of scales of measurement. Science, 103, 677–680.

#ILV Informationsvisualisierungen 25

Types of Numbers

Nominal

Labeling function

Fruit can be classified into apples, bananas, oranges,…

No sense in which fruit can be ordered in a sequence.

Sometimes numbers are used this way (bus line)

„Rejected“, Don Hertzfeld, 2000

#ILV Informationsvisualisierungen 26

Types of Numbers

Ordinal

Numbers used to order things in a sequence.

The position of an item in a list is an ordinal quality.

Ranking items (e.g. itunes) in order of preference

#ILV Informationsvisualisierungen 27

Types of Numbers

IntervalGap between data valuesTime of departure and time of arrival of e.g. a trainHas no meaningful (absence) zero point (11:13 - 15:26)

#ILV Informationsvisualisierungen 28

Types of Numbers

Ratio

Full expressive power of a real number.

Statements: „Object A is twice as large as object B“

E.g. mass of an object, money,…

Use of ratio scale implies a zero value used as reference

#ILV Informationsvisualisierungen 29

Data „Add-ons“

#ILV Informationsvisualisierungen 30

Data „Add-ons“

Uncertainty

Common for science and engineering to attach uncertainty attribute.

Estimating uncertainty is a major part of engineering practice.

Important to show uncertainty in a visualisation: Visual object suggests literal concrete quality, which makes the viewer think it is accurate.

#ILV Informationsvisualisierungen 31

Data „Add-ons“

Metadata

… is data about data.

E.g. who collected it, which transformations used, uncertainty,..

Visualisation is challenging due to additional complexity.

image resource: http://house-co.com/blog/why-metadata-should-be-the-love-of-your-life/

#ILV Informationsvisualisierungen 32

Data „Add-ons“

Operations Considered as Data• Mathematical operations on numbers

#ILV Informationsvisualisierungen 33

Data „Add-ons“

Operations Considered as Data• Mathematical operations on numbers • Merging two lists

#ILV Informationsvisualisierungen 34

Data „Add-ons“

Operations Considered as Data• Mathematical operations on numbers • Merging two lists • Inverting a value to create opposite

#ILV Informationsvisualisierungen 35

Data „Add-ons“

Operations Considered as Data• Mathematical operations on numbers • Merging two lists • Inverting a value to create opposite • Bringing an entity or relationship to existence

#ILV Informationsvisualisierungen 36

Data „Add-ons“

Operations Considered as Data• Mathematical operations on numbers • Merging two lists • Inverting a value to create opposite • Bringing an entity or relationship to existence • Deleting an entity or relationship

#ILV Informationsvisualisierungen 37

Data „Add-ons“

Operations Considered as Data• Mathematical operations on numbers • Merging two lists • Inverting a value to create opposite • Bringing an entity or relationship to existence • Deleting an entity or relationship • Transforming an entity in some way (caterpillar turns into a butterfly)

#ILV Informationsvisualisierungen 38

Data „Add-ons“

Operations Considered as Data• Mathematical operations on numbers • Merging two lists • Inverting a value to create opposite • Bringing an entity or relationship to existence • Deleting an entity or relationship • Transforming an entity in some way (caterpillar turns into a butterfly) • Forming a new object out of other object (a pie is baked from apples

and pastry)

#ILV Informationsvisualisierungen 39

Data „Add-ons“

Operations Considered as Data• Mathematical operations on numbers • Merging two lists • Inverting a value to create opposite • Bringing an entity or relationship to existence • Deleting an entity or relationship • Transforming an entity in some way (caterpillar turns into a butterfly) • Forming a new object out of other object (a pie is baked from apples

and pastry) • Splitting a single entity into its component parts (disassemble machine)

#ILV Informationsvisualisierungen 40

Hands-on #2a

#ILV Informationsvisualisierungen 41

Hands-on #2a - Pen & Paper

Short exercise ~15min

Take 3 operations of the list and try to sketch a visual (iconic) representation of it.

http://cs-shop.de/explosionszeichnungen/C10127.htm

#ILV Informationsvisualisierungen 42

Data Aggregations

#ILV Informationsvisualisierungen 43

Data Aggregations

Factoid Series Multiseries SummableMultiseries

SummaryRecords

IndividualTransaction

#ILV Informationsvisualisierungen 44

Data Aggregations

Factoid Series Multiseries SummableMultiseries

SummaryRecords

IndividualTransaction

Limited ability to explore and pivot More options to explore and pivot

#ILV Informationsvisualisierungen 45

Data Aggregations

Level of Aggregation Number of metrics Description

Factoid Maximum context Single data point; No drill-down

Series One metric across an axis Can compare rate of change

Multiseries Several metrics, common axis Can compare rate of change, correlation between metrics

Summable mutliseries Several metrics, common axisCan compare rate of chagne,

correlation between metrics; Can compare percentages to whole

Summary recordsOne record for each item in a

series; Metrics in other series have been aggregated somehow

Items can be compared

Individual transactions One record per instance No aggregation or combination; Maximum drill-down

#ILV Informationsvisualisierungen 46

Data Aggregations

Level of Aggregation Number of metrics Description

Factoid Maximum context Single data point; No drill-down

Series One metric across an axis Can compare rate of change

Multiseries Several metrics, common axis Can compare rate of change, correlation between metrics

Summable mutliseries Several metrics, common axisCan compare rate of chagne,

correlation between metrics; Can compare percentages to whole

Summary recordsOne record for each item in a

series; Metrics in other series have been aggregated somehow

Items can be compared

Individual transactions One record per instance No aggregation or combination; Maximum drill-down

#ILV Informationsvisualisierungen 47

Data Aggregations

Level of Aggregation Number of metrics Description

Factoid Maximum context Single data point; No drill-down

Series One metric across an axis Can compare rate of change

Multiseries Several metrics, common axis Can compare rate of change, correlation between metrics

Summable mutliseries Several metrics, common axisCan compare rate of chagne,

correlation between metrics; Can compare percentages to whole

Summary recordsOne record for each item in a

series; Metrics in other series have been aggregated somehow

Items can be compared

Individual transactions One record per instance No aggregation or combination; Maximum drill-down

#ILV Informationsvisualisierungen 48

Data Aggregations

Level of Aggregation Number of metrics Description

Factoid Maximum context Single data point; No drill-down

Series One metric across an axis Can compare rate of change

Multiseries Several metrics, common axis Can compare rate of change, correlation between metrics

Summable multiseries Several metrics, common axisCan compare rate of chagne,

correlation between metrics; Can compare percentages to whole

Summary recordsOne record for each item in a

series; Metrics in other series have been aggregated somehow

Items can be compared

Individual transactions One record per instance No aggregation or combination; Maximum drill-down

#ILV Informationsvisualisierungen 49

Data Aggregations

Level of Aggregation Number of metrics Description

Factoid Maximum context Single data point; No drill-down

Series One metric across an axis Can compare rate of change

Multiseries Several metrics, common axis Can compare rate of change, correlation between metrics

Summable multiseries Several metrics, common axisCan compare rate of chagne,

correlation between metrics; Can compare percentages to whole

Summary recordsOne record for each item in a

series; Metrics in other series have been aggregated somehow

Items can be compared

Individual transactions One record per instance No aggregation or combination; Maximum drill-down

#ILV Informationsvisualisierungen 50

Data Aggregations

Level of Aggregation Number of metrics Description

Factoid Maximum context Single data point; No drill-down

Series One metric across an axis Can compare rate of change

Multiseries Several metrics, common axis Can compare rate of change, correlation between metrics

Summable multiseries Several metrics, common axisCan compare rate of chagne,

correlation between metrics; Can compare percentages to whole

Summary recordsOne record for each item in a

series; Metrics in other series have been aggregated somehow

Items can be compared

Individual transactions One record per instance No aggregation or combination; Maximum drill-down

#ILV Informationsvisualisierungen 51

Data Aggregations

Factoid

A factoid is a piece of trivia. It is calculated from source data, but chosen to emphasise a particular point.

„36.7% of coffee in 2000 was consumed by women“

Factoid Series Multiseries SummableMultiseries

SummaryRecords

IndividualTransaction

#ILV Informationsvisualisierungen 52

Data Aggregations

Series

This is one type of information (the dependent variable) compared to another (the independent variable).Often the independent variable is time.

0

17,5

35

52,5

70

April Mai Juni Juli0

1,25

2,5

3,75

5

Peter Mary Charles Marty

Factoid Series Multiseries SummableMultiseries

SummaryRecords

IndividualTransaction

#ILV Informationsvisualisierungen 53

Data Aggregations

Multiseries

A multiseries dataset has several dependent variables and one independent.

0

22,5

45

67,5

90

April Mai Juni Juli

male female

Factoid Series Multiseries SummableMultiseries

SummaryRecords

IndividualTransaction

#ILV Informationsvisualisierungen 54

Data AggregationsSummable Multiseries

Multiseries which are subgroups are stacked to give an impression of the overall sum.

0

37,5

75

112,5

150

April Mai Juni Juli

male female

Factoid Series Multiseries SummableMultiseries

SummaryRecords

IndividualTransaction

#ILV Informationsvisualisierungen 55

Data AggregationsSummary Records

Keeps dataset fairly small, suggests ways how to explore data.

Factoid Series Multiseries SummableMultiseries

SummaryRecords

IndividualTransaction

Name Gender Occurrance A Occurrance B Total

Mary F 5 9 14

Charles M 2 8 10

Marty M 3 2 5

Peter M 2 8 10

Sum 12 27 39

#ILV Informationsvisualisierungen 56

Data AggregationsIndividualTransactions

Transactional records capture things about a specific event. No aggregation of the data.

Factoid Series Multiseries SummableMultiseries

SummaryRecords

IndividualTransaction

#ILV Informationsvisualisierungen 57

Data AggregationsIndividualTransactions

Factoid Series Multiseries SummableMultiseries

SummaryRecords

IndividualTransaction

Timestamp Name Gender Type of Occurrance13:00 Paul M A13:14 Bob M A14:34 Charly M B14:55 Simon M A15:23 Mary F B15:25 Betty F A16:11 Peter M B17:01 Lisa F B18:23 Betty F A20:09 Mary F A

#ILV Informationsvisualisierungen 58

Hands-on #2bVisit following websites for datasets you are interested in:

http://data.un.org

https://www.google.com/trends/

Try to find datasets which you could set in relation to explain a „theory“.

For example: alcohol deaths vs. weather trend

You are allowed to find most ridiculous datasets. The goal is to filter, aggregate and visualize the data to make a statement which you support with the visualization. Make us curious. So data first, attractive visual design is secondary.

Use your available tools (excel, openoffice, google charts,…)

Keep in mind: simple bar charts, scatter plots,… are enough to tell the story. -> Keep it simple and clear.

Upload a zip file, containing datasets and screenshots of charts. Add JS code if used. Don’t forget to document progress.

http://www.targetmap.com/viewer.aspx?reportId=7830

#ILV Informationsvisualisierungen 59

Push conferenceAudree Lapierre@ffunctionhttp://itsmylife.cancer.cahttp://earthinsights.org

http://dataveyes.com/#!/en@dataveyesCaroline Goulard

http://audreelapierre.com/

http://dataveyes.com/#!/en/case-studies/identite-generative