The Art of Data Presentation
©2010 Raj Jain, www.rajjain.com


Post on 10-Sep-2020



Overview
Types of Variables
Guidelines for Preparing Good Charts
Common Mistakes in Preparing Charts
Pictorial Games
Special Charts for Computer Performance
  Gantt Charts
  Kiviat Graphs
  Schumacher Charts
Decision Maker's Games


Types of Variables
Type of computer: supercomputer, minicomputer, microcomputer
Type of workload: scientific, engineering, educational
Number of processors
Response time of system


Guidelines for Preparing Good Charts
Require minimum effort from the reader: direct labeling vs. legend box
Maximize information: words in place of symbols
Clearly label the axes


Guidelines (cont)
Minimize ink: no grid lines, more details
Use commonly accepted practices: origin at (0,0), independent variable (cause) along x axis, linear scales, increasing scales, equal divisions
Avoid ambiguity: show coordinate axes, scale divisions, origin. Identify individual curves and bars.
See checklist in Box 10.1


Common Mistakes in Preparing Charts
Presenting too many alternatives on a single chart
  Max 5 to 7 messages => max 6 curves in a line chart, no more than 10 bars in a bar chart, max 8 components in a pie chart
Presenting many y variables on a single chart
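The limits quoted on this slide can be turned into a quick check before a chart is drawn. The following is a minimal sketch; the dictionary keys and the function name `too_crowded` are illustrative names, not something taken from the slides.

```python
# Limits quoted on the slide: 6 curves, 10 bars, 8 pie components.
# The keys "line", "bar", and "pie" are illustrative labels.
MAX_ITEMS = {"line": 6, "bar": 10, "pie": 8}


def too_crowded(chart_type: str, item_count: int) -> bool:
    """Return True if the chart shows more items than the quoted limit."""
    return item_count > MAX_ITEMS[chart_type]


if __name__ == "__main__":
    # Hypothetical chart: a line chart comparing 9 alternatives.
    if too_crowded("line", 9):
        print("Split this line chart: more than 6 curves.")
```

A check like this only counts items; it says nothing about whether the remaining curves are clearly labeled.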


Common Mistakes in Charts (Cont)
Using symbols in place of text
Placing extraneous information on the chart: grid lines, granularity of the grid lines
Selecting scale ranges improperly: automatic selection by programs may not be appropriate


Common Mistakes in Charts (Cont)
Using a line chart in place of column chart: line => continuity
[Figure: MIPS (y axis) plotted against CPU type (x axis: 8000, 8100, 8200, 8300)]


Pictorial Games
Using non-zero origins to emphasize the difference
Three-quarter high rule => height/width > 3/4


Pictorial Games (Cont)
Using double-whammy graph for dramatization
  Using related metrics


Pictorial Games (Cont)

Plotting random quantities without showing confidence intervals
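To show why the intervals matter, the sketch below computes an approximate confidence interval for the mean of each alternative and checks whether two intervals overlap. It is a rough illustration under stated assumptions: it uses a normal quantile, whereas a t quantile is the usual choice for small samples, and the sample values and function names are hypothetical.

```python
import math
from statistics import NormalDist, mean, stdev


def confidence_interval(samples, level=0.90):
    """Approximate confidence interval for the mean (normal quantile assumed)."""
    z = NormalDist().inv_cdf(0.5 + level / 2)
    half_width = z * stdev(samples) / math.sqrt(len(samples))
    center = mean(samples)
    return center - half_width, center + half_width


def intervals_overlap(a, b):
    """True if the two (low, high) intervals share at least one point."""
    return a[0] <= b[1] and b[0] <= a[1]


if __name__ == "__main__":
    # Hypothetical response times (seconds) for two systems.
    mine = [1.9, 2.4, 2.1, 2.8, 1.7, 2.3]
    yours = [2.2, 2.9, 2.0, 3.1, 2.5, 2.6]
    ci_mine = confidence_interval(mine)
    ci_yours = confidence_interval(yours)
    print("mine:", ci_mine, "yours:", ci_yours)
    print("overlap:", intervals_overlap(ci_mine, ci_yours))
```

When the intervals overlap, a chart showing only the two means can suggest a difference that the data may not support.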


Pictorial Games (Cont)
Pictograms scaled by height
[Figure: two pictograms, "Mine" (performance = 2) and "Yours" (performance = 1)]
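The arithmetic behind this game can be sketched as follows, assuming the pictogram keeps its aspect ratio when its height is scaled, so that width grows along with height. The function names are illustrative.

```python
import math


def perceived_area_ratio(performance_ratio: float) -> float:
    """Area ratio the reader sees when height AND width scale with performance."""
    return performance_ratio ** 2


def fair_linear_scale(performance_ratio: float) -> float:
    """Linear scale factor that makes pictogram area proportional to performance."""
    return math.sqrt(performance_ratio)


if __name__ == "__main__":
    ratio = 2 / 1  # "Mine" = 2, "Yours" = 1, as on the slide
    print("area ratio seen by the reader:", perceived_area_ratio(ratio))
    print("linear scale for a fair picture:", fair_linear_scale(ratio))
```

With the slide's values, a pictogram twice as tall covers four times the area, which overstates a performance ratio of 2.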


Pictorial Games (Cont)
Using inappropriate cell size in histograms
[Figure: two histograms of frequency vs. response time for the same data; one uses cells [0,2), [2,4), [4,6), [6,8), [8,10), [10,12), the other uses cells [0,6), [6,12)]
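The effect of cell size can be reproduced with a few lines of code. This is a small sketch with hypothetical response times (not the data behind the figure); it bins the same values with the two cell widths named in the figure, 2 and 6.

```python
def histogram(values, cell_width, upper):
    """Count values falling in cells [0, w), [w, 2w), ... up to `upper`."""
    counts = [0] * (upper // cell_width)
    for v in values:
        counts[int(v // cell_width)] += 1
    return counts


if __name__ == "__main__":
    # Hypothetical response times, all in [0, 12).
    response_times = [1, 3, 3, 5, 5, 5, 7, 7, 7, 9, 9, 11]
    print("width 2:", histogram(response_times, 2, 12))
    print("width 6:", histogram(response_times, 6, 12))
```

With these values the narrow cells show a peaked distribution, while the two wide cells make the same data look uniform.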


Pictorial Games (Cont)
Using broken scales in column charts
[Figure: two column charts of response time for systems A to F; one with a full y axis from 0 to 12, the other with a broken y axis that jumps from 0 to the range 9 to 12]


Special Charts for Computer Performance
Gantt charts
Kiviat graphs
Schumacher charts


Gantt Charts
Shows relative duration of a number of conditions
[Figure: Gantt chart with rows CPU, IO Channel, and Network against a utilization axis from 0% to 100%; segment labels in row order: 60; 20, 20; 30, 10, 5, 15]
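A Gantt chart of this kind can be approximated in plain text. The sketch below is illustrative only: the profile is hypothetical (the positions of the segments in the slide's figure are not recoverable from this transcript), and `gantt_row` and `render` are made-up helper names. Each busy segment is given as a (start %, length %) pair.

```python
def gantt_row(segments, width=50):
    """Render busy segments, given as (start %, length %) pairs, as one text row."""
    cells = [" "] * width
    for start, length in segments:
        first = round(start * width / 100)
        last = round((start + length) * width / 100)
        for i in range(first, min(last, width)):
            cells[i] = "#"
    return "".join(cells)


def render(profile, width=50):
    """Render one labeled row per resource."""
    label_width = max(len(name) for name in profile)
    return "\n".join(
        f"{name:<{label_width}} |{gantt_row(segments, width)}|"
        for name, segments in profile.items()
    )


if __name__ == "__main__":
    # Hypothetical utilization profile, NOT the data from the slide.
    profile = {
        "CPU": [(0, 60)],
        "IO Channel": [(20, 20), (60, 20)],
        "Network": [(30, 10), (65, 15)],
    }
    print(render(profile))
```

Because all rows share the same horizontal axis, the overlap between resources (for example CPU and IO Channel both busy) is visible by reading down a column.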


Example: Data for Gantt Chart


Draft of the Gantt Chart


Final Gantt Chart


Kiviat Graphs
Radial chart with even number of metrics
HB (higher is better) and LB (lower is better) metrics alternate
Ideal shape: star
[Figure: Kiviat graph with eight axes labeled CPU Busy, CPU in Supervisor State, CPU in Problem State, CPU Wait, Any Channel Busy, Channel Only Busy, CPU/Channel Overlap, CPU Only Busy]


Kiviat Graph for a Balanced System
Problem: inter-related metrics
CPU busy = problem state + supervisor state
CPU wait = 100 - CPU busy
Channel only = any channel - CPU/channel overlap
CPU only = CPU busy - CPU/channel overlap
[Figure: Kiviat graph with eight axes labeled CPU Busy, CPU in Supervisor State, CPU in Problem State, CPU Wait, Any Channel Busy, Channel Only Busy, CPU/Channel Overlap, CPU Only Busy]
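The relations among the metrics on this slide mean that only four of the eight values are independent. As a small sketch (the function name, argument names, and input values are hypothetical), the other four can be derived like this, with all values in percent:

```python
def derive_kiviat_metrics(problem_state, supervisor_state, any_channel, overlap):
    """Derive the dependent metrics from the four independent ones (all in %)."""
    cpu_busy = problem_state + supervisor_state
    return {
        "cpu_busy": cpu_busy,
        "cpu_wait": 100 - cpu_busy,
        "channel_only": any_channel - overlap,
        "cpu_only": cpu_busy - overlap,
    }


if __name__ == "__main__":
    # Hypothetical measurements, not taken from the slides.
    print(derive_kiviat_metrics(problem_state=50, supervisor_state=20,
                                any_channel=60, overlap=40))
```

Since half of the axes are determined by the other half, the shape of the graph carries less independent information than its eight axes suggest.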


Shapes of Kiviat Graphs
[Figure: three Kiviat graph shapes labeled CPU Keel boat, I/O Wedge, I/O Arrow]


Merrill's Figure of Merit (FoM)
Performance = {x_1, x_2, x_3, …, x_{2n}}
Odd values are HB and even values are LB
x_{2n+1} is the same as x_1
Average FoM = 50%
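The FoM formula itself is not preserved in this transcript. The sketch below assumes the form commonly given for Merrill's figure of merit, FoM = sqrt( (1/(2n)) * sum for i = 1..n of (x_{2i-1} + x_{2i+1}) * (100 - x_{2i}) ), with all values in percent. This assumed form agrees with the two statements that do survive on the slide: x_{2n+1} wraps around to x_1, and a system with every metric at 50% scores 50%.

```python
import math


def figure_of_merit(x):
    """Assumed form of Merrill's FoM for metrics x_1..x_2n given in percent.

    x[0], x[2], ... are the HB metrics; x[1], x[3], ... are the LB metrics.
    """
    if len(x) == 0 or len(x) % 2 != 0:
        raise ValueError("need an even, non-zero number of metrics")
    n = len(x) // 2
    total = 0.0
    for j in range(n):
        hb_before = x[2 * j]                 # x_{2i-1} in 1-based notation
        lb = x[2 * j + 1]                    # x_{2i}
        hb_after = x[(2 * j + 2) % len(x)]   # x_{2i+1}, wrapping to x_1
        total += (hb_before + hb_after) * (100 - lb)
    return math.sqrt(total / (2 * n))


if __name__ == "__main__":
    print(figure_of_merit([50] * 8))      # every metric at 50%
    print(figure_of_merit([100, 0] * 4))  # ideal star shape
```

Under this assumed form, the ideal star (all HB metrics at 100%, all LB metrics at 0%) scores 100%.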


Example: FoM
System A:


FoM Example (Cont)
System B:
System B has a higher figure of merit and it is better.


Figure of Merit: Known Problems
All axes are considered equal
Extreme values are assumed to be better
Utility is not a linear function of FoM
Two systems with the same FoM are not necessarily equally good
A system with a slightly lower FoM may be better


Kiviat Graphs For Other Systems
Networks:
[Figure: Kiviat graph with six axes labeled Application Throughput, Packets With Error, Implicit Acks, Duplicate Packets, Link Utilization, Link Overhead]


Schumacher Charts
Performance metrics are plotted in a tabular manner
Values are normalized with respect to long-term means and standard deviations
Any observations that are beyond mean ± one standard deviation need to be explained
See Figure 10.25 in the book
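The normalization and the "one standard deviation" rule can be sketched as follows. The metric names, the long-term statistics, and the function name are hypothetical; only the rule itself comes from the slide.

```python
def flag_observations(observations, long_term):
    """Normalize each observation and list the metrics that need explaining.

    observations: {metric: observed value}
    long_term:    {metric: (long-term mean, long-term standard deviation)}
    """
    normalized = {}
    for name, value in observations.items():
        mean, std = long_term[name]
        normalized[name] = (value - mean) / std
    # Beyond mean +/- one standard deviation means |normalized value| > 1.
    flagged = [name for name, z in normalized.items() if abs(z) > 1]
    return normalized, flagged


if __name__ == "__main__":
    # Hypothetical long-term statistics and one day's observations.
    long_term = {"cpu_util": (50, 10), "disk_util": (30, 5)}
    today = {"cpu_util": 65, "disk_util": 32}
    print(flag_observations(today, long_term))
```

Normalizing first puts metrics with different units on one scale, which is what allows them to share a single tabular chart.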


Performance Analysis Rat Holes
Configuration
Workload
Metrics
Details


Reasons for not Accepting an Analysis
This needs more analysis.
You need a better understanding of the workload.
It improves performance only for long IOs/packets/jobs/files, and most of the IOs/packets/jobs/files are short.
It improves performance only for short IOs/packets/jobs/files, but who cares for the performance of short IOs/packets/jobs/files; it's the long ones that impact the system.
It needs too much memory/CPU/bandwidth and memory/CPU/bandwidth isn't free.
It only saves us memory/CPU/bandwidth and memory/CPU/bandwidth is cheap.

See Box 10.2 on page 162 of the book for a complete list


Summary

1. Qualitative/quantitative, ordered/unordered, discrete/continuous variables

2. Good charts should require minimum effort from the reader and provide maximum information with minimum ink

3. Use no more than 5-6 curves, select ranges properly, three-quarter high rule

4. Gantt charts show utilizations of various components

5. Kiviat graphs show HB and LB metrics alternately on a circular graph

6. Schumacher charts show mean and standard deviations

7. Workload, metrics, configuration, and details can always be challenged. They should be carefully selected.


Exercise 10.1
What type of chart (line or bar) would you use to plot:
a. CPU usage for 12 months of the year
b. CPU usage as a function of time in months
c. Number of I/O's to three disk drives: A, B, and C
d. Number of I/O's as a function of number of disk drives in a system


Exercise 10.2

List the problems with the following charts


Exercise 10.3
A system consists of 3 resources, called A, B, and C. The measured utilizations are shown in the following table. A zero in a column indicates that the resource is not utilized. Draw a Gantt chart showing utilization profiles.


Exercise 10.4
The measured values of the eight performance metrics listed in Example 10.2 for a system are: 70%, 10%, 60%, 20%, 80%, 30%, 50%, and 20%. Draw the Kiviat graph and compute its figure of merit.


Exercise 10.5
For a computer system of your choice, list a number of HB and LB metrics and draw a typical Kiviat graph using data values of your choice.


Homework
Read Chapter 10
Submit solutions to exercises 10.3 and 10.4
Approximate hand-drawn figures are sufficient