data visualisation

51
@dmarcous

Upload: daniel-marcous

Post on 16-Aug-2015

97 views

Category:

Data & Analytics


2 download

TRANSCRIPT

Page 1: Data Visualisation

@dmarcous

Page 2: Data Visualisation

“Data Is The New Oil”

Page 3: Data Visualisation

“Data Is The New Soil”David McCandless

Page 4: Data Visualisation

“The 19th century culture was defined by the novel, the 20th century culture was defined by the cinema, and the 21st century culture will be defined by the interface”

Lev Manovich, Visual arts professor and media theorist

Page 5: Data Visualisation

Steps of Data Analytics (Science)

●Defining question●Data Exploration●Data Gathering●Data Preparation●Data Analysis●Data Visualisation

Page 6: Data Visualisation

@dmarcous

●@IDFDBA (DataBase Administrator)Big Data Professional

●@Google – WazeData Wizard - Magic with Data

Page 7: Data Visualisation
Page 8: Data Visualisation
Page 9: Data Visualisation
Page 10: Data Visualisation

Old Roots

Page 11: Data Visualisation

Charles Joseph MinardFrench civil engineerBorn: March 27, 1781Died: October 24, 1870

Page 12: Data Visualisation
Page 13: Data Visualisation

Lon-5 0 5 10

42

44

46

48

50

La

t

@dmarcous

Page 14: Data Visualisation
Page 15: Data Visualisation

Edward Tufte

●Visual Display of Quantitative Information●Beautiful Evidence

Page 16: Data Visualisation

Data-Ink Ratio

Page 17: Data Visualisation
Page 18: Data Visualisation
Page 19: Data Visualisation
Page 20: Data Visualisation
Page 21: Data Visualisation

Grammar of graphicsA graphic is a mapping:from data - to aesthetic attributes (colour, shape, size) of geometric object (points, lines, bar).

Leland Wilkinson, 1999

Page 22: Data Visualisation

ggplot2 packageHadley WickhamAssistant professor of statistics at Rice University, Houston

Page 23: Data Visualisation

Technical Problems -Large Result SetReal Time results

Page 24: Data Visualisation

Serving Database

Indexed ResultsCached Results

Slice Data in MemoryBuild Graphs In Memory

Page 25: Data Visualisation

Combined Architecture

●Batch Processing - Hadoop / Spark

●Stream Processing - Storm / Spark Streaming

●Serving / Online Indexing - HBase / solr / elastic / mongo

●Visualise - R / python /+ javascript (D3.js)

Page 26: Data Visualisation

Dimensions of Data

NumbersNumbers

DistanceDistance

ShapesShapes

LocationLocation

OpacityOpacity SizeSize

WordsWords

Page 27: Data Visualisation

Amount of data VS Human Readability

We don’t want only matter experts to be able to understand the visualisation

Page 28: Data Visualisation

Techniques

Page 29: Data Visualisation

Case Study 1

●Bush VS Gore election results●Divided by Ethnicity (white/ black/ hispanic)●Divided according to size of population

Page 30: Data Visualisation

Red - BushBlack - Gore

Page 31: Data Visualisation

Case Study 2

●Display specific area attack patterns○ Time of attack○ Intensity of response (we want to minimize casualties from our side)

●Take seasonality into account (day of week)●Display level of confidence●Bonus - Add “automatic pattern discovery” / Recommendation

for attack time

Page 32: Data Visualisation
Page 33: Data Visualisation

Case Study 3

●Most Photographed Places on Earth●Personalise to specific user taken photos●Compare specific user photo location to popularity of location

Page 34: Data Visualisation
Page 35: Data Visualisation

Case Study 4

●What customers do with my app?●Distribution of entry points●“Paths to success”

Page 36: Data Visualisation
Page 37: Data Visualisation
Page 38: Data Visualisation

Case Study 5

●Detect and present Anomalies in data (machine data / user patterns / irregular traffic etc’)○ Anomaly = out of ordinary.○ Learn “normal”○ Detect Global anomalies○ Detect normal anomalies○ Detect change of trends

Page 39: Data Visualisation

Twitter Breakout Detection

Page 40: Data Visualisation

Case Study 6

●Assess and display the effect of an event on a certain metric○ Learn normal pattern○ Predict according to normal pattern○ compute residuals (predicted-actual)

●Did the storm affected number of accidents?●Did a military action affected number of missiles sent?

Page 41: Data Visualisation

Google Causal Impact

Page 42: Data Visualisation

Case Study 7

●Build a stock portfolio with minimised risk●Pick stocks that are doing well (easy…)●Pick stocks that act different from each other (correlations)

Page 43: Data Visualisation

Clustering Analysis

Page 44: Data Visualisation

Interactive Visualisation

Page 45: Data Visualisation
Page 46: Data Visualisation

Data Art

Page 47: Data Visualisation
Page 48: Data Visualisation

data-waze.com

Page 49: Data Visualisation

Open source is good for me, I will fully embrace it.Open source is good for me, I will fully embrace it.Open source is good for me, I will fully embrace it.Open source is good for me, I will fully embrace it.Open source is good for me, I will fully embrace it.Open source is good for me, I will fully embrace it.Open source is good for me, I will fully embrace it.Open source is good for me, I will fully embrace it.Open source is good for me, I will fully embrace it.Open source is good for me, I will fully embrace it.Open source is good for me, I will fully embrace it.Open source is good for me, I will fully embrace it.Open source is good for me, I will fully embrace it.

Page 50: Data Visualisation
Page 51: Data Visualisation

Contact

●Daniel Marcous●[email protected]