march 3, 2015 john brosz - tfdl - dataforgood.pdf · phd & msc in computer science 3d computer...

March 3, 2015 John Brosz


Page 1: March 3, 2015 John Brosz - TFDL - DataForGood.pdf · PhD & MSc in Computer Science 3D Computer Graphics: 3D modeling, Non-photorealistic Rendering, Multiresolution Post-Doc at the

March 3, 2015

John Brosz


PhD & MSc in Computer Science

3D Computer Graphics: 3D modeling, Non-photorealistic Rendering, Multiresolution

Post-Doc at the InnoVis Lab (HCI & InfoVis)

Focus+Context viewing, touch-based interfaces/NUI, Tree Vis

Visualization Research Coordinator, Visualization Studio

Programming – workshops, help with visualizations

Other Projects


Tour Time!


At the University we see a wide variety

Focusing on free

1. Preparing Data

2. Visualization

Data Wrangler

Interactive tool for cleaning & rearranging data

Suggests changes

Web tool – sends data to external site

Import: text, CSV, JSON

Export: CSV, JSON

No longer being developed (since 2011)

http://vis.stanford.edu/wrangler/

OpenRefine

Consolidate spelling

Auto-detect outliers

Sorting & filtering

Auto-suggests changes

Import: Excel, XML, JSON, RDF, CSV

Export: Excel, CSV, ODF, HTML

http://openrefine.org/
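The "consolidate spelling" idea can be illustrated with a small key-collision sketch: values that normalize to the same key are treated as likely variants of one another. This is only an approximation written for illustration, not OpenRefine's actual implementation; the function names `fingerprint` and `cluster_spellings` are hypothetical.

```python
import string
from collections import defaultdict


def fingerprint(value):
    """Normalize a string so that trivially different spellings collide."""
    # Assumption: case, punctuation, extra whitespace, and word order are noise.
    cleaned = value.strip().lower()
    cleaned = cleaned.translate(str.maketrans("", "", string.punctuation))
    tokens = sorted(set(cleaned.split()))
    return " ".join(tokens)


def cluster_spellings(values):
    """Group values that share a fingerprint; keep only groups with variants."""
    groups = defaultdict(set)
    for value in values:
        groups[fingerprint(value)].add(value)
    return [sorted(group) for group in groups.values() if len(group) > 1]


if __name__ == "__main__":
    # Illustrative values only.
    for cluster in cluster_spellings(["New York", "new york ", "York, New", "Calgary"]):
        print(cluster)
```

Each returned cluster is a candidate for merging into a single spelling; a person should still review the suggestions before applying them.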

Mr. Data Converter

Import: CSV, TSV, copy & paste from Excel

Export: JSON, ASP, ActionScript, MySQL, PHP, Python, Ruby, HTML, XML

http://shancarter.github.io/mr-data-converter/
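The same kind of conversion can be approximated offline with Python's standard library. This is a minimal sketch, assuming delimited text with a header row; the function name `csv_to_json` is hypothetical and is not part of Mr. Data Converter.

```python
import csv
import io
import json


def csv_to_json(text, delimiter=","):
    """Convert delimited text with a header row into a JSON array of objects."""
    reader = csv.DictReader(io.StringIO(text), delimiter=delimiter)
    # Every value stays a string; type inference is deliberately left out.
    rows = [dict(row) for row in reader]
    return json.dumps(rows, indent=2)


if __name__ == "__main__":
    # Illustrative data only.
    print(csv_to_json("city,count\nA,10\nB,20\n"))
```

Passing `delimiter="\t"` handles tab-separated input, such as cells pasted from a spreadsheet.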

Tabula

Extract data from PDFs

Stand-alone app for Windows/Mac

Interactively select table

Output: CSV, Excel

http://tabula.nerdpower.org/


General Purpose

Maps & GIS

Text Analysis

Programming/APIs

Tableau

Strengths:

Many chart types

Interactive web output

Access to underlying data

Many data sources (live)

Drag & drop – easy to experiment

Maps

Great defaults

Link visualizations

Academic Program - Free for students

http://www.tableau.com/


Tableau

Explicitly addresses visual variables

Interact with chart legend

Mapping is easier

Story points – sequential series of data views

Bottom-up

Focused on interface

Stacked Area, Gantt, Bubble

Spotfire

Interesting “first guess” visualizations

API

Stronger statistical tools

Better TreeMap

Top-Down

Focused on performance

Parallel Coordinate Plot, 3D Scatterplot


As the alphabet is to reading, visual variables are to visualization.

Jacques Bertin’s Sémiologie Graphique

Jock Mackinlay significantly furthered this research in the ’80s (hint: he’s now a VP at Tableau)

Google Fusion Tables

Merge others’ data into your own table

Collaborative data gathering

Great for maps

Resolves geocoding

Data API

Limited data capacity & customization

https://www.google.com/fusiontables

IBM Many Eyes

Web tool – Java/Flash

Many Vis Types

Scatterplot, network diagram, treemap, word tree, tag/word cloud, phrase net

Charts: matrix, bar, bubble, pie

Graphs: line, stack

Maps & cartograms

Data & charts are public

Limited data size (5MB)

http://www-958.ibm.com/software/analytics/manyeyes/

R

Open-source software for statistics & graphics

All sorts of advanced stats: regression, linear/nonlinear models, time series analysis, clustering, nonparametric tests

Data wrangling

Charts & Plots

Command line*

Many add-ons (> 4400)

60+ Resources for R

http://www.computerworld.com/article/2497464/business-intelligence/60-r-resources-to-improve-your-data-skills.html

http://www.r-project.org/

* Several UIs: e.g., Shiny, Quick-R, RStudio, RExcel, etc.

RAW

Create SVG graphics

Data not uploaded (so remains private)

16 charts

API to create new charts

http://app.raw.densitydesign.org
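To show what "create SVG graphics" amounts to, the sketch below writes a tiny horizontal bar chart as an SVG string. It is an illustration only and does not reflect how RAW builds its charts; the function name `bar_chart_svg`, the layout constants, and the fill color are assumptions.

```python
from xml.sax.saxutils import escape


def bar_chart_svg(data, bar_height=20, gap=5, max_width=300):
    """Return an SVG document string with one horizontal bar per (label, value) pair."""
    largest = max(value for _, value in data)
    height = len(data) * (bar_height + gap)
    parts = [
        f'<svg xmlns="http://www.w3.org/2000/svg" '
        f'width="{max_width + 100}" height="{height}">'
    ]
    for i, (label, value) in enumerate(data):
        y = i * (bar_height + gap)
        # Bar length encodes the value, scaled so the largest value fills max_width.
        width = max_width * value / largest if largest else 0
        parts.append(
            f'<rect x="100" y="{y}" width="{width:.1f}" '
            f'height="{bar_height}" fill="steelblue"/>'
        )
        parts.append(f'<text x="0" y="{y + bar_height - 5}">{escape(label)}</text>')
    parts.append("</svg>")
    return "\n".join(parts)


if __name__ == "__main__":
    # Illustrative data only.
    print(bar_chart_svg([("a", 1), ("b", 2)]))
```

Saving the returned string to a `.svg` file gives a vector graphic that opens in a browser or a vector editor.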

Voyant Tools

Import: txt, HTML, XML, PDF, RTF, & Word

lexical analysis

frequency and distribution

Export: XML, TSV, HTML widgets

http://voyant-tools.org/
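Word frequency and distribution can be sketched in a few lines of Python. This is a simplified illustration, not how Voyant works internally; it assumes English text made of ASCII letters, and the function names are hypothetical.

```python
import re
from collections import Counter


def tokenize(text):
    """Split text into lowercase words (ASCII letters and apostrophes only)."""
    return re.findall(r"[a-z']+", text.lower())


def word_frequencies(text):
    """Count how often each word occurs in the text."""
    return Counter(tokenize(text))


def distribution(text, term, segments=4):
    """Count occurrences of `term` in each of `segments` consecutive slices of the text."""
    words = tokenize(text)
    counts = [0] * segments
    for i, word in enumerate(words):
        if word == term.lower():
            # Map word position i to its segment index.
            counts[i * segments // len(words)] += 1
    return counts


if __name__ == "__main__":
    sample = "the cat saw the dog and the bird"
    print(word_frequencies(sample).most_common(3))
    print(distribution(sample, "the"))
```

The frequency counts answer "which words dominate?", while the distribution shows where in the document a term clusters.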

QGIS

Open Source alternative to ArcGIS

Linux, Windows, Mac OS X, Unix, Android

http://www.qgis.org/

OpenHeatMap

Uses OpenStreetMap tiles

Generates choropleths or heat maps that can be embedded into web pages

http://www.openheatmap.com/
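A choropleth shades each region according to the class its value falls into. The sketch below uses equal-width classes, which is an assumption chosen for simplicity and not necessarily the scheme OpenHeatMap applies; the function names and the palette are hypothetical.

```python
def equal_interval_classes(values, n_classes=5):
    """Assign each value a class index from 0 to n_classes - 1 using equal-width bins."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0] * len(values)
    width = (hi - lo) / n_classes
    # The maximum value would land in class n_classes, so clamp it into the top class.
    return [min(int((v - lo) / width), n_classes - 1) for v in values]


def choropleth_colors(region_values, palette):
    """Map {region: value} to {region: color}, using one palette entry per class."""
    regions = list(region_values)
    classes = equal_interval_classes([region_values[r] for r in regions], len(palette))
    return {region: palette[c] for region, c in zip(regions, classes)}


if __name__ == "__main__":
    # Illustrative values and a light-to-dark palette.
    palette = ["#eff3ff", "#6baed6", "#08519c"]
    print(choropleth_colors({"A": 0, "B": 50, "C": 100}, palette))
```

Equal-width bins are easy to explain but can hide variation in skewed data, where quantile-based classes are a common alternative.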

Gephi

Windows/Linux/OS X

Can handle 50K nodes & 1M edges

Interactive

Filter

Dynamic layout

Clustering/hierarchies

http://gephi.github.io
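Filtering a network often starts from node degree, the number of edges touching each node. The sketch below is a plain-Python illustration of that idea on an edge list and does not use Gephi's API; the function names are hypothetical.

```python
from collections import Counter


def degrees(edges):
    """Count the edges touching each node in an undirected edge list."""
    counts = Counter()
    for source, target in edges:
        counts[source] += 1
        counts[target] += 1
    return counts


def filter_by_degree(edges, min_degree):
    """Keep only edges whose endpoints both have at least `min_degree` edges."""
    # Single pass over the original degrees; this is not an iterative k-core.
    deg = degrees(edges)
    return [(s, t) for s, t in edges if deg[s] >= min_degree and deg[t] >= min_degree]


if __name__ == "__main__":
    # Illustrative edge list only.
    edges = [("a", "b"), ("a", "c"), ("b", "c"), ("c", "d")]
    print(degrees(edges))
    print(filter_by_degree(edges, 2))
```

Dropping low-degree nodes this way is a quick means of decluttering a dense graph before laying it out.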

NodeXL

Plugin for Excel 2007, 2010, 2013

Allows exploration of network graphs

Additional plugins to import from social networks:

Facebook

Twitter

YouTube

email

http://nodexl.codeplex.com/

Google Charts

Variety of free charts

Web tool that provides URL or full API

24 different charts

Limited customization

https://developers.google.com/chart/

Highcharts

More customizable, interactive, & dynamic than Google Charts

Less work than D3

Free for noncommercial use

http://www.highcharts.com/

D3

Very flexible

Embeds data into the webpage

Animations

Many examples to build from

Need to know HTML, CSS, JavaScript, & SVG

Used extensively by New York Times

Many libraries that build on D3

Poor performance when you reach 10K elements

http://d3js.org/

Processing

Programming language & IDE designed for non-programmers

Used by Artists, Designers, Educators, Researchers

Can be converted to Android & Java

https://processing.org/


J. Brosz, M. A. Nacenta, R. Pusch, S. Carpendale and C. Hurter

Transmogrifiers.org


Videos were removed to reduce file size. To see them again, visit: http://www.transmogrifiers.org/description/
