![Page 1: Its place in Software Engineering Jonathan Reese](https://reader035.vdocument.in/reader035/viewer/2022062500/56649da05503460f94a8b4b9/html5/thumbnails/1.jpg)
Data VisualizationIts place in Software Engineering
Jonathan Reese
![Page 2: Its place in Software Engineering Jonathan Reese](https://reader035.vdocument.in/reader035/viewer/2022062500/56649da05503460f94a8b4b9/html5/thumbnails/2.jpg)
Visual representation of data Broad topic! Software creates new possibilities
◦ Powerful and automatic generation◦ Massive amounts of data
Make data◦ Understandable◦ Manageable◦ Exciting
Introduction
![Page 3: Its place in Software Engineering Jonathan Reese](https://reader035.vdocument.in/reader035/viewer/2022062500/56649da05503460f94a8b4b9/html5/thumbnails/3.jpg)
Population Density
![Page 4: Its place in Software Engineering Jonathan Reese](https://reader035.vdocument.in/reader035/viewer/2022062500/56649da05503460f94a8b4b9/html5/thumbnails/4.jpg)
![Page 5: Its place in Software Engineering Jonathan Reese](https://reader035.vdocument.in/reader035/viewer/2022062500/56649da05503460f94a8b4b9/html5/thumbnails/5.jpg)
UWP (official) Facebook Posts
![Page 6: Its place in Software Engineering Jonathan Reese](https://reader035.vdocument.in/reader035/viewer/2022062500/56649da05503460f94a8b4b9/html5/thumbnails/6.jpg)
We generate and store data at a much faster rate than we understand it◦ Corporation’s records◦ Police reports◦ Statistics and Surveys
Tap into the wealth of data
Big Data
![Page 7: Its place in Software Engineering Jonathan Reese](https://reader035.vdocument.in/reader035/viewer/2022062500/56649da05503460f94a8b4b9/html5/thumbnails/7.jpg)
Automated process of interpreting data to extract more specific information.
Data mining is not a part of data visualization
Both are independent, but can be used together
Data Mining
![Page 8: Its place in Software Engineering Jonathan Reese](https://reader035.vdocument.in/reader035/viewer/2022062500/56649da05503460f94a8b4b9/html5/thumbnails/8.jpg)
225,000 users gave names to 5,000,000 randomly generated colors.◦ Big data
He created an algorithm to extract commonly understood color names and their values from the data◦ Data mining
A 3D rotating model was created to show the spectrum of color names.◦ Data visualization
Munroe’s Color Survey
![Page 9: Its place in Software Engineering Jonathan Reese](https://reader035.vdocument.in/reader035/viewer/2022062500/56649da05503460f94a8b4b9/html5/thumbnails/9.jpg)
Make it easier to find what you are looking for in data◦ Exclude unwanted information◦ Represent numbers visually
Colors Distance Location Size
Visualization can assist more than just end users
Utility in Software
![Page 10: Its place in Software Engineering Jonathan Reese](https://reader035.vdocument.in/reader035/viewer/2022062500/56649da05503460f94a8b4b9/html5/thumbnails/10.jpg)
Valve monitors and records statistics from their players.
They use data visualization to make sense of their own recordings
Heat maps◦ Valve monitors the frequency and location where
action takes place in their FPS games◦ Helps them make map design choices, and
monitor effects of changes
Valve’s Heat Maps
![Page 11: Its place in Software Engineering Jonathan Reese](https://reader035.vdocument.in/reader035/viewer/2022062500/56649da05503460f94a8b4b9/html5/thumbnails/11.jpg)
Valve’s Heat Maps
![Page 12: Its place in Software Engineering Jonathan Reese](https://reader035.vdocument.in/reader035/viewer/2022062500/56649da05503460f94a8b4b9/html5/thumbnails/12.jpg)
Verizon’s Coverage Map
![Page 13: Its place in Software Engineering Jonathan Reese](https://reader035.vdocument.in/reader035/viewer/2022062500/56649da05503460f94a8b4b9/html5/thumbnails/13.jpg)
A prospective customer of Verizon does not want to know the locations of all the Verizon towers.
The customer wants to see where there is coverage.
A customer can quickly decide if there will be coverage issues by looking at the map
Verizon’s Coverage Map
![Page 14: Its place in Software Engineering Jonathan Reese](https://reader035.vdocument.in/reader035/viewer/2022062500/56649da05503460f94a8b4b9/html5/thumbnails/14.jpg)
Statistical Analysis Software◦ Provides data visualization software◦ Customers purchase their software to essentially
look at their data differently◦ “The Power to Know”
Magnaview◦ “Visualize anything, visualize everything”
Companies providing software that simply makes data viewable
SAS and Magnaview
![Page 15: Its place in Software Engineering Jonathan Reese](https://reader035.vdocument.in/reader035/viewer/2022062500/56649da05503460f94a8b4b9/html5/thumbnails/15.jpg)
SAS
![Page 16: Its place in Software Engineering Jonathan Reese](https://reader035.vdocument.in/reader035/viewer/2022062500/56649da05503460f94a8b4b9/html5/thumbnails/16.jpg)
Data visualization can improve or harm an interface◦ It can overcomplicate an interface◦ Cause confusion◦ Prevent a viewer from finding specific data
It is important to know when it is appropriate
When to Visualize
![Page 17: Its place in Software Engineering Jonathan Reese](https://reader035.vdocument.in/reader035/viewer/2022062500/56649da05503460f94a8b4b9/html5/thumbnails/17.jpg)
Criteria◦ Raw data is overwhelming◦ There is interest in understanding the data◦ The specific numbers are less important than
what the numbers represent◦ We are safe to assume what a viewer will be
looking for in the data
When to Visualize
![Page 18: Its place in Software Engineering Jonathan Reese](https://reader035.vdocument.in/reader035/viewer/2022062500/56649da05503460f94a8b4b9/html5/thumbnails/18.jpg)
There are many methods for visualization filling different niches
Choose a method that ◦ Includes only relevant information◦ Has minimal distractions◦ Represents data in an intuitive way
Choosing How to Visualize
![Page 19: Its place in Software Engineering Jonathan Reese](https://reader035.vdocument.in/reader035/viewer/2022062500/56649da05503460f94a8b4b9/html5/thumbnails/19.jpg)
Pros◦ Interprets big data of media to finding similar
content◦ Intuitive
Connections represent relation Distance represents how related the items are
Cons◦ Distractions
Color and size are meaningless/unexplained Connections are redundant
Compare to a list of related artists
Liveplasma
![Page 20: Its place in Software Engineering Jonathan Reese](https://reader035.vdocument.in/reader035/viewer/2022062500/56649da05503460f94a8b4b9/html5/thumbnails/20.jpg)
Correlation Matrix
![Page 21: Its place in Software Engineering Jonathan Reese](https://reader035.vdocument.in/reader035/viewer/2022062500/56649da05503460f94a8b4b9/html5/thumbnails/21.jpg)
Scenario◦ A convenience store owner has a database with
purchase records of customers◦ The owner wants to reorganize to maximize sales
and customer satisfaction Correlation matrices would be helpful
◦ A matrix for how often item categories are purchased together
◦ A matrix for how often individual items are purchased together
Correlation Matrix
![Page 22: Its place in Software Engineering Jonathan Reese](https://reader035.vdocument.in/reader035/viewer/2022062500/56649da05503460f94a8b4b9/html5/thumbnails/22.jpg)
Correlations between categories would help decide◦ What categories to put in the same aisle◦ What aisles to put next to each other
Correlations between individual items in categories◦ Organize sections appropriately
Correlation Matrix
![Page 23: Its place in Software Engineering Jonathan Reese](https://reader035.vdocument.in/reader035/viewer/2022062500/56649da05503460f94a8b4b9/html5/thumbnails/23.jpg)
Using color to represent numbers works in that example for a couple reasons◦ A large matrix of percentages would not make the
interesting data pop out.◦ The exact percentages are not as important as how
the percentages compare to each other When using color SAS says it is ideal for color
shade represent a value instead of color hue If color is transparent then changes in hue will
be more easily visible (Valve’s heatmaps)
Color
![Page 24: Its place in Software Engineering Jonathan Reese](https://reader035.vdocument.in/reader035/viewer/2022062500/56649da05503460f94a8b4b9/html5/thumbnails/24.jpg)
Represents coordinate data with a point on the map
Useful if location is relevant With Google Maps and other similar
resources it became very easy to add distractions◦ Satellite view◦ Points of Interest◦ Roads
These should be included only if they are relevant
Geographical Visualization
![Page 25: Its place in Software Engineering Jonathan Reese](https://reader035.vdocument.in/reader035/viewer/2022062500/56649da05503460f94a8b4b9/html5/thumbnails/25.jpg)
Violent Crime in Milwaukee by neighborhood
Roads are relevant Satellite view
would add clutter and warp colors.
Geographical Visualization
![Page 26: Its place in Software Engineering Jonathan Reese](https://reader035.vdocument.in/reader035/viewer/2022062500/56649da05503460f94a8b4b9/html5/thumbnails/26.jpg)
Tree Map
![Page 27: Its place in Software Engineering Jonathan Reese](https://reader035.vdocument.in/reader035/viewer/2022062500/56649da05503460f94a8b4b9/html5/thumbnails/27.jpg)
Similar to a pie chart; effective at demonstrating the portion taken up by items
Rectangular containers and items Hierarchical
◦ Containers can hold containers or items Item sizes are based on portion of the
category it is in Thus category sizes are based on the
portion taken up by what it holds
Tree Map
![Page 28: Its place in Software Engineering Jonathan Reese](https://reader035.vdocument.in/reader035/viewer/2022062500/56649da05503460f94a8b4b9/html5/thumbnails/28.jpg)
WinDirStat is an open-source memory management application for Windows
Uses “TreeMap” class from Java’s libraries Turns the big data of a storage device’s file
system into a TreeMap◦ Folders – Categories◦ Files – Items◦ Portion of used memory taken - Size◦ Type of file - Color◦ Glare shows folders without using border space!
Tree Map
![Page 29: Its place in Software Engineering Jonathan Reese](https://reader035.vdocument.in/reader035/viewer/2022062500/56649da05503460f94a8b4b9/html5/thumbnails/29.jpg)
![Page 30: Its place in Software Engineering Jonathan Reese](https://reader035.vdocument.in/reader035/viewer/2022062500/56649da05503460f94a8b4b9/html5/thumbnails/30.jpg)
Customizable and Interactive TreeMap visualization of news headlines◦ Color – Type of news◦ Category – Location◦ Portion – How big the story is (Attention it’s
getting)◦ Color Shade – How “breaking” the news is
Interactive ◦ clicking on a link sends you to the news article
Customizable◦ If there is information that doesn’t interest you,
remove it
NewsMap
![Page 31: Its place in Software Engineering Jonathan Reese](https://reader035.vdocument.in/reader035/viewer/2022062500/56649da05503460f94a8b4b9/html5/thumbnails/31.jpg)
Currently, the internet presents a highly disorganized collage of information. Many of us are working in an information-soaked world. There is too much of everything. We are subject everywhere to a sensory overload of images, bombarded with information; in magazines and advertisements, on TV, radio, in the cityscape. The internet is a wonderful communication tool, but day after day we find ourselves constantly dealing with information overload. Today, the internet presents a new challenge, the wide and unregulated distribution of information requires new visual paradigms to organize, simplify and analyze large amounts of data. New user interface challenges are arising to deal with all that overwhelming quantity of information.
- Markos Weskamp
NewsMap
![Page 32: Its place in Software Engineering Jonathan Reese](https://reader035.vdocument.in/reader035/viewer/2022062500/56649da05503460f94a8b4b9/html5/thumbnails/32.jpg)
Big data ◦ A wealth of useful information, but is
overwhelming◦ Data visualization helps to make big data
manageable Data visualization represents the data in a
meaningful way Different types are useful based on the
situation Visualizations can often be improved by
adding Interactivity and Customizability
Recap
![Page 33: Its place in Software Engineering Jonathan Reese](https://reader035.vdocument.in/reader035/viewer/2022062500/56649da05503460f94a8b4b9/html5/thumbnails/33.jpg)
Data visualization helps make useful discoveries in data
Can make an interface stand out There is room for creativity! We only
covered a few templates Next time you have big data on your hands,
think of it as an opportunity instead of a problem
Conclusion
![Page 34: Its place in Software Engineering Jonathan Reese](https://reader035.vdocument.in/reader035/viewer/2022062500/56649da05503460f94a8b4b9/html5/thumbnails/34.jpg)
[1] Statistical Analysis Solutions (2012). Data Visualization Techniques. Retrieved from http://www.sas.com/reg/wp/corp/51989 [2] Stephen Few (2009). Introduction to Geographical Data Visualization. Retrieved from
http://www.perceptualedge.com/articles/visual_business_intelligence/geographical_data_visualization.pdf [3] Crime safety map (2013). [Geographical data visualization of Milwaukee March 19, 2013] Location Inc.
Retrieved from http://www.neighborhoodscout.com/wi/milwaukee/crime/ [4] Newsmap news feed (2013). [Treemap data visualization of Google News March 19, 2013]
Marcos Weskamp. Retrieved from http://newsmap.jp [5] Marcos Weskamp. Newsmap [pg 6]. Message posted to http://marumushi.com/projects/newsmap [6] WinDirStat Developers (Open Source) (2007). WinDirStat (Version 1.1.2) [Software]. Available from
http://windirstat.info/index.html [7] Verizon coverage locator (2013). [Verizon mobile phone coverage locator March 19, 2013] Verizon. Retrieved
from http://www.verizonwireless.com/b2c/support/coverage-locator [8] MagnaView (2013) Website. http://www.magnaview.com/ [9] Howard Yeend (May 4, 2010). XKCD Colour Survey – a 3D visualization. [Web log post]. Retrieved from
http://www.puremango.co.uk/2010/05/xkcd-color-survey-3d-visualization/ [10] Duncan Graham-Rowe (2007). Mapping the Internet. MIT Technology Review. Retrieved from
http://www.technologyreview.com/news/408104/mapping-the-internet/ [11] Valve (2007). CP_dustbowl heat map. Retrieved from http://www.tfportal.de/?site=news_details&id=447 [12] Time (2013). Population Density US Map. Retrieved from
http://www.time.com/time/interactive/0,31813,1549966,00.html [13] Wordle (2013). Used to generate UWP Facebook Word Map [14] Itoh, T., Yamaguchi, Y.; Ikehaha, Y.; Kajinaga, Y., (2004). Hierarchical data visualization using a fast rectangle-
packing algorithm,” Visualization and Computer Graphics, IEEE Transactions on, vol.10, no.3 pp.302,313. May 2004 doi:http://ieeexplore.ieee.org/xpl/articleDetails.jsp?tp=&arnumber=1272729&contentType=Journals+%26+Magazines&searchField%3DSearch_All%26queryText%3D.QT.Data+visualization.QT.
[15] Yau, N. (2012). Visualize this, the flowingdata guide to design, visualization, and statistics. Wiley.
References