think link: network insights with no programming skills

101
Marc A. Smith Chief Social Scientist Connected Action Consulting Group [email protected] http://www.connectedaction.net http://nodexl.codeplex.com/ roject from the Social Media Research Foundation : http:// www.smrfou Think Link! Network Insights with No Programming Skills

Upload: marc-smith

Post on 26-Jan-2015

110 views

Category:

Technology


0 download

DESCRIPTION

Networks are everywhere, but the tools for end users to access, analyze, visualize and share insights into connected structures have been absent. NodeXL, the network overview discovery and exploration add-in for Excel makes network analysis as easy as making a pie chart.

TRANSCRIPT

Page 1: Think Link: Network Insights with No Programming Skills

Marc A. SmithChief Social ScientistConnected Action Consulting [email protected]://www.connectedaction.nethttp://nodexl.codeplex.com/

A project from the Social Media Research Foundation: http://www.smrfoundation.org

Think Link! Network Insights with No Programming Skills

Page 2: Think Link: Network Insights with No Programming Skills

About Me

Introductions

Marc A. SmithChief Social ScientistConnected Action Consulting Group

[email protected]://www.connectedaction.nethttp://www.codeplex.com/nodexlhttp://www.twitter.com/marc_smithhttp://www.flickr.com/photos/marc_smithhttp://www.facebook.com/marc.smith.sociologisthttp://www.linkedin.com/in/marcasmithhttp://www.slideshare.net/Marc_A_Smithhttp://www.smrfoundation.org

Page 3: Think Link: Network Insights with No Programming Skills

Social Media Research Foundationhttp://smrfoundation.org

Page 5: Think Link: Network Insights with No Programming Skills

What we are trying to do:Open Tools, Open Data, Open Scholarship

• Build the “Firefox of GraphML” – open tools for collecting and visualizing social media data

• Connect users to network analysis – make network charts as easy as making a pie chart

• Connect researchers to social media data sources• Archive: Be the “Allen Very Large Telescope Array”

for Social Media data – coordinate and aggregate the results of many user’s data collection and analysis

• Create open access research papers & findings• Make “collections of connections” easy for users to

manage

Page 6: Think Link: Network Insights with No Programming Skills

Kodak BrownieSnap-Shot Camera

The first easy to use

point and shoot!

Page 7: Think Link: Network Insights with No Programming Skills

Crowds matter

Page 8: Think Link: Network Insights with No Programming Skills

What we have done: Open Tools

• NodeXL• Data providers (“spigots”)

– ThreadMill Message Board– Exchange Enterprise Email– Voson Hyperlink– SharePoint– Facebook– Twitter– YouTube– Flickr

Page 9: Think Link: Network Insights with No Programming Skills

What we have done: Open Data

• NodeXLGraphGallery.org– User generated collection of

network graphs, datasets and annotations

– Collective repository for the research community

– Published collections of data from a range of social media data sources to help students and researchers connect with data of interest and relevance

Page 10: Think Link: Network Insights with No Programming Skills

What we have done: Open Scholarship

Page 11: Think Link: Network Insights with No Programming Skills

http://www.pewinternet.org/2014/02/20/mapping-twitter-topic-networks-from-polarized-crowds-to-community-clusters/

Page 12: Think Link: Network Insights with No Programming Skills

Social Media (email, Facebook, Twitter, YouTube, and more) is all about connections

from people

to people.

12

Page 13: Think Link: Network Insights with No Programming Skills

Patterns are left behind

13

Page 14: Think Link: Network Insights with No Programming Skills

There are many kinds of ties…. Send, Mention,

http://www.flickr.com/photos/stevendepolo/3254238329

Like, Link, Reply, Rate, Review, Favorite, Friend, Follow, Forward, Edit, Tag, Comment, Check-in…

Internet Verbs!

Page 17: Think Link: Network Insights with No Programming Skills

http://www.flickr.com/photos/fullaperture/81266869/

Strength of Weak ties

Page 20: Think Link: Network Insights with No Programming Skills

World Wide Web

Social media must contain one or more

social networks

Page 21: Think Link: Network Insights with No Programming Skills

Vertex1 Vertex 2 “Edge” Attribute

“Vertex1” Attribute

“Vertex2” Attribute

@UserName1 @UserName2 value value value

A network is born whenever two GUIDs are joined.

Username Attributes

@UserName1 Value, value

Username Attributes

@UserName2 Value, value

A B

Page 22: Think Link: Network Insights with No Programming Skills

NodeXL imports “edges” from social media data sources

Page 23: Think Link: Network Insights with No Programming Skills
Page 24: Think Link: Network Insights with No Programming Skills

Social Networks

• History: from the dawn of time!

• Theory and method: 1934 ->

• Jacob L. Moreno

• http://en.wikipedia.org/wiki/Jacob_L._Moreno

Jacob Moreno’s early social network diagram of positive and negative relationships among members of a football team.

Originally published in Moreno, J. L. (1934). Who shall survive? Washington, DC: Nervous and Mental Disease Publishing Company.

Page 25: Think Link: Network Insights with No Programming Skills

A nearly social network diagram of relationships among workers in a factory illustrates the positions different workers occupy within the workgroup.

Originally published in Roethlisberger, F., and Dickson, W. (1939). Management andthe worker. Cambridge, UK: Cambridge University Press.

Page 26: Think Link: Network Insights with No Programming Skills

Location, Location, Location

Page 27: Think Link: Network Insights with No Programming Skills

Position, Position, Position

Page 28: Think Link: Network Insights with No Programming Skills

https://www.simonsfoundation.org/quanta/20131004-the-mathematical-shape-of-things-to-come/

Page 29: Think Link: Network Insights with No Programming Skills

http://simonsfoundation.s3.amazonaws.com/jwplayer/BigData/Topological_Data_Analysis_Intro.mp4

Page 30: Think Link: Network Insights with No Programming Skills
Page 31: Think Link: Network Insights with No Programming Skills

Introduction to NodeXL

Like MSPaint™ for graphs.— the Community

Page 32: Think Link: Network Insights with No Programming Skills

Now Available

Page 33: Think Link: Network Insights with No Programming Skills

Communities in Cyberspace

Page 34: Think Link: Network Insights with No Programming Skills

Network Analysis Data Flow

PublicationVisualizationAnalysisContainerProviders

Page 35: Think Link: Network Insights with No Programming Skills

http://www.flickr.com/photos/badgopher/3264760070/

Data Providers

Page 36: Think Link: Network Insights with No Programming Skills

Providers

Page 37: Think Link: Network Insights with No Programming Skills

http://www.flickr.com/photos/druclimb/2212572259/in/photostream/

Data Container

Page 38: Think Link: Network Insights with No Programming Skills

Container

Page 39: Think Link: Network Insights with No Programming Skills

Data Analysis

http://www.flickr.com/photos/hchalkley/47839243/

Page 40: Think Link: Network Insights with No Programming Skills

Analysis

Page 41: Think Link: Network Insights with No Programming Skills

Data Visualization

http://www.flickr.com/photos/rvwithtito/4236716778

Page 42: Think Link: Network Insights with No Programming Skills

Visualization

Page 43: Think Link: Network Insights with No Programming Skills

http://www.flickr.com/photos/62693815@N03/6277208708/

Data Publication

Page 44: Think Link: Network Insights with No Programming Skills

Publication

Page 45: Think Link: Network Insights with No Programming Skills

Social Network Maps Reveal

Key influencers in any topic.

Sub-groups.

Bridges.

Page 46: Think Link: Network Insights with No Programming Skills

Hubs

Page 47: Think Link: Network Insights with No Programming Skills

Bridges

Page 48: Think Link: Network Insights with No Programming Skills

Islands

http://www.flickr.com/photos/storm-crypt/3047698741

Page 49: Think Link: Network Insights with No Programming Skills

http://www.flickr.com/photos/library_of_congress/3295494976/sizes/o/in/photostream/

Clusters

Page 50: Think Link: Network Insights with No Programming Skills

http://www.flickr.com/photos/amycgx/3119640267/

Crowds

Page 51: Think Link: Network Insights with No Programming Skills
Page 52: Think Link: Network Insights with No Programming Skills

Welser, Howard T., Eric Gleave, Danyel Fisher, and Marc Smith. 2007. Visualizing the Signatures of Social Roles in Online Discussion Groups. The Journal of Social Structure. 8(2).

Experts and “Answer People”

Discussion starters, Topic setters

Discussion people, Topic setters

Page 53: Think Link: Network Insights with No Programming Skills

Dian

e has

high

de

gree

Heather has high

betweenness

NodeXLNetwork Overview Discovery and Exploration add-in for Excel 2007/2010

A minimal network can illustrate the ways different

locations have different values for centrality and degree

Page 54: Think Link: Network Insights with No Programming Skills
Page 56: Think Link: Network Insights with No Programming Skills

[Divided]Polarized Crowds

[Unified]Tight Crowd

[Fragmented]Brand Clusters

[Clustered]Community Clusters

[In-Hub & Spoke]Broadcast Network

[Out-Hub & Spoke]Support Network

6 kinds of Twitter social media networks

Page 57: Think Link: Network Insights with No Programming Skills

[Divided]Polarized Crowds

[Unified]Tight Crowd

[Fragmented]Brand Clusters

[Clustered]Community Clusters

[In-Hub & Spoke]Broadcast Network

[Out-Hub & Spoke]Support Network

6 kinds of Twitter social media networks

Page 58: Think Link: Network Insights with No Programming Skills

#My2K

Polarized

Page 59: Think Link: Network Insights with No Programming Skills

#CMgrChat

In-group / Community

Page 60: Think Link: Network Insights with No Programming Skills

Lumia

Brand / Public Topic

Page 61: Think Link: Network Insights with No Programming Skills

#FLOTUS

Bazaar

Page 62: Think Link: Network Insights with No Programming Skills

New York Times ArticlePaul Krugman

Broadcast: Audience + Communities

Page 63: Think Link: Network Insights with No Programming Skills

Dell Listens/Dellcares

Support

Page 64: Think Link: Network Insights with No Programming Skills

SNA questions for social media:

1. What does my topic network look like?2. What does the topic I aspire to be look like?3. What is the difference between #1 and #2?4. How does my map change as I intervene?

What does #YourHashtag look like?

Page 65: Think Link: Network Insights with No Programming Skills

pawcon Twitter NodeXL SNA Map and Report for Monday, 17 March 2014 at 15:15 UTC

Page 66: Think Link: Network Insights with No Programming Skills

strataconf Twitter NodeXL SNA Map and Report for 2014-02-11 12-53-27

Top 10 Vertices, Ranked by Betweenness Centrality:

@strataconf@peteskomoroch@acroll@oreillymedia@orthonormalruss@ayirpelle@bigdata@furrier@marketpowerplus@sassoftware

Page 67: Think Link: Network Insights with No Programming Skills

datavis Twitter NodeXL SNA Map and Report for Tuesday, 11 February 2014 at 18:55 UTC

Top 10 Vertices, Ranked by Betweenness Centrality:

@bigpupazzoverde@randal_olson@twitterdata@7of13@yochum@edwardtufte@twittersports@grandjeanmartin@smfrogers@albertocairo

Page 68: Think Link: Network Insights with No Programming Skills
Page 69: Think Link: Network Insights with No Programming Skills

[Divided]Polarized Crowds

[Unified]Tight Crowd

[Fragmented]Brand Clusters

[Clustered]Community Clusters

[In-Hub & Spoke]Broadcast Network

[Out-Hub & Spoke]Support Network

6 kinds of Twitter social media networks

Page 70: Think Link: Network Insights with No Programming Skills

http://www.pewresearch.org/fact-tank/2014/02/20/the-six-types-of-twitter-conversations/

Page 71: Think Link: Network Insights with No Programming Skills

[Divided]Polarized Crowds

[Unified]Tight Crowd

[Fragmented]Brand Clusters

[Clustered]Communities

[In-Hub & Spoke]Broadcast

Network

[Out-Hub & Spoke]Support

Network

[Low probability]Find bridge users.Encourage shared material.

[Low probability]Get message out to disconnected communities.

[Possible transition]Draw in new participants.

[Possible transition]Regularly create content.

[Possible transition]Reply to multiple users.

[Undesirable transition]Remove bridges, highlight divisions.

[Low probability]Get message out to disconnected communities.

[High probability]Draw in new participants.

[Possible transition]Regularly create content.

[Possible transition]Reply to multiple users.

[Undesirable transition]Increase density of connections in two groups.

[Low probability]Dramatically increase density of connections.

[High probability]Increase retention, build connections.

[Possible transition]Regularly create content.

[Possible transition]Reply to multiple users.

[Undesirable transition]Increase density of connections in two groups.

[Low probability]Dramatically increase density of connections.

[Undesirable transition]Increase population, reduce connections.

[Possible transition]Regularly create content.

[Possible transition]Reply to multiple users.

[Undesirable transition]Increase density of connections in two groups.

[Low probability]Dramatically increase density of connections.

[Low probability]Get message out to disconnected communities.

[Possible transition]Increase retention, build connections.

[High probability]Increase reply rate, reply to multiple users.

[Undesirable transition]Increase density of connections in two groups.

[Low probability]Dramatically increase density of connections.

[Possible transition]Get message out to disconnected communities.

[High probability]Increase retention, build connections.

[High probability]Increase publication of new content and regularly create content.

Page 72: Think Link: Network Insights with No Programming Skills

Request your own network map and report

http://connectedaction.net

Page 73: Think Link: Network Insights with No Programming Skills

• Central tenet – Social structure emerges from – the aggregate of relationships (ties) – among members of a population

• Phenomena of interest– Emergence of cliques and clusters – from patterns of relationships– Centrality (core), periphery (isolates), – betweenness

• Methods– Surveys, interviews, observations,

log file analysis, computational analysis of matrices

(Hampton &Wellman, 1999; Paolillo, 2001; Wellman, 2001)

Source: Richards, W. (1986). The NEGOPY network analysis program. Burnaby, BC: Department of Communication, Simon Fraser University. pp.7-16

Social Network Theoryhttp://en.wikipedia.org/wiki/Social_network

Page 74: Think Link: Network Insights with No Programming Skills

SNA 101• Node

– “actor” on which relationships act; 1-mode versus 2-mode networks• Edge

– Relationship connecting nodes; can be directional• Cohesive Sub-Group

– Well-connected group; clique; cluster• Key Metrics

– Centrality (group or individual measure)• Number of direct connections that individuals have with others in the group (usually look at

incoming connections only)• Measure at the individual node or group level

– Cohesion (group measure)• Ease with which a network can connect• Aggregate measure of shortest path between each node pair at network level reflects

average distance– Density (group measure)

• Robustness of the network• Number of connections that exist in the group out of 100% possible

– Betweenness (individual measure)• # shortest paths between each node pair that a node is on• Measure at the individual node level

• Node roles– Peripheral – below average centrality– Central connector – above average centrality– Broker – above average betweenness

E

D

F

A

CB

H

G

I

CD

E

A B D E

Page 75: Think Link: Network Insights with No Programming Skills

NodeXLFree/Open Social Network Analysis add-in for Excel 2007/2010 makes graph

theory as easy as a pie chart, with integrated analysis of social media sources.http://nodexl.codeplex.com

Page 76: Think Link: Network Insights with No Programming Skills

http://www.youtube.com/watch?v=0M3T65Iw3Ac

Nod

eXL

Vide

o

Page 77: Think Link: Network Insights with No Programming Skills

Goal: Make SNA easier

• Existing Social Network Tools are challenging for many novice users

• Tools like Excel are widely used• Leveraging a spreadsheet as a host for SNA

lowers barriers to network data analysis and display

Page 78: Think Link: Network Insights with No Programming Skills

Twitter Network for “Microsoft Research”*BEFORE*

Page 79: Think Link: Network Insights with No Programming Skills

Twitter Network for “Microsoft Research”*AFTER*

Page 80: Think Link: Network Insights with No Programming Skills

Network Motif Simplification

Cody Dunne, University of Maryland

Page 81: Think Link: Network Insights with No Programming Skills

Network Motif Simplification

D-connector (glyph on the right)

D-clique (glyphs for 4, 5, and 6 member cliques below)

Dr. Cody Dunne

Fan(glyph on the right)

Page 82: Think Link: Network Insights with No Programming Skills

NodeXLGraph Gallery

Page 83: Think Link: Network Insights with No Programming Skills

Scholars using NodeXL

• Communications– Katy Pearce– Itai Himelboim

• Business– Scott Dempwolf

• Humanities/Classics– Diane Cline

Page 84: Think Link: Network Insights with No Programming Skills
Page 86: Think Link: Network Insights with No Programming Skills

What is Social Network Analysis? How is it useful for the humanities?

1. New framework for analysis2. Data visualization allows new perspectives – less linear, more comprehensive

Social Network Analysis and Ancient HistoryDiane H. Cline, Ph.D.University of Cincinnati

Page 87: Think Link: Network Insights with No Programming Skills

Strategies for social media engagement based on social media network analysis

Page 88: Think Link: Network Insights with No Programming Skills

NodeXL calculates metrics about networks and content

Page 89: Think Link: Network Insights with No Programming Skills

The Content summary spreadsheet displays the most

frequently used URLs, hashtags, and user names within the

network as a whole and within each calculated sub-group.

Page 90: Think Link: Network Insights with No Programming Skills
Page 91: Think Link: Network Insights with No Programming Skills

NodeXL Ribbon in Excel

Page 92: Think Link: Network Insights with No Programming Skills

NodeXL data import sources

Page 93: Think Link: Network Insights with No Programming Skills

Example NodeXL data importer for Twitter

Page 94: Think Link: Network Insights with No Programming Skills

NodeXL imports “edges” from social media data sources

Page 95: Think Link: Network Insights with No Programming Skills

NodeXL creates a list of “vertices” from imported social media edges

NodeXL displays subgraph images along with network metadata

Page 96: Think Link: Network Insights with No Programming Skills

Automate

NodeXL Automation

makes analysis simple and fast

Perform collections of common operations

with a single click

Page 97: Think Link: Network Insights with No Programming Skills

NodeXL Generates Overall Network Metrics

Page 98: Think Link: Network Insights with No Programming Skills

What we want to do: (Build the tools to) map the social web

• Move NodeXL to the web: (Node[NOT]XL)– Node for Google Doc Spreadsheets? – WebGL Canvas? D3.JS? Sigma.JS

• Connect to more data sources of interest:– RDF, MediaWikis, Gmail, NYT, Citation Networks

• Solve hard network manipulation UI problems:– Modal transform, Time series, Automated layouts

• Grow and maintain archives of social media network data sets for research use.

• Improve network science education:– Workshops on social media network analysis– Live lectures and presentations– Videos and training materials

Page 99: Think Link: Network Insights with No Programming Skills

How you can help

• Sponsor a feature• Sponsor workshops• Sponsor a student• Schedule training• Sponsor the foundation• Donate your money, code, computation, storage,

bandwidth, data or employee’s time• Help promote the work of the Social Media

Research Foundation

Page 100: Think Link: Network Insights with No Programming Skills

Marc A. SmithChief Social ScientistConnected Action Consulting [email protected]://www.connectedaction.nethttp://nodexl.codeplex.com/

A project from the Social Media Research Foundation: http://www.smrfoundation.org

Think Link! Network Insights with No Programming Skills

Page 101: Think Link: Network Insights with No Programming Skills