a comparative study of social network analysis tools
DESCRIPTION
TRANSCRIPT
![Page 1: A comparative study of social network analysis tools](https://reader036.vdocument.in/reader036/viewer/2022070304/54c643724a7959ba0b8b464f/html5/thumbnails/1.jpg)
Membre deMembre de
A comparative study of social network analysis tools
David Combe, Christine Largeron, Előd Egyed-Zsigmond and Mathias Géry
International Workshop on Web Intelligence and Virtual Enterprises 2 (2010)
![Page 2: A comparative study of social network analysis tools](https://reader036.vdocument.in/reader036/viewer/2022070304/54c643724a7959ba0b8b464f/html5/thumbnails/2.jpg)
2/26
Context: social networks and analysis software
Expected functionalities of network analysis software
Benchmark
Conclusion
Outline
![Page 3: A comparative study of social network analysis tools](https://reader036.vdocument.in/reader036/viewer/2022070304/54c643724a7959ba0b8b464f/html5/thumbnails/3.jpg)
3/26
Context
Definition (Wikipedia)A social network is a social structure made up of individuals called "nodes," which are tied by one or more specific types of interdependency, such as friendship, common interest, etc.
Sociologic analysis▫ Sociological works (Moreno 1934, Milgram 1967,
Cartwright and Harary, 1977)▫ Web 2.0 : Renewed interest from the Web based social
networks websites development.
Context
![Page 4: A comparative study of social network analysis tools](https://reader036.vdocument.in/reader036/viewer/2022070304/54c643724a7959ba0b8b464f/html5/thumbnails/4.jpg)
4/26
Context: Social network in business
•For the Gartner Institute:▫“By 2014, social networking services will replace e-
mail as the primary vehicle for interpersonal communications for 20 percent of business users.” (Gartner 2008)
▫Social network analysis is getting mature.
•Some applications in business:▫Workflow study to adapt management to the real flow in
a company;▫ Identify key actors, ie. for viral marketing.
•These applications need adapted software.
Context
![Page 5: A comparative study of social network analysis tools](https://reader036.vdocument.in/reader036/viewer/2022070304/54c643724a7959ba0b8b464f/html5/thumbnails/5.jpg)
5/26
Context: social networks and analysis software
•Network analysis software▫A previous statistical analysis oriented survey
(Huisman & Van Duijn, 2003)
•Networks and needs are changing Size Complex graphs
▫Necessity to make a new benchmark
Context
![Page 6: A comparative study of social network analysis tools](https://reader036.vdocument.in/reader036/viewer/2022070304/54c643724a7959ba0b8b464f/html5/thumbnails/6.jpg)
6/26
Context
Expected functionalities of network analysis softwareBenchmarkConclusion
![Page 7: A comparative study of social network analysis tools](https://reader036.vdocument.in/reader036/viewer/2022070304/54c643724a7959ba0b8b464f/html5/thumbnails/7.jpg)
7/26
Expected functionalities of network analysis software
1. Representation
2. Visualization
3. Characterization by indicators
4. Community detection
Expected functionalities of network analysis software
![Page 8: A comparative study of social network analysis tools](https://reader036.vdocument.in/reader036/viewer/2022070304/54c643724a7959ba0b8b464f/html5/thumbnails/8.jpg)
8/26
1. Network representation as graph(Cartwright and Harary, 1977)•Link orientation
▫Undirected links (edges, ex: co-authorship)▫Directed (arcs, ex: e-mails sent, Enron dataset)
•Weight on edges
•With typed nodes(ex. bipartite network)
Expected functionalities of network analysis software
3
3
21
![Page 9: A comparative study of social network analysis tools](https://reader036.vdocument.in/reader036/viewer/2022070304/54c643724a7959ba0b8b464f/html5/thumbnails/9.jpg)
9/26
1. Network representation as graph
*Vertices 5*Edges1 21 42 32 43 43 54 5
Expected functionalities of network analysis software
Connections
(.net file format)
2
4 3
5
1
1 2 3 4 5
1 0 1 0 1 0
2 1 0 1 1 0
3 0 1 0 1 1
4 1 1 1 0 1
5 0 0 1 1 0
Adjacency matrix
1 2, 4
2 1, 2, 4
3 2, 4, 5
4 2, 3, 5
5 3, 4
Adjacency list
Edge list
![Page 10: A comparative study of social network analysis tools](https://reader036.vdocument.in/reader036/viewer/2022070304/54c643724a7959ba0b8b464f/html5/thumbnails/10.jpg)
10/26Random layout F-R convergence
2. Visualization
Aim: give a visual representation of the graph, with different approaches:•Fish eye
Centered on an actor
•Force driven visualization layouts▫Fruchterman Reingold (1984)
Iterative algorithm
Expected functionalities of network analysis software
![Page 11: A comparative study of social network analysis tools](https://reader036.vdocument.in/reader036/viewer/2022070304/54c643724a7959ba0b8b464f/html5/thumbnails/11.jpg)
11/26
3. Characterization by indicators• Global indicators at network level by:
▫ Number of nodes▫ Number of edges▫ Diameter▫ …
• Local indicators at node level:▫ Number of neighboors degree▫ …
• Distance▫ Length of the shortest path
Expected functionalities of network analysis software
Density
2
4 3
5
1
4
2
5
![Page 12: A comparative study of social network analysis tools](https://reader036.vdocument.in/reader036/viewer/2022070304/54c643724a7959ba0b8b464f/html5/thumbnails/12.jpg)
12/26
3. Characterization by indicators : how to decide if an actor is « central »?
•Many ways to determine central actors.
•Ex: Betweenness centrality▫ Which node is the most likely to
be an intermediary for a random communication?
▫ higher betweenness centrality
• Selection depends on what they are needed for.
Expected functionalities of network analysis software
![Page 13: A comparative study of social network analysis tools](https://reader036.vdocument.in/reader036/viewer/2022070304/54c643724a7959ba0b8b464f/html5/thumbnails/13.jpg)
13/26
4. Community detection
•Community:▫A set of actors having
strong connexions.•Community detection
algorithms▫Newman–Girvan (Newman
and Girvan, 2002)▫Walktrap (Latapy & Pons,
2005)
Expected functionalities of network analysis software
![Page 14: A comparative study of social network analysis tools](https://reader036.vdocument.in/reader036/viewer/2022070304/54c643724a7959ba0b8b464f/html5/thumbnails/14.jpg)
14/26
ContextExpected functionalities of network analysis software
BenchmarkConclusion
![Page 15: A comparative study of social network analysis tools](https://reader036.vdocument.in/reader036/viewer/2022070304/54c643724a7959ba0b8b464f/html5/thumbnails/15.jpg)
15/26
Benchmark methodology• Required points:
▫A social network analysis point of view▫Scalability▫Free for educational purposes
• A balance between well established software and newer ones, based on recent development standards (ergonomics, modularity and data portability).
• Datasets: Zachary’s karate-club, DBLP
Benchmark
![Page 16: A comparative study of social network analysis tools](https://reader036.vdocument.in/reader036/viewer/2022070304/54c643724a7959ba0b8b464f/html5/thumbnails/16.jpg)
16/26
Software comparison criteria
Input/output formats
Custom attribute handling
Bipartite graphs specific functions
Longitudinal analysis
Visualization
Indicators
Community detection
Benchmark
![Page 17: A comparative study of social network analysis tools](https://reader036.vdocument.in/reader036/viewer/2022070304/54c643724a7959ba0b8b464f/html5/thumbnails/17.jpg)
17/26
Studied software• Gephi is an “interactive visualization and exploration platform”.
• GUESS is dedicated to visualization purposes, with several layouts.
• Tulip can handle over 1 million vertices and 4 millions edges. It has visualization, clustering and extension by plug-ins capabilities.
• GraphViz is mainly for graph visualization.
• UCInet is not free. It uses Pajek and Netdraw for visualization. It is specialized in statistical and matricial analysis. It calculates indicators (such as triad census, Freeman betweenness) and performs hierarchical clustering.
• Pajek is a Windows program for analysis and visualization of large networks. It is freely available, for noncommercial use.
• igraph is a free software package for creating and manipulating graphs. It also implements algorithms for some recent network analysis methods.
• NetworkX is a package for the creation, manipulation, and study of the structure, dynamics, and functions of complex networks.
• JUNG, for Java Universal Network/Graph Framework, is mainly developed for creating interactive graphs in Java GUIs, JUNG has been extended with some SNA metrics.
Benchmark
![Page 18: A comparative study of social network analysis tools](https://reader036.vdocument.in/reader036/viewer/2022070304/54c643724a7959ba0b8b464f/html5/thumbnails/18.jpg)
18/26
Selected software
•Stand-alone software▫Pajek http://pajek.imfm.si/doku.php▫Gephi http://gephi.org/
•Libraries▫ igraph http://igraph.sourceforge.net/▫NetworkX http://networkx.lanl.gov/
Benchmark
![Page 19: A comparative study of social network analysis tools](https://reader036.vdocument.in/reader036/viewer/2022070304/54c643724a7959ba0b8b464f/html5/thumbnails/19.jpg)
19/26
Pajek (Vladimir Batagelj and Andrej Mrvar)
• Development started in 1996• Data mining oriented• Many graph operators
available• Fast• Exports 3D visualization• Macro• Supports matrices,
adjacency lists and arcs lists oriented input files
Benchmark
![Page 20: A comparative study of social network analysis tools](https://reader036.vdocument.in/reader036/viewer/2022070304/54c643724a7959ba0b8b464f/html5/thumbnails/20.jpg)
20/26
Gephi (Bastian M., Heymann S., Jacomy M.)
Benchmark
• Development started in 2008• Interactive GUI• Uses Java• Recent scriptability improvements• « Photoshop for graphs » with
customizable visualization• Supports the main file formats for
networks• Improvable by plugins• Community detection still
experimental
![Page 21: A comparative study of social network analysis tools](https://reader036.vdocument.in/reader036/viewer/2022070304/54c643724a7959ba0b8b464f/html5/thumbnails/21.jpg)
21/26
NetworkX (Brandes U., Erlebach T.)
•Python•Bipartite graphs ready•Attribute-friendly•1,000,000 nodes wide
networks can be handled.•Lacks in community
detection algorithms•Relies on other software for
visualization
Benchmark
>>> import networkx as nx>>> G=nx.Graph() >>> G.add_node("spam") >>> G.add_edge(1,2) >>> print(G.nodes()) [1, 2, 'spam'] >>> print(G.edges()) [(1, 2)]>>> G.degree(1) 1
![Page 22: A comparative study of social network analysis tools](https://reader036.vdocument.in/reader036/viewer/2022070304/54c643724a7959ba0b8b464f/html5/thumbnails/22.jpg)
22/26
Igraph (Csárdi G., Nepusz T.)
•For R (a statistical environment) and Python. The low level routines are written in C.
•GUI available for R.•Community detection
ready.•Not custom attributes-
friendly
Benchmark
> g <- graph.ring(10)> degree(g) [1] 2 2 2 2 2 2 2 2 2 2> g2 <- erdos.renyi.game(1000, 10/1000)> degree.distribution(g2) [1] 0.000 0.000 0.002 0.009 0.020 0.039 0.064 0.107 0.111 0.115 0.118…[21] 0.003 0.001
![Page 23: A comparative study of social network analysis tools](https://reader036.vdocument.in/reader036/viewer/2022070304/54c643724a7959ba0b8b464f/html5/thumbnails/23.jpg)
23/26
Benchmark
How to choose the right tool?Pajek Gephi NetworkX igraph
Input/output + ++ + +
Attribute handling + + ++ - -
Bipartite graphs + - + +
Temporality + + + -
Visualization ++ ++ - ++
Indicators + + ++ ++
Clustering + - - - - ++++ Mature functionality - - Not available or weak
![Page 24: A comparative study of social network analysis tools](https://reader036.vdocument.in/reader036/viewer/2022070304/54c643724a7959ba0b8b464f/html5/thumbnails/24.jpg)
24/26
Feature comparison
Benchmark
Input / output
Visualization
IndicatorsBipartite
Clustering
Temporality
Attribute handling
igraph
Pajek
NetworkX
Gephi
![Page 25: A comparative study of social network analysis tools](https://reader036.vdocument.in/reader036/viewer/2022070304/54c643724a7959ba0b8b464f/html5/thumbnails/25.jpg)
25/26
ContextExpected functionalities of network analysis softwareBenchmark
Conclusion
![Page 26: A comparative study of social network analysis tools](https://reader036.vdocument.in/reader036/viewer/2022070304/54c643724a7959ba0b8b464f/html5/thumbnails/26.jpg)
26/26
Conclusion
•Many domains, many approaches, many software (sociology, computer science, mathematics and physics).
•Functionalities to develop in the future (e.g. for decision support):▫Temporality awareness▫Links and nodes attributes analysis▫Hierarchical graphs
Conclusion
![Page 27: A comparative study of social network analysis tools](https://reader036.vdocument.in/reader036/viewer/2022070304/54c643724a7959ba0b8b464f/html5/thumbnails/27.jpg)
27/26
Thank you for your attention.Any questions ?
![Page 28: A comparative study of social network analysis tools](https://reader036.vdocument.in/reader036/viewer/2022070304/54c643724a7959ba0b8b464f/html5/thumbnails/28.jpg)
28/26
Bibliography• Gartner http://www.gartner.com/it/page.jsp?id=1293114• Gartner Hype Cycle for Social Software, 2008• Fortunato, S. (2009). Community detection in graphs. Physics Reports, 103.
Retrieved from http://arxiv.org/abs/0906.0612.Pons, P., & Latapy, M. (2005). Computing communities in large networks using random walks. Computer and Information Sciences-ISCIS 2005. Retrieved from http://www.springerlink.com/index/P312811313637372.pdf.
• Newman, M., & Girvan, M. (2004). Finding and evaluating community structure in networks. Physical review E. Retrieved from http://link.aps.org/doi/10.1103/PhysRevE.69.026113.
• Kamada, T., & Kawai, S. (1989). An algorithm for drawing general undirected graphs. Information processing letters, 31(12), 7--15. Retrieved from http://linkinghub.elsevier.com/retrieve/pii/0020019089901026.
![Page 29: A comparative study of social network analysis tools](https://reader036.vdocument.in/reader036/viewer/2022070304/54c643724a7959ba0b8b464f/html5/thumbnails/29.jpg)
29/26
Bibliography (2)• Brin, S., & Page, L. (1998). The anatomy of a large-scale
hypertextual Web search engine* 1. Computer networks and ISDN systems. Retrieved from http://linkinghub.elsevier.com/retrieve/pii/S016975529800110X.
• Fruchterman, T. M., & Reingold, E. M. (1991). Graph Drawing by Force-directed Placement. Huisman, M., & Van Duijn, M. (2003). Software for social network analysis. In Models and methods in social network analysis (p. 270–316).
• Freeman, L. (1979). Centrality in Social Networks Conceptual Clarification. Social Networks.