konect – the koblenz network collection
DESCRIPTION
We present the Koblenz Network Collection (KONECT), a project to collect network datasets in the areas of web science, network science and related areas, as well as provide tools for their analysis. In the cited areas, a surprisingly large number of very heterogeneous data can be modeled as networks and consequently, a unified representation of networks can be used to gain insight into many kinds of problems. Due to the emergence of the World Wide Web in the last decades many such datasets are now openly available. The KONECT project thus has the goal of collecting many diverse network datasets from the Web, and providing a way for their systematic study. The main parts of KONECT are (1)~a collection of over 160 network datasets, consisting of directed, undirected, unipartite, bipartite, weighted, unweighted, signed and temporal networks collected from the Web, (2)~a Matlab toolbox for network analysis and (3)~a website giving a compact overview the various computed statistics and plots. In this paper, we describe KONECT's taxonomy of networks datasets, give an overview of the datasets included, review the supported statistics and plots, and briefly discuss KONECT's role in the area of web science and network science.TRANSCRIPT
KONECTThe Koblenz Network Collection
Jérôme KunegisInstitute for Web Science and Technologies (WeST), University of Koblenz–Landau
With acknowledgments to everyone who has made network datasets available to the public
Jérôme Kunegis KONECT – The Koblenz Network Collection 2
What Is Koblenz?
Jérôme Kunegis KONECT – The Koblenz Network Collection 3
My PhD Thesis
Jérôme Kunegis KONECT – The Koblenz Network Collection 4
The Trick Is…
Jérôme Kunegis KONECT – The Koblenz Network Collection 5
Everything Is a NETWORK !
Jérôme Kunegis KONECT – The Koblenz Network Collection 6
Well, Only Almost Everything Is a Network
Communication
Authorship
Friendship
Interaction
Trust
Co-occurrence
Jérôme Kunegis KONECT – The Koblenz Network Collection 7
A Network Dataset Is Like a Gummi Bear
Jérôme Kunegis KONECT – The Koblenz Network Collection 8
A Network Dataset Is Like a Gummi Bear
Lots of contentto analyse
Evaluate prediction algorithms
Test network models
Test search and recommender
systems
Jérôme Kunegis KONECT – The Koblenz Network Collection 9
When You Have Tested One, You Have Tested All ?!
Jérôme Kunegis KONECT – The Koblenz Network Collection 10
Or Do You?
Jérôme Kunegis KONECT – The Koblenz Network Collection 11
Diversity of Network Datasets
Jérôme Kunegis KONECT – The Koblenz Network Collection 12
Overview
Total: 168 datasets
Jérôme Kunegis KONECT – The Koblenz Network Collection 13
Network Formats
U • Undirected D • Directed B • Bipartite
Jérôme Kunegis KONECT – The Koblenz Network Collection 14
Edge Weight and Multiplicity Types
= • Multiple + • Positive
w > 0
± • Signed
– • Unweighted
w ± 1
* • Rating
★★★★★
** • Multiple Ratings
★★★★★
Jérôme Kunegis KONECT – The Koblenz Network Collection 15
Timestamps
t = 1 t = 2 t = 3
Jérôme Kunegis KONECT – The Koblenz Network Collection 16
Example Dataset
Jérôme Kunegis KONECT – The Koblenz Network Collection 17
Network Comparison
Jérôme Kunegis KONECT – The Koblenz Network Collection 18
Network Comparison: Plots
Jérôme Kunegis KONECT – The Koblenz Network Collection 19
More Plots
Jérôme Kunegis KONECT – The Koblenz Network Collection 20
Download
Jérôme Kunegis KONECT – The Koblenz Network Collection 21
RDF
Jérôme Kunegis KONECT – The Koblenz Network Collection 22
Matlab Toolbox
Jérôme Kunegis KONECT – The Koblenz Network Collection 23
Handbook of Network Analysis
http://konect.uni-koblenz.de/publications