navigation in small worlds social networks: models and applications seminar toronto, fall 2007...

Navigation in small Navigation in small worldsworlds

Social Networks: Models and Applications

Seminar

Toronto, Fall 2007

(based on a presentation by Stratis Ioannidis)

2

the small-world phenomenonthe small-world phenomenon

“most people are linked by short chains of acquaintances”

3

Milgram’s experiment (1960s)Milgram’s experiment (1960s)

► people in Omaha, Nebraska, were each given a letter addressed to a target person in Boston, Massachusetts, along with demographic information (name, address, profession) on this person.

► they were asked to send the letter to the target person, by forwarding it to other people that they knew on a first-name basis, instructing them to do the same.

► median number of hops to get the letter to the target: 6-> six degrees of separation

4

significance of small-world phenomenonsignificance of small-world phenomenon

► qualitatively similar results by subsequent experiments on e.g. [Dodds et al. ‘03]

► small-world phenomenon also appears in other networks: powergrid actor collaboration graph WWW neural network of C. elegans semantic networks of languages food webs …

5

modeling the small-world phenomenonmodeling the small-world phenomenon

► small-world network model:1. short paths between almost all pairs of nodes2. small node degree (on average)3. locally clustered:

a node’s neighbors are likely to be neighbors of each other

► a graph selected at random from all n-node graphs where each node has degree =3, has diameter O(logn), whp

► but, does not satisfy clustering requirement

6

[Watts-Strogats ‘98]’s model[Watts-Strogats ‘98]’s model

► d-dimensional lattice of nd nodes [here, d=2]► for each node u:

local edges to nodes v, s.t. dist ρ(u,v) ≤ p [p=2] long-range directed edges to q random nodes selected

independently & uniformly over all nodes [q=3]► expected diameter O(logn)

7

[Kleinberg’00]: a new perspective [Kleinberg’00]: a new perspective on Milgram’s experimenton Milgram’s experiment

► “short paths not only exist, but can be found by individuals using only local information !”

► proposed a simple extension to Watts-Strogats’ model► used that to demonstrated that:

ability to route efficiently with local information ≠ network diameter

this ability is affected by the correlation between local structure and long-range connections; efficient routing is possible only when this correlation is near a critical threshold; as we move away from this threshold routing deteriorates rapidly.

8

Kleinberg’s (grid-based) modelKleinberg’s (grid-based) model

extends model of [Watts-Strogats ‘98]:► d-dimensional lattice of nd nodes► for each node u:

local edges to nodes v, s.t. dist ρ(u,v) ≤ p long-range directed edges to q random nodes selected

independently & uniformly over all nodes

s.t. Pr(u->v) ~ ρ(u,v)-a

► a: concentration of long-range neighbors around u a: small connections close to uniformly random

-> a = 0 [Watts-Strogats ‘98] model a: large strong preference for close connections

-> a = ∞ long-range neighbors = local neighbors

9

decentralized algorithmsdecentralized algorithms

decentralized algorithm for transmitting messages:► at each step the holder u of the message passes it to one of

its neighbors (local or long-range)► u knows only

the underlying grid structure the location of the target on the lattice the location and the long-range neighbors of all nodes that have

touched the message so far

delivery time T:► expected number of steps to forward a message from a

random source to a random target

10

Kleinberg’s resultsKleinberg’s results

when d = 21. for 0 ≤ a < 2, any decentralized algorithm has T = Ω(n(2-a)/3)2. for a = 2, there is a decentralized algorithm s.t. T = O(log2n)3. for a > 2, any decentralized algorithm has T = Ω(n(a-2)/(a-1))

can be extended for d ≠ 2, with 0 ≤ a < d, a = d, and a > d, respectively

the upper bound when a = 2 is achieved by greedy algorithm:► a node forwards a message for t to its neighbor v such that

ρ(v,t) is minimum

corresponding diameter results [Martel-Nguyen ’04+’05]:

1. for a ≤ d, Θ(logn)2. for d < a < 2d, Polylog(n)3. for a = 2d, ??4. for a > 2d, Poly(n)

11

outline of proof of the upper boundoutline of proof of the upper bound

► a = 2, p = q = 1► in each step, the dist. from current node u to target t is halved

with prob. ~1/logn [Ω(1/logn)]► so, the expected number of steps until from u we reach a

node u’ such that ρ(u’,t) ≤ ρ(u,t)/2 is at most ~logn[O(logn)]

► the target is reached after at most logn+1 halvings, so, in ~log2n expected steps [O(log2n)]

► crucial property of a = 2:it produces long-range neighbors approx. uniformly distributed over “distance scales”: for u’s long-range neighbor v, the probability that 2j≤ρ(u,v) ≤2j+1, is the same for all j

13

outline of proof of lower boundsoutline of proof of lower bounds

► a = 0, p = q = 1► Let U: set of nodes w s.t. ρ(w,t) < n2/3

► |U|~ n4/3

► Prob(s U) ~ |U|/n2 ~ 1/n2/3 -> almost certainly s U

► if s U and no node u in the path to t has a long-range neighbor in U, then the number of steps to t are ≥ n2/3

► for any u, Prob(u->U) = |U|/n2 = 1/n2/3

-> starting from s U, the expected number of steps to reach a node with a long-range neighbor in U is ~ 1/ Prob(u->U) = n2/3

► expected number of steps to t is ≥ n2/3

► a = ∞, p = q = 1► the random graph is the grid; expected number of steps ~n

14

hierarchical model [Kleinberg 01]hierarchical model [Kleinberg 01]

ρ(u,v) = 2

vu

► natural model for categorizing occupation, web pages,…► ρ(u,v): height of lowest common ancestor of u,v► polylog(n) long-range neighbors from distr ~ b-aρ(u,v);

efficient routing only for a = 1► [Watts et al. 02]: many indep. trees; ρ the min of dist. in any tree

b = 3

15

group-based model [Kleinberg 01]group-based model [Kleinberg 01]

► set of groups {S1,S2,…}

“bounded growth”: if Sj,Sk,… have sizes < g and all contain u, their union’s size is O(g)

► ρ(u,v): size of smallest group containing u,v► polylog(n) long-range neighbors from distr ~ ρ(u,v)-a;

efficient routing only for a = 1 (and a > 1 in some cases)

vuρ(u,v) = 6

16

rank-based model [Liben-Nowel 05]rank-based model [Liben-Nowel 05]

► based on data from LiveJournal► variation of grid-based model to handle non-uniformity► each lattice point has ≥1 people associated with it► local edges to one of people in each neighboring lattice point► long-range edges to random nodes selected from distr.

~1/ranku(v)

ranku(v): rank of v when nodes sorted in increasing dist. from u

► delivery time (to lattice point) O(log3n)

17

related resultsrelated results

► decentralized search with additional information:a node may “consult” a small number of nearby nodes for free [Lebhar-Schabanel ‘04]: paths of O(logn(loglogn)2) steps with

O(log2n) nodes consulted [Fraigniaud et al.’05], [Martel-Nguyen ’04]: paths of O(log3/2n) steps

by consulting neighborhood of size log(n) of current node [Manku et al.’04]: neighbor of neighbor approach: optimal for some

settings

► alternative distributions for choosing long-range neighbors:can we improve routing by choosing long-range neighbors from a distribution other than ~1/ρa

allowing variation in node degrees allowing dependence between long-range neighbors of same node?

- (almost certainly) no: [Aspnes et al. ‘02], [Flamini et al.’05], [Giakkoupis-Hadzilacos ‘07], [Woelfel ’08?]

- what if we make edges to long-range neighbors bidirectional?

18

related resultsrelated results

► small-world networks on arbitrary underlying graphs: is it possible to augment any graph such that greedy routing is efficient ? (greedy: wrt initial graph) [Fraigniaud ‘05]: yes, for graphs of bounded tree-width [Duchon et al.’06]: yes, for bounded growth rate [Slivkin ‘05]: yes, for doubling dimension O(loglonn) [Fraigniaud ‘05]: no, for doubling dimension >> loglonn

19

applicationsapplications

► peer-to-peer networks file sharing

► searching the web focused web crawling

► sensor networks► on-line communities

20

Thank you!

navigation in small worlds social networks: models and applications seminar toronto, fall 2007...

Documents