models and algorithms for complex networksweb.stanford.edu/group/mmds/slides2008/mihail.pdfmilena...

39
1 Milena Mihail Georgia Tech. “with network elements maintaining characteristic profilesModels and Algorithms for Complex Networks “with categorical attributes” [C. Faloutsos MMDS08] “with parametrization, typically localwith Stephen Young, Gagan Goel, Giorgos Amanatidis, Bradley Green, Christos Gkantsidis, Amin Saberi, D. Bader, T. Feder, C. Papadimitriou, P. Tetali, E. Zegura

Upload: others

Post on 23-Feb-2020

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Models and Algorithms for Complex Networksweb.stanford.edu/group/mmds/slides2008/mihail.pdfMilena Mihail Georgia Tech. “with network elements maintaining characteristic profiles”

1

Milena Mihail

Georgia Tech.

“with network elements maintaining characteristic profiles”

Models and Algorithms for Complex Networks

“with categorical attributes”[C. Faloutsos MMDS08]

“with parametrization, typically local”

with

Stephen Young, Gagan Goel, Giorgos Amanatidis, Bradley Green,

Christos Gkantsidis, Amin Saberi,

D. Bader, T. Feder, C. Papadimitriou, P. Tetali, E. Zegura

Page 2: Models and Algorithms for Complex Networksweb.stanford.edu/group/mmds/slides2008/mihail.pdfMilena Mihail Georgia Tech. “with network elements maintaining characteristic profiles”

Talk Outline

1. Structural/Syntactic Flexible Models

2. Semantic Flexible Models

2

Flexible (further parametrized) Models

Models & Algorithms Connection : Kleinberg‟s Model(s) for Navigation

Distributed Searching Algorithms with Additional Local Info/Dynamics

Complex Networks

in Web N.0

1. On the Power of Local Replication

2. On the Power of Topology Awareness via Link Criticality

Conclusion : Web N.0 Model & Algorithm characteristics:

further parametrization, typically local,

locality of info in algorithms & dynamics.

Dynamics become especially important.

Page 3: Models and Algorithms for Complex Networksweb.stanford.edu/group/mmds/slides2008/mihail.pdfMilena Mihail Georgia Tech. “with network elements maintaining characteristic profiles”

The Internet is constantly growing and evolving giving

rise to new models and algorithmic questions. 3

degree

4 102 100

freq

uen

cy

, but

no sharp concentration:

Erdos-Renyi

Sparse graphs

with large degree-variance.

“Power-law” degree distributions.

Small-world, i.e. small diameter,

high clustering coefficients.

scaling Web N.0

Page 4: Models and Algorithms for Complex Networksweb.stanford.edu/group/mmds/slides2008/mihail.pdfMilena Mihail Georgia Tech. “with network elements maintaining characteristic profiles”

Global Routing Network

(Autonomous Systems)

A rich theory of power-law random graphs

has been developed [ Evolutionary, Configurational Models, &

e.g. see Rick Durrett‟s ‟07 book ].

degree

4 102 100

freq

uen

cy

, but

no sharp concentration:

Erdos-Renyi

Sparse graphs

with large degree-variance.

“Power-law” degree distributions.

Small-world, i.e. small diameter,

high clustering coefficients.

However, in practice, there are discrepancies …

Flickr

Network

Friendship

Network Patent Collaboration

Network (in Boston)

Random Graph

with same degrees

Local Routing Network

with same Degrees

4

Page 5: Models and Algorithms for Complex Networksweb.stanford.edu/group/mmds/slides2008/mihail.pdfMilena Mihail Georgia Tech. “with network elements maintaining characteristic profiles”

5

“Flexible” models for complex networks:

exhibit a “large” increase in the

properties of generated graphs

by introducing a “small” extension in the

parameters of the generating model.

Page 6: Models and Algorithms for Complex Networksweb.stanford.edu/group/mmds/slides2008/mihail.pdfMilena Mihail Georgia Tech. “with network elements maintaining characteristic profiles”

Case 1: Structural/Syntactic Flexible Model

The networking community proposed that [Sigcomm 04, CCR 06 and Sigcomm 06],

beyond the degree sequence ,

models for networks of routers should capture

how many nodes of degree are connected to nodes of degree .

Assortativity:

small large

6

Modifications and Generalizations

of Erdos-Gallai / Havel-Hakimi

Models with power law and arbitrary degree sequences

with additional constraints,

such as specified joint degree distributions

(from random graphs, to graphs with very low entropy).

Page 7: Models and Algorithms for Complex Networksweb.stanford.edu/group/mmds/slides2008/mihail.pdfMilena Mihail Georgia Tech. “with network elements maintaining characteristic profiles”

Networking Proposition [CCR 06, Sigcomm 06]:

A real highly optimized network G.A random graph with same

average degree as G.

A random graph with same

degree sequence as G.A graph with same number of links

between nodes of degree and as G. 7

Page 8: Models and Algorithms for Complex Networksweb.stanford.edu/group/mmds/slides2008/mihail.pdfMilena Mihail Georgia Tech. “with network elements maintaining characteristic profiles”

connected,

mincost,

random

The (well studied) Degree Sequence Realization Problem is:

Definitions

The Joint-Degree Matrix Realization Problem is:connected,

mincost,

random

8

Page 9: Models and Algorithms for Complex Networksweb.stanford.edu/group/mmds/slides2008/mihail.pdfMilena Mihail Georgia Tech. “with network elements maintaining characteristic profiles”

Theorem [Amanatidis, Green, M „08]: The natural necessary conditions for an instance

to have a realization are also sufficient (and have a short description).

The natural necessary conditions for an instance

to have a connected realization are also sufficient (no known short

description).

There are polynomial time algorithms to construct

a realization and a connected realization of ,

or produce a certificate that such a realization does not exist.

The Joint-Degree Matrix Realization Problem is:

connected,

mincost,

random

9

Page 10: Models and Algorithms for Complex Networksweb.stanford.edu/group/mmds/slides2008/mihail.pdfMilena Mihail Georgia Tech. “with network elements maintaining characteristic profiles”

10

Reduction to

perfect matching:

2

Given arbitrary

Is this degree sequence realizable ?

If so, construct a realization.

Degree Sequence Realization Problem:Advantages: Flexibility in:

enforcing or precluding certain edges,

adding costs on edges and finding mincost realizations,

close to matching close to sampling/random generation.

connected,

mincost,

random

Page 11: Models and Algorithms for Complex Networksweb.stanford.edu/group/mmds/slides2008/mihail.pdfMilena Mihail Georgia Tech. “with network elements maintaining characteristic profiles”

4

3

2

2

1

11

[ Havel-Hakimi ] Construction:

Greedy: any unsatisfied vertex

is connected with the vertices of

highest remaining degree requirements.

032

2

0

0 0

1

0

1

0

0

Connectivity, if possible,

attained with 2-switches .

Theorem [Erdos-Gallai]:

A degree sequence is realizable

iff the natural necessary condition holds:

moreover, there is a connected realization

iff the natural necessary condition holds:

Note :all 2-switches are legal . 11

Page 12: Models and Algorithms for Complex Networksweb.stanford.edu/group/mmds/slides2008/mihail.pdfMilena Mihail Georgia Tech. “with network elements maintaining characteristic profiles”

Theorem, Joint Degree Matrix Realization [Amanatidis, Green, M „08]:

Proof [sketch]:

12

Page 13: Models and Algorithms for Complex Networksweb.stanford.edu/group/mmds/slides2008/mihail.pdfMilena Mihail Georgia Tech. “with network elements maintaining characteristic profiles”

Balanced Degree Invariant:

Example Case Maintaining

Balanced Degree Invariant:

deletedelete

add

add add

Note: This may NOT be a

simple “augmenting” path.

Page 14: Models and Algorithms for Complex Networksweb.stanford.edu/group/mmds/slides2008/mihail.pdfMilena Mihail Georgia Tech. “with network elements maintaining characteristic profiles”

Theorem, Joint Degree Matrix Connected Realization

[Amanatidis, Green, M „08]:

Proof [remarks]: Main Difficulty: Two connected components

are amenable to rewiring by 2-switches,

only using two vertices of the same degree.

connected

component

connected

component

The algorithm explores

vertices of the same degree

in different components,

in a recursive manner.14

Page 15: Models and Algorithms for Complex Networksweb.stanford.edu/group/mmds/slides2008/mihail.pdfMilena Mihail Georgia Tech. “with network elements maintaining characteristic profiles”

Open Problems for Joint Degree Matrix Realization

Construct mincost and/or random realization,

or connected realization.

Satisfy constraints between arbitrary subsets of vertices.

Is there a reduction to matchings or flow or

some other well understood combinatorial problem?

Is there evidence of hardness ?

Is there a simple generative model for graphs

with low assortativity ? ( explanatory or other …)

15

Page 16: Models and Algorithms for Complex Networksweb.stanford.edu/group/mmds/slides2008/mihail.pdfMilena Mihail Georgia Tech. “with network elements maintaining characteristic profiles”

Talk Outline

1. Structural/Syntactic Flexible Models

2. Semantic Flexible Models

ODD

Flexible (further parametrized) Models

Models & Algorithms Connection : Kleinberg‟s Model(s) for Navigation

Distributed Searching Algorithms with Additional Local Info/Dynamics

Complex Networks

in Web N.0

1. On the Power of Local Replication

2. On the Power of Topology Awareness via Link Criticality

Conclusion : Web N.0 Model & Algorithm characteristics:

further parametrization, typically local,

locality of info in algorithms & dynamics.

Dynamics become especially important.

Page 17: Models and Algorithms for Complex Networksweb.stanford.edu/group/mmds/slides2008/mihail.pdfMilena Mihail Georgia Tech. “with network elements maintaining characteristic profiles”

Case 2: Semantic Flexible Model(s)

Flickr

Network

Friendship

Network Patent Collaboration

Network (in Boston)

Also densification, shrinking diameter, … C. Faloutsos, MMDS08

Varying structural characteristics

Models with semantics on nodes,

and links among nodes with semantic proximity

generated by very general probability distributions.

RANDOM DOT PRODUCT GRAPHS

KRONECKER GRAPHS

Generalizations of

Erdos-Renyi random graphs

16

Page 18: Models and Algorithms for Complex Networksweb.stanford.edu/group/mmds/slides2008/mihail.pdfMilena Mihail Georgia Tech. “with network elements maintaining characteristic profiles”

RANDOM DOT PRODUCT GRAPHSKratzl,Nickel,Scheinerman 05

Young,Scheinerman 07

Young,M 08

17

Page 19: Models and Algorithms for Complex Networksweb.stanford.edu/group/mmds/slides2008/mihail.pdfMilena Mihail Georgia Tech. “with network elements maintaining characteristic profiles”

very

likely

somewhat

likely

unlikely

never

18

Page 20: Models and Algorithms for Complex Networksweb.stanford.edu/group/mmds/slides2008/mihail.pdfMilena Mihail Georgia Tech. “with network elements maintaining characteristic profiles”

SUMMARY OF RESULTS

A semi-closed formula for degree distribution

Model can generate graphs with a wide variety of densities

average degrees up to .

and wide varieties of degree distributions, including power-laws.

Diameter characterization :

Determined by Erdos-Renyi for similar average density,

if all coordinates of X are in (will say more about this).

Positive clustering coefficient,

depending on the “distance” of the generating distribution

from the uniform distribution.

Remark: Power-laws and the small world phenomenon

are the hallmark of complex networks.19

Page 21: Models and Algorithms for Complex Networksweb.stanford.edu/group/mmds/slides2008/mihail.pdfMilena Mihail Georgia Tech. “with network elements maintaining characteristic profiles”

Theorem [Young, M ‟08]

A Semi-closed Formula for Degree Distribution

Theorem ( removing error terms) [Young, M ‟08]

20

Page 22: Models and Algorithms for Complex Networksweb.stanford.edu/group/mmds/slides2008/mihail.pdfMilena Mihail Georgia Tech. “with network elements maintaining characteristic profiles”

Example:

(a wide range of degrees,

except for very large degrees)

This is in agreement with real data.

21

Page 23: Models and Algorithms for Complex Networksweb.stanford.edu/group/mmds/slides2008/mihail.pdfMilena Mihail Georgia Tech. “with network elements maintaining characteristic profiles”

Theorem [Young, M ‟08]

Diameter Characterization

Remark: If the graph can become disconnected.

It is important to obtain characterizations of connectivity as

approaches .This would enhance model flexibility22

Page 24: Models and Algorithms for Complex Networksweb.stanford.edu/group/mmds/slides2008/mihail.pdfMilena Mihail Georgia Tech. “with network elements maintaining characteristic profiles”

Clustering Characterization

Theorem [Young, M ‟08]

Remarks on the proof

23

Page 25: Models and Algorithms for Complex Networksweb.stanford.edu/group/mmds/slides2008/mihail.pdfMilena Mihail Georgia Tech. “with network elements maintaining characteristic profiles”

24

Open Problems for Random Dot Product Graphs

Fit real data, and isolate “benchmark” distributions .

Characterize connectivity (diameter and conductance)

as approaches .

Do/which further properties of X

characterize further properties of ?

Evolution: X as a function of n ?

(including: two connected vertices with small similarity,

either disconnect, or increase their similarity).

Should/can ?

Similarity functions beyond inner product (e.g. Kernel functions).

Algorithms: navigability, information/virus propagation, etc.

Page 26: Models and Algorithms for Complex Networksweb.stanford.edu/group/mmds/slides2008/mihail.pdfMilena Mihail Georgia Tech. “with network elements maintaining characteristic profiles”

KRONECKER GRAPHS [Faloutsos, Kleinberg,Leskovec 06]

1

0

1

1

0

0

0

0

1

0

1

1

1

0

1

1

1

0

1

1

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

0

1

0

1

1

1

0

1

1

1

0

1

1

0

0

0

0

1

0

1

1

1

0

1

1

1

0

1

1

0

0

0

0

1

0

1

1

1

0

1

1

1

0

1

1

Another “semantic” “ flexible” model, introducing parametrization.

1-bit

vertex

character-

ization

2-bit

vertex

character-

ization

3-bit

vertex

character-

ization

25

Page 27: Models and Algorithms for Complex Networksweb.stanford.edu/group/mmds/slides2008/mihail.pdfMilena Mihail Georgia Tech. “with network elements maintaining characteristic profiles”

STOCHASTIC KRONECKER GRAPHS

c

a

b

b

ac

aa

ab

ab

bc

ba

bb

bb

cc

ca

cd

cb

bc

ba

bb

bb

aac

aaa

aab

aab

abc

aba

abb

abb

accacd

acb

abc

aba

abb

abb

bac

baa

bab

bab

bbc

bba

bbb

bbb

bcc

bca

bcd

bcb

bbc

bba

bbb

bbb

bac

baa

bab

bab

bbc

bba

bbb

bbb

bcc

bca

bcd

bcb

bbc

bba

bbb

bbb

cac

caa

cab

cab

cbc

cba

cbb

cbb

ccc

cca

ccd

ccb

cbc

cba

cbb

cbb

aca

Several properties characterized (e.g. multinomial degree distributions,

densification, shrinking diameter, self-similarity).

Large scale data set have been fit efficiently !

[ Faloutsos, Kleinberg, Leskovec 06]

26

Page 28: Models and Algorithms for Complex Networksweb.stanford.edu/group/mmds/slides2008/mihail.pdfMilena Mihail Georgia Tech. “with network elements maintaining characteristic profiles”

Talk Outline

1. Structural/Syntactic Flexible Models

2. Semantic Flexible Models

27

Flexible (further parametrized) Models

Models & Algorithms Connection : Kleinberg‟s Model(s) for Navigation

Distributed Searching Algorithms with Additional Local Info/Dynamics

Complex Networks

in Web N.0

1. On the Power of Local Replication

2. On the Power of Topology Awareness via Link Criticality

Conclusion : Web N.0 Model & Algorithm characteristics:

Further Parametrization, Locality of Info & Dynamics.

Page 29: Models and Algorithms for Complex Networksweb.stanford.edu/group/mmds/slides2008/mihail.pdfMilena Mihail Georgia Tech. “with network elements maintaining characteristic profiles”

29

Where it all started: Kleinberg’s navigability model

Theorem [Kleinberg]:

The only value for which

the network is navigable

is r =2.

Parametrization is essential

in the study of complex networks

Moral: ?

Page 30: Models and Algorithms for Complex Networksweb.stanford.edu/group/mmds/slides2008/mihail.pdfMilena Mihail Georgia Tech. “with network elements maintaining characteristic profiles”

30

Strategic Network Formation Process [Sandberg 05]:

Experimentally, the resulting network has structure and

navigability similar to Kleinberg’s small world network.

Page 31: Models and Algorithms for Complex Networksweb.stanford.edu/group/mmds/slides2008/mihail.pdfMilena Mihail Georgia Tech. “with network elements maintaining characteristic profiles”

Strategic Network Formation Process [ Green & M ‟08 ]

simplification of [Clauset & Moore 03]:

Experimentally,

the resulting network

becomes navigable

after steps

but does not have

structure similar

to Kleinberg’s

small world network.

30

Page 32: Models and Algorithms for Complex Networksweb.stanford.edu/group/mmds/slides2008/mihail.pdfMilena Mihail Georgia Tech. “with network elements maintaining characteristic profiles”

Talk Outline

1. Structural/Syntactic Flexible Models

2. Semantic Flexible Models

31

Flexible (further parametrized) Models

Models & Algorithms Connection : Kleinberg‟s Model(s) for Navigation

Distributed Searching Algorithms with Additional Local Info/Dynamics

Complex Networks

in Web N.0

1. On the Power of Local Replication

2. On the Power of Topology Awareness via Link Criticality

Conclusion : Web N.0 Model & Algorithm characteristics:

further parametrization, typically local,

locality of info in algorithms & dynamics.

Dynamics become especially important.

Page 33: Models and Algorithms for Complex Networksweb.stanford.edu/group/mmds/slides2008/mihail.pdfMilena Mihail Georgia Tech. “with network elements maintaining characteristic profiles”

1. On the Power of Local Replication

How do networks search (propagate information) :

Cost = queried nodes / found information

[ Gkantidis, M, Saberi , „04 „05 ]

32

Page 34: Models and Algorithms for Complex Networksweb.stanford.edu/group/mmds/slides2008/mihail.pdfMilena Mihail Georgia Tech. “with network elements maintaining characteristic profiles”

1. On the Power of Local Replication

Equivalent to one-step local

replication of information.

[ Gkantidis, M, Saberi , „04 „05 ]

[M, Saberi , Tetali „05 ]

Theorem (extends to power-law

random graphs):

Proof :

33

Page 35: Models and Algorithms for Complex Networksweb.stanford.edu/group/mmds/slides2008/mihail.pdfMilena Mihail Georgia Tech. “with network elements maintaining characteristic profiles”

2. On the Power of Topology Awareness via Link Criticality

34

Page 36: Models and Algorithms for Complex Networksweb.stanford.edu/group/mmds/slides2008/mihail.pdfMilena Mihail Georgia Tech. “with network elements maintaining characteristic profiles”

2. On the Power of Topology Awareness via Link Criticality

Via Semidefinite Programming

“Fastest Mixing Markov Chain”Boyd,Diaconis,Xiao 04

Via Computation

of Principal Eigenvector(s),

Distributed, AsynchronousGkantsidis, Goel, Mihail, Saberi 07

How do social networks

compute link criticality ?

35

Page 37: Models and Algorithms for Complex Networksweb.stanford.edu/group/mmds/slides2008/mihail.pdfMilena Mihail Georgia Tech. “with network elements maintaining characteristic profiles”

Link Criticality via Distributed Asynchronous

Computation of Principal Eigenvector(s)

+

-++

+

+

-

-

-

-+1+1

-1-1

Start with

Step: For all nodesStep: For links, asynchronously

[Gkantsidis,Goel,M,Saberi 07]

Hardest part: Numerical Stability.36

Page 38: Models and Algorithms for Complex Networksweb.stanford.edu/group/mmds/slides2008/mihail.pdfMilena Mihail Georgia Tech. “with network elements maintaining characteristic profiles”

Talk Outline

1. Structural/Syntactic Flexible Models

2. Semantic Flexible Models

37

Flexible (further parametrized) Models

Models & Algorithms Connection : Kleinberg‟s Model(s) for Navigation

Distributed Searching Algorithms with Additional Local Info/Dynamics

Complex Networks

in Web N.0

1. On the Power of Local Replication

2. On the Power of Topology Awareness via Link Criticality

Conclusion : Web N.0 Model & Algorithm characteristics:

further parametrization, typically local,

locality of info in algorithms & dynamics.

Dynamics become especially important.

Page 39: Models and Algorithms for Complex Networksweb.stanford.edu/group/mmds/slides2008/mihail.pdfMilena Mihail Georgia Tech. “with network elements maintaining characteristic profiles”

39

Theorem [Feder,Guetz,M,Saberi 06]:The Markov chain

corresponding to a local 2-link switch is rapidly mixing

if the degree sequence enforces diameter at least 3,

and for some .

Topology Maintenance = Connectivity & Good Conductance