cse416a analysis of network data semester summary › ~m.neumann › fl2019 › cse416 › slides...

Post on 27-Jun-2020

0 Views

Category:

Documents

0 Downloads

Preview:

Click to see full reader

TRANSCRIPT

CSE416A ANALYSIS OF NETWORK DATA

Fall 2019Marion Neumann

SEMESTER SUMMARY

Contents in these slides may be subject to copyright. Some materials are adopted from: http://www.cs.cornell.edu/home/kleinber/networks-book, http://web.stanford.edu/class/cs224w/, http://www.mmds.org.

REASONING ABOUT NETWORKS

• What do we study in networks? • Structure and evolution:• What is the structure of a network? • Why and how did it become to have such structure?

• Processes and dynamics: • networks provide “skeleton” for spreading of information,

behavior, diseases, …

2

REASONING ABOUT NETWORKS

• How do we reason about/understand networks? • Empirical: Study network data to find organizational

principles• Mathematical models: Study probabilistic models

and graph theory to derive properties theoretically• Algorithms: Methods for analyzing graphs to

compute patterns, similarities, and interesting hidden features

3

REASONING ABOUT NETWORKS• Empirical à organizational principles• Mathematical models à theoretical properties• Algorithms à patterns, similarities, features

4

Mathematics à prob/statsà graph theory à linear algebra

Field of Application

à social/political science

à biologyà intelligenceà ...

Computer Science

à algorithms & data structures

à data scienceà big data

Structure and Evolution

Processes and Dynamics

WHAT WE LEARNED 1. Communities in Networks2. Betweenness-based Clustering

• Girvan-Newman3. Modularity

• maximization• modularity matrix

4. Spectral clustering• Graph Laplacian

5. Overlapping Communities• Clique Percolation Method• Finding Cliques

6. Node Similarity• structural & regular equivalence• Random Walks• SimRank

7. Node Classification• Label Propagation

5

1. Random Graph Model• degree distribution• clustering coefficient• avg. path length• giant component• evolution & phase transition• problems

2. Small World Model• avg. path length• clustering coefficient• problems

3. Scale-free Networks• power-law distribution• exponent estimation

4. Preferential Attachment Model• rich-get-richer phenomenon• power-law distribution• clustering coefficient• average path length

1. Spreading Processes• Probabilistic Models of Spread• Epidemics

2. Cascading Behavior• Independent Cascade Model• Exposure Curves• Viral Marketing

3. Graph Classification

Part II

Part III

Part IV

Networked Data & Graphs… Part I

HOW IT ALL FITS TOGETHERMeasures & Properties Models Algorithms

diameter & local structure, small-world

ER & small world models shortest-path algorithms

degree, scale-free, hubs preferential attachment, power-law distribution

centrality measures, power iteration

strong and weak ties,communities

Girvan-Newman, modularity,spectral, CPM

node properties & similarity random walks SimRank, label propagation

contagion & spread epidemicsindependent cascade model

simulations & statistical tests

6

NETWORK MODELS: ER & SM

• Properties• small-world à small diameter• local structure à high clustering coefficient

• Models• Random Graph Model• Small-World Model

à phase transition

• Digression: configuration model

7

NETWORK MODELS: PREFERENTIAL ATTACHMENT

• Properties• degree distribution• rich-get-richer/hubs• scale-free

• Model• Preferential attachment

à model network evolution

8

COMMUNITY DETECTION

• Properties• local structure • homophily

• Algorithms• Girvan-Newman• Modularity Maximization• Spectral Clustering• Clique Percolation Method

9

NODE SIMILARITY & CLASSIFICATION

• Properties• homophily/auto-correlation• node properties• missing information

• Model• Random walk (with restart)

• Algorithms• SimRank• Label Propagation

10

NETWORK DYNAMICS• Properties• contagion/spread• exposure/adoption

• Measures• epidemic threshold• exposure curves

• Models• SIS, SIR• Independence Cascade Model

11

β

δ

WHAT CAN WE DO WITH COMPLEX NETWORK ANALYSIS?

Complex Network Analysis: Use empirical measures, comparisons to network models, and network algorithms to reasonabout network properties and underlying phenomena.

12

map of superpowers

SUMMARY

• You have learned a lot!• answered insightful questions• derived many interesting results • implemented a number of algorithms• practiced many real-world workflows

Thank You for the Hard Work!!!

Please, fill-in the course evaluation.

13

Looking for a good

read for the break?

top related