diffusion of innovation and influence maximization · lecture outline 1 di usion of innovation 2 in...

25
Diffusion of Innovation and Influence Maximization Leonid E. Zhukov School of Data Analysis and Artificial Intelligence Department of Computer Science National Research University Higher School of Economics Structural Analysis and Visualization of Networks Leonid E. Zhukov (HSE) Lecture 15 28.04.2015 1 / 25

Upload: others

Post on 31-May-2020

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Diffusion of Innovation and Influence Maximization · Lecture outline 1 Di usion of innovation 2 In uence propagation models Independent cascade model Linear threshold model 3 In

Diffusion of Innovation and Influence Maximization

Leonid E. Zhukov

School of Data Analysis and Artificial IntelligenceDepartment of Computer Science

National Research University Higher School of Economics

Structural Analysis and Visualization of Networks

Leonid E. Zhukov (HSE) Lecture 15 28.04.2015 1 / 25

Page 2: Diffusion of Innovation and Influence Maximization · Lecture outline 1 Di usion of innovation 2 In uence propagation models Independent cascade model Linear threshold model 3 In

Lecture outline

1 Diffusion of innovation

2 Influence propagation modelsIndependent cascade modelLinear threshold model

3 Influence maximization problemSubmodular function optimization

Leonid E. Zhukov (HSE) Lecture 15 28.04.2015 2 / 25

Page 3: Diffusion of Innovation and Influence Maximization · Lecture outline 1 Di usion of innovation 2 In uence propagation models Independent cascade model Linear threshold model 3 In

”Diffusion” process

Propagation process:

Viral propagation:- virus and infection- rumors, news

Decision based models:- adoption of innovation- joining politcal protest- purchase decision

Local individual decision rules will lead to very different global results.”microscopic” changes → ”macroscopic” results

Leonid E. Zhukov (HSE) Lecture 15 28.04.2015 3 / 25

Page 4: Diffusion of Innovation and Influence Maximization · Lecture outline 1 Di usion of innovation 2 In uence propagation models Independent cascade model Linear threshold model 3 In

Ryan-Gross study

Ryan-Gross study of hybrid seed corn delayed adoption (after firstexposure)

Information effect vs adopting of innnovationRyan and Gross, 1943

Leonid E. Zhukov (HSE) Lecture 15 28.04.2015 4 / 25

Page 5: Diffusion of Innovation and Influence Maximization · Lecture outline 1 Di usion of innovation 2 In uence propagation models Independent cascade model Linear threshold model 3 In

Diffusion of innovation

Everett Rogers (sociologist) , ”Diffusion of innovation” book, 1962

Leonid E. Zhukov (HSE) Lecture 15 28.04.2015 5 / 25

Page 6: Diffusion of Innovation and Influence Maximization · Lecture outline 1 Di usion of innovation 2 In uence propagation models Independent cascade model Linear threshold model 3 In

Influence response

Influence response: diminishing returns and threshold (critical mass)

P(n) = 1− (1− p)n P(b) = δ(b > b0)

Two models:

Independent Cascade Model (diminishing returns)

Linear Threshold Model (critical mass)

Leonid E. Zhukov (HSE) Lecture 15 28.04.2015 6 / 25

Page 7: Diffusion of Innovation and Influence Maximization · Lecture outline 1 Di usion of innovation 2 In uence propagation models Independent cascade model Linear threshold model 3 In

Live journal

10 mln members, joining groups

Backstrom, 2006

Leonid E. Zhukov (HSE) Lecture 15 28.04.2015 7 / 25

Page 8: Diffusion of Innovation and Influence Maximization · Lecture outline 1 Di usion of innovation 2 In uence propagation models Independent cascade model Linear threshold model 3 In

Independent cascade model

Initial set of active nodes S0

Discrete time steps

On every step an active node v can activate connected neighbor wwith a probability pv ,w (single chance)

If v succeeds, w becomes active on the next time step

Process runs until no more activations possible

If pv ,w = p it is a particular type of SIR model, a node stays infected foronly one step

Leonid E. Zhukov (HSE) Lecture 15 28.04.2015 8 / 25

Page 9: Diffusion of Innovation and Influence Maximization · Lecture outline 1 Di usion of innovation 2 In uence propagation models Independent cascade model Linear threshold model 3 In

Independent cascade model

Cascade - sequence of changes of behavior, ”chain reaction”

Leonid E. Zhukov (HSE) Lecture 15 28.04.2015 9 / 25

Page 10: Diffusion of Innovation and Influence Maximization · Lecture outline 1 Di usion of innovation 2 In uence propagation models Independent cascade model Linear threshold model 3 In

Network coordination game

Local interaction game: Let u and v are players, and A and b are possiblestrategiesPayoffs

if u and v both adopt behavior A, each get payoff a > 0

if u and v both adopt behavior B, each get payoff b > 0

if u and v adopt opposite behavior, each get payoff 0

Leonid E. Zhukov (HSE) Lecture 15 28.04.2015 10 / 25

Page 11: Diffusion of Innovation and Influence Maximization · Lecture outline 1 Di usion of innovation 2 In uence propagation models Independent cascade model Linear threshold model 3 In

Threshold model

Network coordination game, direct-benefit effect

Node v to make decision A or B, p - portion of type A neighborsto accept A:

a · p · d > b · (1− p) · dp ≥ b/(a + b)

Threshold:

q =b

a + bAccept new behavior A when p ≥ q

Leonid E. Zhukov (HSE) Lecture 15 28.04.2015 11 / 25

Page 12: Diffusion of Innovation and Influence Maximization · Lecture outline 1 Di usion of innovation 2 In uence propagation models Independent cascade model Linear threshold model 3 In

Cascades

Cascade - sequence of changes of behavior, ”chain reaction”

Let a = 3, b = 2, threshold q = 2/(2 + 3) = 2/5Leonid E. Zhukov (HSE) Lecture 15 28.04.2015 12 / 25

Page 13: Diffusion of Innovation and Influence Maximization · Lecture outline 1 Di usion of innovation 2 In uence propagation models Independent cascade model Linear threshold model 3 In

Cascade propagation

Let a = 3, b = 2, threshold q = 2/(2 + 3) = 2/5

Start from nodes 7,8: 1/3 < 2/5 < 1/2 < 2/3

Cascade size - number of nodes that changed the behavior

Complete cascade when every node changes the behavior

Leonid E. Zhukov (HSE) Lecture 15 28.04.2015 13 / 25

Page 14: Diffusion of Innovation and Influence Maximization · Lecture outline 1 Di usion of innovation 2 In uence propagation models Independent cascade model Linear threshold model 3 In

Cascades and clusters

Group of nodes form a cluster of density ρ if every node in the set has atleast fraction ρ of its neighbors in the set

Both clusters of density ρ = 2/3. For cascade to get into cluster q ≤ 1− ρ.

images from Easley & Kleinberg

Leonid E. Zhukov (HSE) Lecture 15 28.04.2015 14 / 25

Page 15: Diffusion of Innovation and Influence Maximization · Lecture outline 1 Di usion of innovation 2 In uence propagation models Independent cascade model Linear threshold model 3 In

Linear threshold model

Influence comes only from NN N(i) nodes, wij influence i → j

Require∑

j∈N(i) wji ≤ 1

Each node has a random acceptance threshold from θi ∈ [0, 1]

Activation: fraction of active nodes exceeds threshold∑active j∈N(i)

wji > θi

Initial set of active nodes Ao , iterative process with discrete time steps

Progressive process, only nonactive → active

Leonid E. Zhukov (HSE) Lecture 15 28.04.2015 15 / 25

Page 16: Diffusion of Innovation and Influence Maximization · Lecture outline 1 Di usion of innovation 2 In uence propagation models Independent cascade model Linear threshold model 3 In

Cascades in random networks

Global cascade (sufficiently large)Triggered by single node (or small set)Random graphs ER pkThreshold distribution φ

Cascade window: a) homogenous threshold b) normal thresholddistributionD. Watts, 2002

Leonid E. Zhukov (HSE) Lecture 15 28.04.2015 16 / 25

Page 17: Diffusion of Innovation and Influence Maximization · Lecture outline 1 Di usion of innovation 2 In uence propagation models Independent cascade model Linear threshold model 3 In

Cascades in random networks

multiple seed nodes

(a) Empirical network; (b), (c) - randomized networkP. Singh, 2013

Leonid E. Zhukov (HSE) Lecture 15 28.04.2015 17 / 25

Page 18: Diffusion of Innovation and Influence Maximization · Lecture outline 1 Di usion of innovation 2 In uence propagation models Independent cascade model Linear threshold model 3 In

Influence maximization problem

Initial set of active nodes Ao

Cascade size σ(Ao) - expected number of active nodes whenpropagation stops

Find k-set of nodes Ao that produces maximal cascade σ(Ao)

k-set of ”maximum influence” nodes

NP-hard

D. Kempe, J. Kleinberg, E. Tardos, 2003, 2005

Leonid E. Zhukov (HSE) Lecture 15 28.04.2015 18 / 25

Page 19: Diffusion of Innovation and Influence Maximization · Lecture outline 1 Di usion of innovation 2 In uence propagation models Independent cascade model Linear threshold model 3 In

Submodular functions

Set function f is submodular, if for sets S , T and S ⊆ T , ∀v /∈ T

f (S ∪ {v})− f (S) ≥ f (T ∪ {v})− f (T )

Function of diminishing returns (”concave property”)

Function f is monotone, f (S ∪ {v}) ≥ f (S)

Leonid E. Zhukov (HSE) Lecture 15 28.04.2015 19 / 25

Page 20: Diffusion of Innovation and Influence Maximization · Lecture outline 1 Di usion of innovation 2 In uence propagation models Independent cascade model Linear threshold model 3 In

Submodular functions

Theorem

Let F be a monotone submodular function andlet S∗ be the k-element set achieving maximal f .Let S be a k-element set obtained by repeatedly, for k-iterations, includingan element producing the largest marginal increase in f .

f (S) ≥ (1− 1

e)f (S∗)

Nemhauser, Wolsey, and Fisher, 1978

Leonid E. Zhukov (HSE) Lecture 15 28.04.2015 20 / 25

Page 21: Diffusion of Innovation and Influence Maximization · Lecture outline 1 Di usion of innovation 2 In uence propagation models Independent cascade model Linear threshold model 3 In

Influence maximization

Cascade size σ(S) is submodular function (D. Kempe, J. Kleinberg,E. Tardos, 1993)

σ(S) ≥ (1− 1

e)σ(S∗)

Greedy algorithm for maximum influence set finds a set S such thatits influence set σ(S) is within 1/e = 0.367 from the optimal(maximal) set σ(S∗), σ(S) ≥ 0.629σ(S∗)

Leonid E. Zhukov (HSE) Lecture 15 28.04.2015 21 / 25

Page 22: Diffusion of Innovation and Influence Maximization · Lecture outline 1 Di usion of innovation 2 In uence propagation models Independent cascade model Linear threshold model 3 In

Influence maximization

Approximation algorithm

Algorithm: Greedy optimization

Input: Graph G (V ,E ), k

Output: Maxumum influence set S

Set S ← 0for i = 1 : k do

select v = arg maxu∈V \S(σ(S ∪ {u})− σ(S))S ← S ∪ {v}

Leonid E. Zhukov (HSE) Lecture 15 28.04.2015 22 / 25

Page 23: Diffusion of Innovation and Influence Maximization · Lecture outline 1 Di usion of innovation 2 In uence propagation models Independent cascade model Linear threshold model 3 In

Experimental results

Independent cascade model Linear threshold model

network: collaboration graph 10,000 nodes, 53,000 edges

D. Kempe, J. Kleinberg, E. Tardos, 2003

Leonid E. Zhukov (HSE) Lecture 15 28.04.2015 23 / 25

Page 24: Diffusion of Innovation and Influence Maximization · Lecture outline 1 Di usion of innovation 2 In uence propagation models Independent cascade model Linear threshold model 3 In

Computational considerations

Independent cascade model: influence spread and running time

W. Chen et.al, 2009

Leonid E. Zhukov (HSE) Lecture 15 28.04.2015 24 / 25

Page 25: Diffusion of Innovation and Influence Maximization · Lecture outline 1 Di usion of innovation 2 In uence propagation models Independent cascade model Linear threshold model 3 In

References

Contagion, S. Morris, Review of Economic Studies, 67, p 57-78, 2000

Maximizing the Spread of Influence through a Social Network, D.Kempe, J. Kleinberg, E. Tardos, 2003

Influential Nodes in a Diffusion Model for Social Networks, D.Kempe, J. Kleinberg, E. Tardos, 2005

Efficient Influence Maximization in Social Networks, W. Chen, Y.Wang, S, Yang, KDD 2009.

A Simple Model of Global Cascades on Random Networks. D. Watts,2002.

Leonid E. Zhukov (HSE) Lecture 15 28.04.2015 25 / 25