portfolio optimization for influence spread (www'17)

21
Portfolio Optimization for Influence Spread Naoto Ohsaka (UTokyo) Yuichi Yoshida (NII & PFI) 2017/4/6 WWW'17@Perth, Australia ๏ผ 0.2 + 0.3 + 0.5 a b a c d e Kawarabayashi Large Graph Project

Upload: naoto-ohsaka

Post on 21-Jan-2018

101 views

Category:

Science


2 download

TRANSCRIPT

Page 1: Portfolio Optimization for Influence Spread (WWW'17)

Portfolio Optimization

for Influence SpreadNaoto Ohsaka (UTokyo)

Yuichi Yoshida (NII & PFI)

2017/4/6 WWW'17@Perth, Australia

๐›‘๏ผ 0.2 + 0.3 + 0.5a

b

a

c

d

e

Kawarabayashi Large Graph Project

Page 2: Portfolio Optimization for Influence Spread (WWW'17)

Influence maximization

2

max๐ด: ๐ด =๐‘˜

๐„[cascade size trigger by ๐ด]

0.8

a

b

d

fe

c

0.6 0.1

0.30.4

0.2 0.5

Is maximizing expectation enough?

Discrete optimization problem under

stochastic models

Find the most influential group from a social networkMotivated by viral marketing [Domingos-Richardson. KDD'01]

Greedy strategy is 1 โˆ’ eโˆ’1 โ‰ˆ 63%-approx.[Nemhauser-Wolsey-Fisher. Math. Program.'78]

Due to monotonicity & submodularity[Kempe-Kleinberg-Tardos. KDD'03]

# vertices influenced

[Kempe-Kleinberg-Tardos. KDD'03]

Page 3: Portfolio Optimization for Influence Spread (WWW'17)

Risk of having small cascades

3

๐„ = ๐„

High risk

Cascade size

Pro

bab

ilit

y

Low risk

Need a good risk measure for

finding a low-risk strategy

Q. Which is better, or ?

Page 4: Portfolio Optimization for Influence Spread (WWW'17)

Popular downside risk measuresin finance economics & actuarial science

Value at Risk at ฮฑ (VaRฮฑ) = ฮฑ-percentile

Conditional Value at Risk at ฮฑ (CVaRฮฑ)

โ‰’ expectation in the worst ฮฑ-fraction of cases

ฮฑ : significance level (typically 0.01 or 0.05)

4

CVaR is appropriate in practice & theorycoherence, convex/concave, continuous

ฮฑ

Pro

bab

ilit

y

Cascade size

VaRCVaR

๐„

Page 5: Portfolio Optimization for Influence Spread (WWW'17)

Optimizing CVaR

5

Much effort in continuous optimization[Rockafellar-Uryasev. J. Risk'00] [Rockafellar-Uryasev. J. Bank. Financ.'02]

Solve max๐ด: ๐ด =๐‘˜

CVaR๐›ผ[cascade size triggered by ๐ด]?

NOT submodular

Poly-time approximation is impossibleunder some assumption [Maehara. Oper. Res. Lett.'15]

Page 6: Portfolio Optimization for Influence Spread (WWW'17)

Our approach is portfolio optimization

6

๐›‘๏ผ 0.2 + 0.3 + 0.5a

b

a

c

d

e

Return = 0.2 โ‹… 5 + 0.3 โ‹… 4 + 0.5 โ‹… 6 = 5.2

Sample

value 0.2

b c

d

e

a

We found poly-time approximation is possible!!

Sample

value 0.3

Sample

value 0.5

54

6

investment asset

Page 7: Portfolio Optimization for Influence Spread (WWW'17)

Our contributions

โ‘  Formulation

Portfolio optimization for maximizing CVaR

โ‘ก Algorithm

Constant additive error in poly-time

โ‘ข Experiments

Our portfolio outperforms baselines

7

Page 8: Portfolio Optimization for Influence Spread (WWW'17)

Related work

[Zhang-Chen-Sun-Wang-Zhang. KDD'14]

โ–ถFind smallest ๐ด s.t. VaR โ‰ฅ thld.

[Deng-Du-Jia-Ye. WASA'15]

โ–ถFind ๐ด s.t. (# vertices influenced by ๐ด w.h.p.) โ‰ฅ thld.

โ–ถNo approx. guarantee

Robust influence maximization

[Chen-Lin-Tan-Zhao-Zhou. KDD'16] [He-Kempe. KDD'16]

โ–ถModel parameters are noisy or uncertain

โ–ถRobustness is measured by ๐„[cascade size]

8These are single set selection problems

Page 9: Portfolio Optimization for Influence Spread (WWW'17)

Formulation

Diffusion process [Goldenberg-Libai-Muller. Market. Lett.'01]

9

Graph ๐บ = ๐‘‰, ๐ธEdge prob. ๐‘: ๐ธ โ†’ 0,1

uv lives w.p. ๐‘uv2 ๐ธ outcomes

๐ด influences v

in the diffusion process

๐ด can reach v

in the random graphโ‡”๐‘‹๐ด = r.v. for # vertices reachable from ๐ด in the random graph

a

cd

b0.5

0.50.5

0.5

a

cd

b

a

cd

b

a

cd

b

a

cd

b

a

cd

b

a

cd

b

a

cd

b

a

cd

b

a

cd

b

a

cd

b

a

cd

b

a

cd

b

a

cd

b

a

cd

b

a

cd

b

a

cd

b

Page 10: Portfolio Optimization for Influence Spread (WWW'17)

Formulation

๐‘˜-vertex portfolio, ๐„ โ‹… and CVaR๐›ผ โ‹…

10

๐›‘๏ผš ๐‘‰๐‘˜

-dim. vectors.t. ๐›‘ 1 = 1

E.g., ๐‘˜ = 1, ๐œ‹a = ๐œ‹b = 0.5

๐„ ๐œ‹a๐‘‹a + ๐œ‹b๐‘‹b = 2.25

CVaR0.25 ๐œ‹a๐‘‹a + ๐œ‹b๐‘‹b = 1.25 worst 0.25-fraction

a

cd

b0.5

0.50.5

0.5

a

cd

b

a

cd

b

a

cd

b

a

cd

b

a

cd

b

a

cd

b

a

cd

b

a

cd

b

a

cd

b

a

cd

b

a

cd

b

a

cd

b

a

cd

b

a

cd

b

a

cd

b

a

cd

buv lives w.p. ๐‘uv2 ๐ธ outcomes

Dist. of ๐›‘, ๐— = ๐œ‹a๐‘‹a + ๐œ‹b๐‘‹b

Page 11: Portfolio Optimization for Influence Spread (WWW'17)

Formulation

๐‘˜-vertex portfolio, ๐„ โ‹… and CVaR๐›ผ โ‹…

11

๐›‘๏ผš ๐‘‰๐‘˜

-dim. vectors.t. ๐›‘ 1 = 1

E.g., ๐‘˜ = 1, ๐œ‹a = ๐œ‹b = 0.5

๐„ ๐œ‹a๐‘‹a + ๐œ‹b๐‘‹b = 2.25

CVaR0.25 ๐œ‹a๐‘‹a + ๐œ‹b๐‘‹b = 1.25 worst 0.25-fraction

a

cd

b0.5

0.50.5

0.5

a

cd

b

a

cd

b

a

cd

b

a

cd

b

a

cd

b

a

cd

b

a

cd

b

a

cd

b

a

cd

b

a

cd

b

a

cd

b

a

cd

b

a

cd

b

a

cd

b

a

cd

b

a

cd

buv lives w.p. ๐‘uv2 ๐ธ outcomes

Dist. of ๐›‘, ๐— = ๐œ‹a๐‘‹a + ๐œ‹b๐‘‹b

0.5โ‹…3 + 0.5โ‹…2 = 2.5

a

cd

b

๐‘‹a = 3

a

cd

b

๐‘‹b = 2

Page 12: Portfolio Optimization for Influence Spread (WWW'17)

1 1.5 1 2

1.5 2.5 1.5 3.5

1.5 2 2 3

2 2.5 2.5 3.5

Formulation

๐‘˜-vertex portfolio, ๐„ โ‹… and CVaR๐›ผ โ‹…

12

๐›‘๏ผš ๐‘‰๐‘˜

-dim. vectors.t. ๐›‘ 1 = 1

E.g., ๐‘˜ = 1, ๐œ‹a = ๐œ‹b = 0.5

๐„ ๐œ‹a๐‘‹a + ๐œ‹b๐‘‹b = 2.25

CVaR0.25 ๐œ‹a๐‘‹a + ๐œ‹b๐‘‹b = 1.25 worst 0.25-fraction

a

cd

b0.5

0.50.5

0.5

uv lives w.p. ๐‘uv2 ๐ธ outcomes

Dist. of ๐›‘, ๐— = ๐œ‹a๐‘‹a + ๐œ‹b๐‘‹b

Page 13: Portfolio Optimization for Influence Spread (WWW'17)

Formulation

Our problem definition

13

max๐›‘

CVaR๐›ผ โŸจ๐›‘, ๐—โŸฉ

๐ด: ๐ด =๐‘˜

๐œ‹๐ด๐‘‹๐ด

โš  ๐›‘ & ๐— are ๐‘‰๐‘˜

-dimensionalExisting methods cannot be applied

# vertices reachable from ๐ดin the random graph

Input : ๐บ = (๐‘‰, ๐ธ), ๐‘, integer ๐‘˜, significance level ๐›ผ

Output : ๐‘˜-vertex portfolio ๐›‘

Page 14: Portfolio Optimization for Influence Spread (WWW'17)

Our algorithm

Main idea

14

CVaR optimization โž” BIG linear programming

Multiplicative weights algorithm [Arora-Hazan-Kale. '12]

Used in optimization, machine learning, game theory, โ€ฆ

E.g., our group's application [Hatano-Yoshida. AAAI'15]

Requires an oracle for

a convex combination of the constraints

Standard approximation

Greedy strategy solves approximately!

โž”

Page 15: Portfolio Optimization for Influence Spread (WWW'17)

Our algorithm

Standard approximation as a first step

max๐›‘,๐œ

๐œ โˆ’1

๐›ผ๐‘ 

1โ‰ค๐‘–โ‰ค๐‘ 

max ๐œ โˆ’ ๐›‘, ๐—๐‘– , 0

Sampling ๐—1, โ€ฆ , ๐—๐‘  from the dist. of ๐—

Solve the feasibility problem โ โ‰ฅ ๐›พ?โž

to perform the bisection search on ๐›พ

max๐›‘

CVaR๐›ผ ๐›‘, ๐—

Write CVaR as an optimization problem[Rockafellar-Uryasev. J. Risk'00]

15

Page 16: Portfolio Optimization for Influence Spread (WWW'17)

Our algorithm

Difficulty of checking โ โ‰ฅ ๐›พ?โž

16

โˆƒ? ๐ฑ โˆˆ ๐๐›พ ๐€๐ฑ โ‰ฅ ๐›

auxiliary variable required for expressing CVaR

auxiliary variables for removing max functions

๐‘˜-vertex portfolio๐ฑ =

๐œ๐ฒ๐›‘

โ Multiple submodular functions

exceed a threshold simultaneously โž

โ โ‰ฅ ๐›พ?โž ๏ผ Feasibility of a BIG linear programming

# variables โ‰ˆ ๐‘‰๐‘˜

# constraints = ๐‘ 

โ‰ˆ

HARD to satisfy!!

Page 17: Portfolio Optimization for Influence Spread (WWW'17)

Our algorithm

Our solution for checking the BIG LP

Multiplicative weights algorithm [Arora-Hazan-Kale. '12]

โ‘  Solve the convex combination โ–ก for ๐ฉ = ๐ฉ1, โ€ฆ , ๐ฉ๐‘‡โ‘ก The average solution โ‰ˆ A solution of โ–ก

17

โˆƒ? ๐ฑ โˆˆ ๐๐›พ ๐ฉ, ๐€๐ฑ โ‰ฅ ๐ฉ, ๐›

โ A single submodular function

exceeds a threshold โž

We can assume ๐›‘ is sparse โž” Greedy works!!Constant additive error โˆ’eโˆ’1 in poly-time

Still looks difficult, but โ€ฆ

โ‰ˆ

Page 18: Portfolio Optimization for Influence Spread (WWW'17)

Our algorithm

Summary : Time & Quality

18

Exact CVaR optimization

Empirical CVaR optimization

BIG linear programming

Convex combination Greedy works!

Multiplicative weights!

Error is bounded

Bisection search

โ‰ˆ

CVaR๐›ผ ๐›‘, ๐— โ‰ฅ max๐›‘โˆ—

CVaRฮฑ ๐›‘โˆ—, ๐— โˆ’ ๐‘‰ eโˆ’1 โˆ’ ๐‘‰ ๐œ–

O ๐œ–โˆ’6๐‘˜2 ๐‘‰ ๐ธ time O ๐‘“ = O ๐‘“ log๐‘ ๐‘“

additive erroroptimum value

Page 19: Portfolio Optimization for Influence Spread (WWW'17)

Experiments

Settings

โ–ถ Test data (see our paper for results on other data)

Physicians friend network ( ๐‘‰ = 117 & ๐ธ = 542)from [Koblenz Network Collection]

๐‘uv = outโˆ’degree of u โˆ’1

โ–ถ Parameter : ๐œ– = 0.4

โ–ถ Baselines : produce a single vertex set

19

Greedy a standard greedy algorithm for influence maximization

Degree select ๐‘˜ vertices in degree decreasing order

Random select ๐‘˜ vertices uniformly at random

โ–ถ Environment : Intel Xeon E5-2670 2.60GHz CPU, 512GB RAM

1/3 1/3

1/3

1/4 1/4

1/4

1/4

Page 20: Portfolio Optimization for Influence Spread (WWW'17)

Experiments

Results for ๐‘˜ = 10 & ๐›ผ = 0.01

๐ด ๐œ‹๐ด25,42,67,81,85,94,103,106,111,112 3/47

7,20,21,48,75,98,104,111,112,113 2/47

0,29,43,52,92,97,107,108,113,116 2/47

25,38,69,71,81,103,105,110,112,116 2/47

โ‹ฎ โ‹ฎ

๐›‘ is extremely sparse!!

(# non-zeros in ๐›‘) = 42

dim. of ๐›‘ = ๐‘‰10

โ‰ˆ 1014

CVaR at ๐œถ Expectation Runtime

This work 23.6 37.7 157.2s

Greedy 16.9 38.2 0.1s

Degree 14.2 30.8 0.2ms

Random 15.1 31.0 0.2ms

Comparable

Concentrated

Page 21: Portfolio Optimization for Influence Spread (WWW'17)

Conclusion

Future studyโ… : Speed-up

O ๐œ–โˆ’6๐‘˜2 ๐‘‰ ๐ธ time is still expensive

Can we solve the BIG LP in a different way?

Future studyโ…ก: Exact computation of CVaR

Is it possible to extend [Maehara-Suzuki-Ishihata. WWW'17] ?

Future studyโ…ข: Other risk measures

Value at Risk, Lower partial moment, โ€ฆ21

Succeed to obtain a low-risk strategy

by a portfolio optimization approach