david daniels, eric liu, charles zhang cme 323rezab/classes/cme323/s15/...Β Β· key results from...
TRANSCRIPT
![Page 1: David Daniels, Eric Liu, Charles Zhang CME 323rezab/classes/cme323/S15/...Β Β· Key results from simulations The optimal weight vector π‘ that attains minimum of lies in close neighbourhood](https://reader035.vdocument.in/reader035/viewer/2022071104/5fdd6f1e404b1b209906b8d6/html5/thumbnails/1.jpg)
David Daniels, Eric Liu, Charles Zhang
CME 323
![Page 2: David Daniels, Eric Liu, Charles Zhang CME 323rezab/classes/cme323/S15/...Β Β· Key results from simulations The optimal weight vector π‘ that attains minimum of lies in close neighbourhood](https://reader035.vdocument.in/reader035/viewer/2022071104/5fdd6f1e404b1b209906b8d6/html5/thumbnails/2.jpg)
Not all edges are created equal
Example: AngelList ($2.9+ billion) β¦ Investors can follow a start-up (social link)
β¦ Investors can invest in a start-up (economic link)
β¦ What is the relative importance of a social link versus an economic link?
![Page 3: David Daniels, Eric Liu, Charles Zhang CME 323rezab/classes/cme323/S15/...Β Β· Key results from simulations The optimal weight vector π‘ that attains minimum of lies in close neighbourhood](https://reader035.vdocument.in/reader035/viewer/2022071104/5fdd6f1e404b1b209906b8d6/html5/thumbnails/3.jpg)
This paper attempts to recover edge weights that are revealed via the propagation of actual influence along a network β¦ βWhat is the edge-type weight vector that best
describes a graph, assuming influence operates as if characterized by Edge-Type Weighted PageRank?β
![Page 4: David Daniels, Eric Liu, Charles Zhang CME 323rezab/classes/cme323/S15/...Β Β· Key results from simulations The optimal weight vector π‘ that attains minimum of lies in close neighbourhood](https://reader035.vdocument.in/reader035/viewer/2022071104/5fdd6f1e404b1b209906b8d6/html5/thumbnails/4.jpg)
Edge-Type Weighted Graphs
Weighted PageRank
Structural Estimation
Algorithm β¦ (Inner) PageRank Iterations (for given weight vector)
β¦ (Outer) Search Strategy (for optimal weight vector)
Experiments
![Page 5: David Daniels, Eric Liu, Charles Zhang CME 323rezab/classes/cme323/S15/...Β Β· Key results from simulations The optimal weight vector π‘ that attains minimum of lies in close neighbourhood](https://reader035.vdocument.in/reader035/viewer/2022071104/5fdd6f1e404b1b209906b8d6/html5/thumbnails/5.jpg)
πΊ = π, πΈ(π, π‘), π β π’π€
π is the set of vertices with π = π
πΈ is the set of edges with πΈ = π
Each edge π having a type attribute π‘ β π― ={1, 2, β¦ , π}
π = (π1, β¦ , ππ) β β+T is the weight vector
![Page 6: David Daniels, Eric Liu, Charles Zhang CME 323rezab/classes/cme323/S15/...Β Β· Key results from simulations The optimal weight vector π‘ that attains minimum of lies in close neighbourhood](https://reader035.vdocument.in/reader035/viewer/2022071104/5fdd6f1e404b1b209906b8d6/html5/thumbnails/6.jpg)
Weighted stochastic adjacency matrix Aβ βπΓπ
π΄ππ =π€ππ
π€πππ where π€ππ = ππ‘ππ is the weight for edge πππ,
if πππ β πΈ (i.e. if π£π β π£π), and π€ππ = 0 if πππ β πΈ
Alternatively, we can write A as
π΄ = π1π΅1 +β―+πππ΅
π πΆ where πΆππ = ππ‘π‘ π΅ π‘
πππ
![Page 7: David Daniels, Eric Liu, Charles Zhang CME 323rezab/classes/cme323/S15/...Β Β· Key results from simulations The optimal weight vector π‘ that attains minimum of lies in close neighbourhood](https://reader035.vdocument.in/reader035/viewer/2022071104/5fdd6f1e404b1b209906b8d6/html5/thumbnails/7.jpg)
Find π β βn such that π = (1 β πΏ)/π + πΏπ΄π
i.e -> Find the eigenvector corresponding to the eigenvalue π = 1 for the matrix π = πΏπ΄ +1βπΏ
ππ
(PerronβFrobenius) on positive stochastic matrices
π = 1 is the unique largest eigenvalue of π
=> Power Iterations
![Page 8: David Daniels, Eric Liu, Charles Zhang CME 323rezab/classes/cme323/S15/...Β Β· Key results from simulations The optimal weight vector π‘ that attains minimum of lies in close neighbourhood](https://reader035.vdocument.in/reader035/viewer/2022071104/5fdd6f1e404b1b209906b8d6/html5/thumbnails/8.jpg)
Οopt = argminπ
β πππππ ππ πΊ π, πΈ, π , πβ
πβ = πππππ(ππ πΊ π, πΈ,πβ +π© 0, ΟΟ΅
2πΌ )
β π1, π2 = π1 β π2 2
![Page 9: David Daniels, Eric Liu, Charles Zhang CME 323rezab/classes/cme323/S15/...Β Β· Key results from simulations The optimal weight vector π‘ that attains minimum of lies in close neighbourhood](https://reader035.vdocument.in/reader035/viewer/2022071104/5fdd6f1e404b1b209906b8d6/html5/thumbnails/9.jpg)
![Page 10: David Daniels, Eric Liu, Charles Zhang CME 323rezab/classes/cme323/S15/...Β Β· Key results from simulations The optimal weight vector π‘ that attains minimum of lies in close neighbourhood](https://reader035.vdocument.in/reader035/viewer/2022071104/5fdd6f1e404b1b209906b8d6/html5/thumbnails/10.jpg)
![Page 11: David Daniels, Eric Liu, Charles Zhang CME 323rezab/classes/cme323/S15/...Β Β· Key results from simulations The optimal weight vector π‘ that attains minimum of lies in close neighbourhood](https://reader035.vdocument.in/reader035/viewer/2022071104/5fdd6f1e404b1b209906b8d6/html5/thumbnails/11.jpg)
Key results from simulations
The optimal weight vector ππππ‘ that attains minimum of π lies in close neighbourhood of true weight vector, πβ.
Albeit intractability to find closed-form of derivative, the function π is convex and smooth (and at least piecewise continuous/convex for higher dimensions).
Weighted PageRanks perform better than unweighted PageRanks especially on graphs with high in-degree / low edge weights, low in-degree / high edge weights (see paper).
![Page 12: David Daniels, Eric Liu, Charles Zhang CME 323rezab/classes/cme323/S15/...Β Β· Key results from simulations The optimal weight vector π‘ that attains minimum of lies in close neighbourhood](https://reader035.vdocument.in/reader035/viewer/2022071104/5fdd6f1e404b1b209906b8d6/html5/thumbnails/12.jpg)
πβ = (0.5714, 0.2857, 0.1429) 95% CI for π1
β : [0.5677, 0.5831] 95% CI for π2
β : [0.2797, 0.2899] 95% CI for π3
β : [0.1325, 0.1471]
![Page 13: David Daniels, Eric Liu, Charles Zhang CME 323rezab/classes/cme323/S15/...Β Β· Key results from simulations The optimal weight vector π‘ that attains minimum of lies in close neighbourhood](https://reader035.vdocument.in/reader035/viewer/2022071104/5fdd6f1e404b1b209906b8d6/html5/thumbnails/13.jpg)
Local Machine
Power iteration on π 2π β 1 π ~ πͺ π2 per multiplication
Number of iterations:
πͺlog 1/π β ππ2/π1
log 1/π2 ~πͺ
log 1/π
log 1/πΏ
because πππ£ = π1(π +π2
π1π2
ππ2 +β―+ππ
π1ππ
πππ
and by Haveliwala (2003)βs bound on π2 β€ πΏ
Smart update: π£(π) = (1 β πΏ)/π + πΏπ΄π£(πβ1) 2π β π ~ πͺ π per update
![Page 14: David Daniels, Eric Liu, Charles Zhang CME 323rezab/classes/cme323/S15/...Β Β· Key results from simulations The optimal weight vector π‘ that attains minimum of lies in close neighbourhood](https://reader035.vdocument.in/reader035/viewer/2022071104/5fdd6f1e404b1b209906b8d6/html5/thumbnails/14.jpg)
Distributed using Pregel Framework
![Page 15: David Daniels, Eric Liu, Charles Zhang CME 323rezab/classes/cme323/S15/...Β Β· Key results from simulations The optimal weight vector π‘ that attains minimum of lies in close neighbourhood](https://reader035.vdocument.in/reader035/viewer/2022071104/5fdd6f1e404b1b209906b8d6/html5/thumbnails/15.jpg)
Distributed using Pregel Framework, π΅ machines, with combiners
Communication cost: πͺ min π, ππ΅
Reduce size for each key:
β¦ πͺ min max indegrees, π΅ β¦ Max in-degrees could be as bad as πͺ π
β¦ On average, it is πͺ π/π
Number of supersteps: πͺ log 1/π
![Page 16: David Daniels, Eric Liu, Charles Zhang CME 323rezab/classes/cme323/S15/...Β Β· Key results from simulations The optimal weight vector π‘ that attains minimum of lies in close neighbourhood](https://reader035.vdocument.in/reader035/viewer/2022071104/5fdd6f1e404b1b209906b8d6/html5/thumbnails/16.jpg)
Grid search πͺ 1/πΌ ππ log 1/π
Numerical gradient descent π(π+1) = π(π) β πΎ π»π π π
ππ
πππ‘π π =
π π π + πΌππ‘ β πΌππ β π π π
πΌ
πͺ ππ π log 1/π + π log π πΏ
![Page 17: David Daniels, Eric Liu, Charles Zhang CME 323rezab/classes/cme323/S15/...Β Β· Key results from simulations The optimal weight vector π‘ that attains minimum of lies in close neighbourhood](https://reader035.vdocument.in/reader035/viewer/2022071104/5fdd6f1e404b1b209906b8d6/html5/thumbnails/17.jpg)
Accuracy?
N.B.: Experiments use a slightly different minimization objective β squared difference in PageRank scores rather than squared difference in PageRank order β because our laptop Spark setup (4 cores, 4 GB memory) was able to process the former metric, but not the latter, in a reasonable time for graphs of a size worth distributing.
![Page 18: David Daniels, Eric Liu, Charles Zhang CME 323rezab/classes/cme323/S15/...Β Β· Key results from simulations The optimal weight vector π‘ that attains minimum of lies in close neighbourhood](https://reader035.vdocument.in/reader035/viewer/2022071104/5fdd6f1e404b1b209906b8d6/html5/thumbnails/18.jpg)
Accuracy?
Nodes Edges # Types Recovered Weights True Weights
600 20% 2 .332, .668 .33, .67
600 20% 4 .098, .199, .3, .4 .1, .2, .3, .4
600 20% 2 .331, .669 .33+Ο΅,.67+Ο΅,
Ο΅ ~π© 0,.1
π€π
2
2000 500 3 .165, .332, .503 .166, .333, .5
![Page 19: David Daniels, Eric Liu, Charles Zhang CME 323rezab/classes/cme323/S15/...Β Β· Key results from simulations The optimal weight vector π‘ that attains minimum of lies in close neighbourhood](https://reader035.vdocument.in/reader035/viewer/2022071104/5fdd6f1e404b1b209906b8d6/html5/thumbnails/19.jpg)
Runtime?
![Page 20: David Daniels, Eric Liu, Charles Zhang CME 323rezab/classes/cme323/S15/...Β Β· Key results from simulations The optimal weight vector π‘ that attains minimum of lies in close neighbourhood](https://reader035.vdocument.in/reader035/viewer/2022071104/5fdd6f1e404b1b209906b8d6/html5/thumbnails/20.jpg)
Erdos-Renyi random graphs, 600 Nodes
![Page 21: David Daniels, Eric Liu, Charles Zhang CME 323rezab/classes/cme323/S15/...Β Β· Key results from simulations The optimal weight vector π‘ that attains minimum of lies in close neighbourhood](https://reader035.vdocument.in/reader035/viewer/2022071104/5fdd6f1e404b1b209906b8d6/html5/thumbnails/21.jpg)
Can recover edge-type weights accurately
Next steps β¦ Dynamically changing PageRank tolerance
β¦ Look for direction in unit ball with small π2
![Page 22: David Daniels, Eric Liu, Charles Zhang CME 323rezab/classes/cme323/S15/...Β Β· Key results from simulations The optimal weight vector π‘ that attains minimum of lies in close neighbourhood](https://reader035.vdocument.in/reader035/viewer/2022071104/5fdd6f1e404b1b209906b8d6/html5/thumbnails/22.jpg)