probabilistic soft logic for trust analysis in social networks i...

1
S I Q L N Probabilistic Soft Logic for Trust Analysis in Social Networks Bert Huang, Angelika Kimmig, Lise Getoor, and Jennifer Golbeck Department of Computer Science University of Maryland, College Park Trust Analysis Probabilistic Soft Logic FilmTrust Experiment Data description and experiment setup Modeling trust benefits many applications: recommendation engines, viral marketing, reputation management Trust analysis in social networks is a rich application for statistical relational A.I. and learning (SRL) We analyze trust using the SRL tool Probabilistic Soft Logic Declarative language for probabilistic models using first-order logic (FOL) syntax Many existing methods for computational trust analysis, e.g., TidalTrust, EigenTrust, Advogato. Truth values are relaxed to soft truth in [0,1] Logical operators relaxed via Lukasiewicz t-norm: Mechanisms for incorporating similarity functions and reasoning about sets MAP inference is a convex optimization Efficient sampling method for marginal inference a ˜ ^b = max{0, a + b - 1}, a ˜ _b = min{a + b ,1}, ˜ ¬a =1 - a, Continuous probability distribution over truth values: Pr(x )= 1 Z exp 0 @ - X r 2P X g 2G (r ) w r (1 - t g (x )) 1 A t g (x ) w r P : the PSL program : set of all groundings of rule r G (r ) : weight of rule r : truth value of grounding g Method MAE τ ρ PSL-Triadic 0.2985 0.0717 0.0944 PSL-Personality 0.2586 0.1681 0.2265 PSL-Similarity 0.2198 0.1089 0.1539 PSL-TriadPers 0.2509 0.1801 0.2417 PSL-TriadSim 0.2146 0.1197 0.1688 PSL-PersSim 0.2154 0.1771 0.2444 PSL-TriadPersSim 0.2246 0.1907 0.2598 sametraits 0.2461 0.0531 0.0739 Avg-Incoming 0.3751 0.0120 0.0167 Avg-Outgoing 0.3327 0.1088 0.1463 Avg-Global 0.2086 EigenTrust 0.6729 -0.0229 -0.0291 TidalTrust 0.2387 0.0478 0.0649 0.2 0.4 0.6 0.8 1 0.2 0.4 0.6 0.8 1 True PSL-Triadic Predicted 0 20 40 60 80 0.2 0.4 0.6 0.8 1 0.2 0.4 0.6 0.8 True PSL-TriadPers Predicted 0 20 40 60 0.2 0.4 0.6 0.8 1 0.2 0.4 0.6 0.8 1 True PSL-TriadPersSim Predicted 0 20 40 60 80 0.2 0.4 0.6 0.8 1 0.2 0.4 0.6 0.8 1 True sameTraits Predicted 0 50 100 0.2 0.4 0.6 0.8 1 0.2 0.4 0.6 0.8 1 PSL-Personality True Predicted 0 20 40 60 0.2 0.4 0.6 0.8 1 0.2 0.4 0.6 0.8 1 True PSL-TriadSim Predicted 0 50 100 150 0.2 0.4 0.6 0.8 1 0 0.2 0.4 0.6 0.8 True Avg-Incoming Predicted 0 20 40 60 0.2 0.4 0.6 0.8 1 0.02 0.04 0.06 0.08 0.1 0.12 EigenTrust True Predicted 0 100 200 300 0.2 0.4 0.6 0.8 1 0.2 0.4 0.6 0.8 1 True PSL-Similarity Predicted 0 50 100 150 0.2 0.4 0.6 0.8 1 0.2 0.4 0.6 0.8 1 True PSL-PersSim Predicted 0 50 100 0.2 0.4 0.6 0.8 1 0 0.2 0.4 0.6 0.8 True Avg-Outgoing Predicted 0 20 40 60 80 100 0.2 0.4 0.6 0.8 1 0.2 0.4 0.6 0.8 1 True TidalTrust Predicted 0 50 100 150 Histograms of predicted trust vs. ground truth Average scores of trust predictions (MAE: average error, : Kendall-tau, : Spearman's rank correlation) Discussion FilmTrust: social movie recommendation service Records users' trust scores for other users (1 to 10), users' ratings for movies (1 to 5) 500 users in largest connected component, 1574 user-user trust ratings PSL is a flexible tool for exploring social trust analysis Soft first-order logic is convenient and effective for modeling the amount of trust between individuals Future work: latent trust for social sentiment, larger-scale models of group trust and joint group membership Supported by IARPA contract D12PC00337 and the Research Foundation Flanders (FWO-Vlaanderen) Four-fold cross validation for learning weights and parameters PSL trust models sameTraits(A, B ) ) trusts(A, B ), ¬sameTraits(A, B ) ) ¬trusts(A, B ), trusts(A, B ) sameTraits(B , C ) ) trusts(A, C ), ¬trusts(A, B ) sameTraits(B , C ) ) ¬trusts(A, C ), trusts(A, C ) sameTraits(A, B ) ) trusts(B , C ), ¬trusts(A, C ) sameTraits(A, B ) ) ¬trusts(B , C ). trusts(A, B ) ) trusting(A), ¬trusts(A, B ) ) ¬trusting(A), trusts(A, B ) ) trustworthy(B ), ¬trusts(A, B ) ) ¬trustworthy(B ), trusting(A) trustworthy(B ) ) trusts(A, B ), ¬trusting(A) ¬trustworthy(B ) ) ¬trusts(A, B ), trusting(A) ) trusts(A, B ), ¬trusting(A) ) ¬trusts(A, B ), trustworthy(B ) ) trusts(A, B ), ¬trustworthy(B ) ) ¬trusts(A, B ). trusts(A, B ) trusts(B , C ) ) trusts(A, C ), trusts(A, B ) ¬trusts(B , C ) ) ¬trusts(A, C ), ¬trusts(A, B ) ¬trusts(B , C ) ) trusts(A, C ), trusts(A, B ) trusts(A, C ) ) trusts(B , C ), trusts(A, C ) trusts(B , C ) ) trusts(A, B ), trusts(A, B ) ) trusts(B , A), ¬trusts(A, B ) ) ¬trusts(B , A). PSL-Triadic PSL-Personality PSL-Similarity We model trust with logical models inspired by ideas from social science Weight learning via voted perceptron maximum likelihood models triadic closure, which implies the transitivity of trust, such that individuals determine whom to trust based on the opinions of those they trust. maintains predicates for how trusting and trustworthy users are. Trusting individuals are prone to trust others, while trustworthy individuals are likely to receive trust. models the correlation between feature similarity and trust, where the sameTraits predicate is computed as a normalized cosine similarity of users' local features. http://linqs.cs.umd.edu

Upload: others

Post on 27-Sep-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Probabilistic Soft Logic for Trust Analysis in Social Networks I Speople.cs.vt.edu/~bhuang/papers/huang-starai12_poster.pdftransitivity of trust, such that individuals determine whom

S I

Q

L

N

Probabilistic Soft Logic for Trust Analysis in Social NetworksBert Huang, Angelika Kimmig, Lise Getoor, and Jennifer Golbeck

Department of Computer ScienceUniversity of Maryland, College Park

Trust Analysis

Probabilistic Soft Logic

FilmTrust Experiment

Data description and experiment setup

Modeling trust benefits many applications: recommendation engines, viral marketing, reputation management

Trust analysis in social networks is a rich application for statistical relational A.I. and learning (SRL)

We analyze trust using the SRL tool Probabilistic Soft Logic

Declarative language for probabilistic models using first-order logic (FOL) syntax

Many existing methods for computational trust analysis, e.g., TidalTrust, EigenTrust, Advogato.

Truth values are relaxed to soft truth in [0,1]

Logical operators relaxed via Lukasiewicz t-norm:

Mechanisms for incorporating similarity functions and reasoning about sets

MAP inference is a convex optimizationEfficient sampling method for marginal inference

a˜^b = max{0, a+ b � 1},a˜_b = min{a+ b, 1},¬̃a = 1� a,

Continuous probability distribution over truth values:

Pr(x) =

1Z exp

0

@�X

r2P

X

g2G(r)

wr (1� tg (x))

1

A

tg (x)wr

P : the PSL program : set of all groundings of rule r G (r)

: weight of rule r : truth value of grounding g

Method MAE � �PSL-Triadic 0.2985 0.0717 0.0944

PSL-Personality 0.2586 0.1681 0.2265PSL-Similarity 0.2198 0.1089 0.1539PSL-TriadPers 0.2509 0.1801 0.2417PSL-TriadSim 0.2146 0.1197 0.1688PSL-PersSim 0.2154 0.1771 0.2444

PSL-TriadPersSim 0.2246 0.1907 0.2598sametraits 0.2461 0.0531 0.0739

Avg-Incoming 0.3751 0.0120 0.0167Avg-Outgoing 0.3327 0.1088 0.1463Avg-Global 0.2086 – –EigenTrust 0.6729 -0.0229 -0.0291TidalTrust 0.2387 0.0478 0.0649

0.2 0.4 0.6 0.8 1

0.2

0.4

0.6

0.8

1

True

PSL!Triadic

Pre

dic

ted

0

20

40

60

80

0.2 0.4 0.6 0.8 1

0.2

0.4

0.6

0.8

True

PSL!TriadPers

Pre

dic

ted

0

20

40

60

0.2 0.4 0.6 0.8 1

0.2

0.4

0.6

0.8

1

True

PSL!TriadPersSim

Pre

dic

ted

0

20

40

60

80

0.2 0.4 0.6 0.8 1

0.2

0.4

0.6

0.8

1

True

sameTraits

Pre

dic

ted

0

50

100

0.2 0.4 0.6 0.8 1

0.2

0.4

0.6

0.8

1

PSL!Personality

True

Pre

dic

ted

0

20

40

60

0.2 0.4 0.6 0.8 1

0.2

0.4

0.6

0.8

1

True

PSL!TriadSim

Pre

dic

ted

0

50

100

150

0.2 0.4 0.6 0.8 10

0.2

0.4

0.6

0.8

True

Avg!Incoming

Pre

dic

ted

0

20

40

60

0.2 0.4 0.6 0.8 1

0.02

0.04

0.06

0.08

0.1

0.12

EigenTrust

True

Pre

dic

ted

0

100

200

300

0.2 0.4 0.6 0.8 1

0.2

0.4

0.6

0.8

1

True

PSL!Similarity

Pre

dic

ted

0

50

100

150

0.2 0.4 0.6 0.8 1

0.2

0.4

0.6

0.8

1

True

PSL!PersSim

Pre

dic

ted

0

50

100

0.2 0.4 0.6 0.8 10

0.2

0.4

0.6

0.8

True

Avg!Outgoing

Pre

dic

ted

0

20

40

60

80

100

0.2 0.4 0.6 0.8 1

0.2

0.4

0.6

0.8

1

True

TidalTrust

Pre

dic

ted

0

50

100

150

Figure 4: Histograms of predicted trust values over true trust annotations. The brightness of each grid cell indicates thenumber of edges with the corresponding true trust (horizontal axis) and predicted trust (vertical axis). More mass along thediagonal indicates predictions consistent with the ground truth.

malize to [0, 1], making each trust value interpretable as asoft truth value. Similarly, users rate movies with a recom-mendation rating between 1 and 5. There are 500 users inthe largest connected component, among which there are1574 total user-to-user trust values. The trust values aredirected and thus not symmetric. For each pair of userswithin a two-hop distance, we compute their soft similaritySAMETRAITS via a normalized inner product of their over-lapping rated-entry vectors. Let r(A,M) denote the ratinggiven by user A for movie M , scaled to the interval [0, 1](by dividing by 5). If a user did not rate a movie, the scorefor that user-movie pair is 0. We compute SAMETRAITSwith the formula

SAMETRAITS(A,B) =

PM2Movies

r(A,M)r(B,M)PM2Movies

I(r(A,M)r(B,M) > 0)

.

The task we consider is collective prediction of trust val-ues. We generate four folds where, in each fold, 1/4 ofthe trust values are hidden at random. The prediction al-gorithm can use the remaining 3/4 of the trust values tolearn parameters for a model and perform inference of theunknown trust values. PSL learns weights for the rules ineach given model from these observed trust values. Weconsider a transductive prediction setting, in which the in-ference algorithm is given which pairs of users rated each

other (i.e., the full network structure), but is not given theactual trust values on the held-out 1/4.

4.1 Baselines

We now discuss a range of baselines, including two popularapproaches from the literature, to which we compare ourPSL models from Section 3.1.

As a simple baseline, we consider predicting the averagetrust across all observed trust values for every prediction(denoted in tables as Avg-Global). Clearly, the global av-erage is not very informative, so we additionally consider anode-centric metric where we compute average trust valuesfor each node (and only use the global average if no trustvalues are available for a node). We use two variants ofthis: the average of incoming trust values (Avg-Incoming)and the average of outgoing trust values (Avg-Outgoing).

We include the SAMETRAITS predicate itself as a base-line, since PSL-Similarity heavily depends on this similar-ity function.

EigenTrust (Kamvar et al., 2003) is a global metric analo-gous to PageRank (Page et al., 1999) that computes a trustvalue for each node by finding the left principle eigenvec-tor of a normalized trust matrix. The trust matrix is normal-ized such that each row sums to 1.0, making the normalized

Histograms of predicted trust vs. ground truth

Average scores of trust predictions

(MAE: average error, : Kendall-tau, : Spearman's rank correlation)

Discussion

FilmTrust: social movie recommendation serviceRecords users' trust scores for other users (1 to 10), users' ratings for movies (1 to 5)500 users in largest connected component, 1574 user-user trust ratings

PSL is a flexible tool for exploring social trust analysis

Soft first-order logic is convenient and effective for modeling the amount of trust between individuals

Future work: latent trust for social sentiment, larger-scale models of group trust and joint group membership

Supported by IARPA contract D12PC00337 and the Research Foundation Flanders (FWO-Vlaanderen)

Four-fold cross validation for learning weights and parameters

PSL trust models

sameTraits(A,B)) trusts(A,B),¬sameTraits(A,B)) ¬trusts(A,B),

trusts(A,B)� sameTraits(B ,C )) trusts(A,C ),¬trusts(A,B)� sameTraits(B ,C )) ¬trusts(A,C ),trusts(A,C )� sameTraits(A,B)) trusts(B ,C ),

¬trusts(A,C )� sameTraits(A,B)) ¬trusts(B ,C ).

trusts(A,B)) trusting(A),¬trusts(A,B)) ¬trusting(A),trusts(A,B)) trustworthy(B),

¬trusts(A,B)) ¬trustworthy(B),trusting(A)� trustworthy(B)) trusts(A,B),

¬trusting(A)�¬trustworthy(B)) ¬trusts(A,B),trusting(A)) trusts(A,B),

¬trusting(A)) ¬trusts(A,B),trustworthy(B)) trusts(A,B),

¬trustworthy(B)) ¬trusts(A,B).

trusts(A,B)� trusts(B ,C )) trusts(A,C ),trusts(A,B)�¬trusts(B ,C )) ¬trusts(A,C ),

¬trusts(A,B)�¬trusts(B ,C )) trusts(A,C ),trusts(A,B)� trusts(A,C )) trusts(B ,C ),trusts(A,C )� trusts(B ,C )) trusts(A,B),

trusts(A,B)) trusts(B ,A),¬trusts(A,B)) ¬trusts(B ,A).

PSL-Triadic

PSL-Personality

PSL-Similarity

We model trust with logical models inspired by ideas from social science

Weight learning via voted perceptron maximum likelihood

⇢⌧

models triadic closure, which implies the transitivity of trust, such that individuals determine whom to trust based on the opinions of those they trust.

maintains predicates for how trusting and trustworthy users are. Trusting individuals are prone to trust others, while trustworthy individuals are likely to receive trust.

models the correlation between feature similarity and trust, where the sameTraits predicate is computed as a normalized cosine similarity of users' local features.

http://linqs.cs.umd.edu