multilayer perceptron based recommender systemstat.snu.ac.kr/idea/seminar/20171122/mlp.pdf ·...

31
Multilayer Perceptron based Recommender System Jongjin Lee Presented by Jongjin Lee 2017.11.22 Jongjin Lee (SNU) MLP based Recommender System 2017.11.22 1 / 31

Upload: others

Post on 21-Jul-2020

26 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Multilayer Perceptron based Recommender Systemstat.snu.ac.kr/idea/seminar/20171122/MLP.pdf · Multilayer Perceptron based Recommender System JongjinLee Presented by Jongjin Lee 2017.11.22

Multilayer Perceptron based Recommender System

Jongjin Lee

Presented by Jongjin Lee

2017.11.22

Jongjin Lee (SNU) MLP based Recommender System 2017.11.22 1 / 31

Page 2: Multilayer Perceptron based Recommender Systemstat.snu.ac.kr/idea/seminar/20171122/MLP.pdf · Multilayer Perceptron based Recommender System JongjinLee Presented by Jongjin Lee 2017.11.22

Overview

1 Introduction: MLP, Recommender system

2 Neural Collaborative Filtering

3 CCCFNet

4 Wide & Deep Learning

5 DeepFM

Jongjin Lee (SNU) MLP based Recommender System 2017.11.22 2 / 31

Page 3: Multilayer Perceptron based Recommender Systemstat.snu.ac.kr/idea/seminar/20171122/MLP.pdf · Multilayer Perceptron based Recommender System JongjinLee Presented by Jongjin Lee 2017.11.22

Introduction: MLP

Figure: Multilayer Perceptron

Jongjin Lee (SNU) MLP based Recommender System 2017.11.22 3 / 31

Page 4: Multilayer Perceptron based Recommender Systemstat.snu.ac.kr/idea/seminar/20171122/MLP.pdf · Multilayer Perceptron based Recommender System JongjinLee Presented by Jongjin Lee 2017.11.22

Introduction: MLP

Concise but effective model.

Widely used in many area

Approximate any measurable function to any desired degree ofaccuracy.

Jongjin Lee (SNU) MLP based Recommender System 2017.11.22 4 / 31

Page 5: Multilayer Perceptron based Recommender Systemstat.snu.ac.kr/idea/seminar/20171122/MLP.pdf · Multilayer Perceptron based Recommender System JongjinLee Presented by Jongjin Lee 2017.11.22

Introduction : Recommender System

Recommendation models are mainly categorized into...

Collaborative Filtering Recommender System

→ By learning from user - item historical interactionsContent-based Recommender System

→ Using item’s and user’s auxiliary information.

Hybrid Recommender System

Capture the non-linear relationships between users and item by MLP

Jongjin Lee (SNU) MLP based Recommender System 2017.11.22 5 / 31

Page 6: Multilayer Perceptron based Recommender Systemstat.snu.ac.kr/idea/seminar/20171122/MLP.pdf · Multilayer Perceptron based Recommender System JongjinLee Presented by Jongjin Lee 2017.11.22

Neural Collaborative Filtering

Figure: Neural Collaborative Filtering

Jongjin Lee (SNU) MLP based Recommender System 2017.11.22 6 / 31

Page 7: Multilayer Perceptron based Recommender Systemstat.snu.ac.kr/idea/seminar/20171122/MLP.pdf · Multilayer Perceptron based Recommender System JongjinLee Presented by Jongjin Lee 2017.11.22

Neural Collaborative Filtering

Y = {yui}(u=1,...,M,i=1,...,N), (R = {rui}(u=1,...,M,i=1,...,N)),User-Item Rating Matrix

suseru , One-hot identifier of user u, (u=1,...,M)

s itemi , One-hot identifier of item i, (i=1,...,N)

P ∈ RM×K , Q ∈ RN×K

,the latent factor matrix for users and items, respectively

Jongjin Lee (SNU) MLP based Recommender System 2017.11.22 7 / 31

Page 8: Multilayer Perceptron based Recommender Systemstat.snu.ac.kr/idea/seminar/20171122/MLP.pdf · Multilayer Perceptron based Recommender System JongjinLee Presented by Jongjin Lee 2017.11.22

Learning from implicit feedback

User-Item interaction : explicit feedback vs implicit feedback

Define interaction(rating) matrix Y = {yui}(u=1,...,M,i=1,...,N)

yui ={

1, if interaction (user u, item i) is observed;0, otherwise.

Implicit data provides only noisy signalsScarcity of negative feedbackThe problem of estimating the scores of unobserved entries Y

Jongjin Lee (SNU) MLP based Recommender System 2017.11.22 8 / 31

Page 9: Multilayer Perceptron based Recommender Systemstat.snu.ac.kr/idea/seminar/20171122/MLP.pdf · Multilayer Perceptron based Recommender System JongjinLee Presented by Jongjin Lee 2017.11.22

Matrix Factorization

Predict rating matrix by introducing the latent vector for user anditem.Let pu = PT suser

u ∈ RK , and qi = QT s itemi ∈ RK denote the latent

vector for user u and item i

MF estimates an interaction yui as the inner product of uu, and vi

yui = f (u, i | pu, qi) = pTu qi

Matrix Factorization have limitations

Jongjin Lee (SNU) MLP based Recommender System 2017.11.22 9 / 31

Page 10: Multilayer Perceptron based Recommender Systemstat.snu.ac.kr/idea/seminar/20171122/MLP.pdf · Multilayer Perceptron based Recommender System JongjinLee Presented by Jongjin Lee 2017.11.22

Neural Collaborative Filtering

Figure: Neural Collaborative Filtering

Jongjin Lee (SNU) MLP based Recommender System 2017.11.22 10 / 31

Page 11: Multilayer Perceptron based Recommender Systemstat.snu.ac.kr/idea/seminar/20171122/MLP.pdf · Multilayer Perceptron based Recommender System JongjinLee Presented by Jongjin Lee 2017.11.22

Neural Collaborative Filtering

The prediction rule of NCF is formulated as follows:

yui = f (UT · suseru , V T · s item

i | U, V , θ)

f(·): multilayer perceptron, θ: parameters of this network.

The (general) loss function is

L =∑

(u,i)∈O∪O−

wui(yui − yui)2

O: observed interaction,O−: all(or sampled from) unobserved interactionswui : weight(a hyperparameter)

Jongjin Lee (SNU) MLP based Recommender System 2017.11.22 11 / 31

Page 12: Multilayer Perceptron based Recommender Systemstat.snu.ac.kr/idea/seminar/20171122/MLP.pdf · Multilayer Perceptron based Recommender System JongjinLee Presented by Jongjin Lee 2017.11.22

Neural Collaborative Filtering

For implicit feedback, Use Logistic or Probit function

The loss function is

L =∑

(u,i)∈O∪O−

yui logyui + (1 − yui)log(1 − yui)

O: observed interaction,O−: all(or sampled from) unobserved interactionsO− : uniformly sample them from unobserved interactions.Minimize L by Stochastic gradient desent(SGD)

Jongjin Lee (SNU) MLP based Recommender System 2017.11.22 12 / 31

Page 13: Multilayer Perceptron based Recommender Systemstat.snu.ac.kr/idea/seminar/20171122/MLP.pdf · Multilayer Perceptron based Recommender System JongjinLee Presented by Jongjin Lee 2017.11.22

Fusion of GMF and NCF

Figure: Neural matrix facorization model

Generalized Matrix Factorization(GMF):yGMF = fout(wT (yMF )) = fout(wT (pT

u qi))Jongjin Lee (SNU) MLP based Recommender System 2017.11.22 13 / 31

Page 14: Multilayer Perceptron based Recommender Systemstat.snu.ac.kr/idea/seminar/20171122/MLP.pdf · Multilayer Perceptron based Recommender System JongjinLee Presented by Jongjin Lee 2017.11.22

CCCFNet

Figure: CCCFNet

Jongjin Lee (SNU) MLP based Recommender System 2017.11.22 14 / 31

Page 15: Multilayer Perceptron based Recommender Systemstat.snu.ac.kr/idea/seminar/20171122/MLP.pdf · Multilayer Perceptron based Recommender System JongjinLee Presented by Jongjin Lee 2017.11.22

CCCFNet

Cross-domain Content-boosted Collaborative Filtering neuralNetwork.

Combine collaborative filtering and content filtering.

Introduce multiple-domain, use knowledge from relatively denseauxiliary domains to overcome sparsity problem.

Consistently, outperforms several baseline method.

Jongjin Lee (SNU) MLP based Recommender System 2017.11.22 15 / 31

Page 16: Multilayer Perceptron based Recommender Systemstat.snu.ac.kr/idea/seminar/20171122/MLP.pdf · Multilayer Perceptron based Recommender System JongjinLee Presented by Jongjin Lee 2017.11.22

CCCFNet

Suppose we have two domains: Target (1) & Auxiliary(2)

Y (1)N×M(1) , Y (2)

N×M(2) , Rating matrix

A(1)M(1)×T (1) , A(2)

M(2)×T (2) , Content matrix (with T (1), T (2) attribute)

PN×K , Q(1)M(1)×K , Q(2)

M(2)×K , Latent factor matrix for users anditemsUN×L, V (1)

T (1)×L, V (2)T (2)×L, Latent factor matrix for content

preference, item contentθ = (P, U, Q(1), Q(2), V (1), V (2))

Jongjin Lee (SNU) MLP based Recommender System 2017.11.22 16 / 31

Page 17: Multilayer Perceptron based Recommender Systemstat.snu.ac.kr/idea/seminar/20171122/MLP.pdf · Multilayer Perceptron based Recommender System JongjinLee Presented by Jongjin Lee 2017.11.22

CCCFNet

Minimize the followin equation:

L = 12

∑u,i

(y (1)ui − pu � qi

(1) − uu � vi(1))2

+ λ12

∑u,i

(y (2)ui − pu � qi

(2) − uu � vi(2))2 + λ∗

2∑

k(∥θ∥)2

pu = PT suseru , qi = QT s item

u , uu = UT suseru , vi = row(A, i)V T

Jongjin Lee (SNU) MLP based Recommender System 2017.11.22 17 / 31

Page 18: Multilayer Perceptron based Recommender Systemstat.snu.ac.kr/idea/seminar/20171122/MLP.pdf · Multilayer Perceptron based Recommender System JongjinLee Presented by Jongjin Lee 2017.11.22

CCCFNet

Figure: CCCFNet

Jongjin Lee (SNU) MLP based Recommender System 2017.11.22 18 / 31

Page 19: Multilayer Perceptron based Recommender Systemstat.snu.ac.kr/idea/seminar/20171122/MLP.pdf · Multilayer Perceptron based Recommender System JongjinLee Presented by Jongjin Lee 2017.11.22

Wide & Deep Learning

Figure: Wide & Deep Learning

Jongjin Lee (SNU) MLP based Recommender System 2017.11.22 19 / 31

Page 20: Multilayer Perceptron based Recommender Systemstat.snu.ac.kr/idea/seminar/20171122/MLP.pdf · Multilayer Perceptron based Recommender System JongjinLee Presented by Jongjin Lee 2017.11.22

Wide & Deep Learning

Wide Learning ComponentsSingle layer PerceptronMemorization→ Catching the direct features from historical data→ Loosely defined: Learning the frequent co-occurrence of items orfeatures, and exploiting the correlation in the historical data

Deep Learning ComponentsMultilayer PerceptronGeneralization→ Catching the generalized by producing more general and abstractrepresentations→ Loosely defined: Learning the transitivity of correlation and exploresnew feature combinations that have never or rarely occurred in thepast.

Jongjin Lee (SNU) MLP based Recommender System 2017.11.22 20 / 31

Page 21: Multilayer Perceptron based Recommender Systemstat.snu.ac.kr/idea/seminar/20171122/MLP.pdf · Multilayer Perceptron based Recommender System JongjinLee Presented by Jongjin Lee 2017.11.22

Wide & Deep Learning

The wide learning part

y = W Twide(x , ϕ(x)) + b

ϕ(x) is cross-product transformation which is defined as:

ϕk(x) =d∏

i=1x cki

i , cki ∈ {0, 1}

cki ={

1, if i-th feature is part of the k-th transformation ϕ(x)0, otherwise.

Cross-product transformation is manually designed

Jongjin Lee (SNU) MLP based Recommender System 2017.11.22 21 / 31

Page 22: Multilayer Perceptron based Recommender Systemstat.snu.ac.kr/idea/seminar/20171122/MLP.pdf · Multilayer Perceptron based Recommender System JongjinLee Presented by Jongjin Lee 2017.11.22

Wide & Deep Learning

The deep learning part

α(l+1) = f (W (l)deepα(l) + b)

The wide & deep learning model is attained by fusing these twomodels:

P(yui = 1 | x) = σ(W Twide{x , ϕ(x)} + W (T )

deepα(lTf ) + bias)

Jongjin Lee (SNU) MLP based Recommender System 2017.11.22 22 / 31

Page 23: Multilayer Perceptron based Recommender Systemstat.snu.ac.kr/idea/seminar/20171122/MLP.pdf · Multilayer Perceptron based Recommender System JongjinLee Presented by Jongjin Lee 2017.11.22

Wide & Deep Learning

Figure: Wide & Deep Learning

Jongjin Lee (SNU) MLP based Recommender System 2017.11.22 23 / 31

Page 24: Multilayer Perceptron based Recommender Systemstat.snu.ac.kr/idea/seminar/20171122/MLP.pdf · Multilayer Perceptron based Recommender System JongjinLee Presented by Jongjin Lee 2017.11.22

DeepFM

Figure: DeepFM

Jongjin Lee (SNU) MLP based Recommender System 2017.11.22 24 / 31

Page 25: Multilayer Perceptron based Recommender Systemstat.snu.ac.kr/idea/seminar/20171122/MLP.pdf · Multilayer Perceptron based Recommender System JongjinLee Presented by Jongjin Lee 2017.11.22

DeepFM

Deep Factorization Machine

Enlightend by Wide & Deep Learning

Integrates Factorization Machine and MLP

Capture the high-order feature interactions via deep neural network.

Capture the low-order feature interactions via factorization machine.

Does not require tedious feature engineering.

Jongjin Lee (SNU) MLP based Recommender System 2017.11.22 25 / 31

Page 26: Multilayer Perceptron based Recommender Systemstat.snu.ac.kr/idea/seminar/20171122/MLP.pdf · Multilayer Perceptron based Recommender System JongjinLee Presented by Jongjin Lee 2017.11.22

DeepFM

x is an m-fields data consisting of pairs(u,i), and d-dimensional vector

x may include categorical fields(gender, location...) and continuousfields (age...)x = [xfield1 ,xfield2 ,· · · , xfieldm ] is d-dimensional vector.

y = σ(yFM(x), yMLP(x))

FM component and Deep component, that share the same input.

Jongjin Lee (SNU) MLP based Recommender System 2017.11.22 26 / 31

Page 27: Multilayer Perceptron based Recommender Systemstat.snu.ac.kr/idea/seminar/20171122/MLP.pdf · Multilayer Perceptron based Recommender System JongjinLee Presented by Jongjin Lee 2017.11.22

DeepFM

Figure: FM Component

Jongjin Lee (SNU) MLP based Recommender System 2017.11.22 27 / 31

Page 28: Multilayer Perceptron based Recommender Systemstat.snu.ac.kr/idea/seminar/20171122/MLP.pdf · Multilayer Perceptron based Recommender System JongjinLee Presented by Jongjin Lee 2017.11.22

DeepFM

It can capture order-2 feature interarctions much more effectivelythan previous approach.(especially the dataset is sparse)

yFM =< w , x > +m∑

i=1

m∑j=i+1

< Vi , Vj > xi � xj

, w ∈ Rdand Vi ∈ Rk

< w , x > reflects order-1 feature interactions, and summation termreflects order-2 feature interactions.

Jongjin Lee (SNU) MLP based Recommender System 2017.11.22 28 / 31

Page 29: Multilayer Perceptron based Recommender Systemstat.snu.ac.kr/idea/seminar/20171122/MLP.pdf · Multilayer Perceptron based Recommender System JongjinLee Presented by Jongjin Lee 2017.11.22

DeepFM

Figure: Deep Component

Jongjin Lee (SNU) MLP based Recommender System 2017.11.22 29 / 31

Page 30: Multilayer Perceptron based Recommender Systemstat.snu.ac.kr/idea/seminar/20171122/MLP.pdf · Multilayer Perceptron based Recommender System JongjinLee Presented by Jongjin Lee 2017.11.22

DeepFM

Figure: DeepFM

y = σ(< w , x > +m∑

i=1

m∑j=i+1

< Vi , Vj > xi � xj + W (T )deepα(lf ) + bias)

Jongjin Lee (SNU) MLP based Recommender System 2017.11.22 30 / 31

Page 31: Multilayer Perceptron based Recommender Systemstat.snu.ac.kr/idea/seminar/20171122/MLP.pdf · Multilayer Perceptron based Recommender System JongjinLee Presented by Jongjin Lee 2017.11.22

The End

Jongjin Lee (SNU) MLP based Recommender System 2017.11.22 31 / 31