collaborative filtering: latent variable model

18
1 Collaborative Filtering: Latent Variable Model LIU Tengfei Computer Science and Engineering Departm ent April 13, 2011

Upload: gen

Post on 13-Jan-2016

44 views

Category:

Documents


5 download

DESCRIPTION

Collaborative Filtering: Latent Variable Model. LIU Tengfei Computer Science and Engineering Department April 13, 2011. Outline. Overview of CF approaches Model based approach-latent variable model Probabilistic latent semantic analysis (PLSA) Other latent variable models Summary. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Collaborative Filtering: Latent Variable Model

1

Collaborative Filtering: Latent Variable Model

LIU Tengfei

Computer Science and Engineering DepartmentApril 13, 2011

Page 2: Collaborative Filtering: Latent Variable Model

2

Outline Overview of CF approaches

Model based approach-latent variable model Probabilistic latent semantic analysis (PLSA) Other latent variable models

Summary

Page 3: Collaborative Filtering: Latent Variable Model

3

Overview of CF Approaches

Page 4: Collaborative Filtering: Latent Variable Model

4

Overview of CF Approaches CF Categories

Memory-based CF Conduct certain forms of nearest neighbor search in

order to predict the rating for particular use-item pair.

Model-based CF Train a compact model that explains the given data

so that ratings could be predicted via the model.

Page 5: Collaborative Filtering: Latent Variable Model

5

Outline Overview of CF approaches

Model based approach Probabilistic latent semantic analysis (PLSA) Other latent variable model

Summary

Page 6: Collaborative Filtering: Latent Variable Model

6

Model based approach Question:

What is the shortcomings of memory based methods?

Reasons: Suboptimal solution problem Little knowledge learned from data Computationally expensive in local-neighbor

search ……

Page 7: Collaborative Filtering: Latent Variable Model

7

Probabilistic Latent Semantic Analysis The Problem

We want to predict the rating r that user u may assign to item i

Why latent variable model? Consider a simple case:

User x like/dislike item y “because of” some reason The reason can not be observed, but may exist We introduce a latent variable to model it

Page 8: Collaborative Filtering: Latent Variable Model

8

Probabilistic Latent Semantic Analysis Question: what rating that user u is likely to give to item i?

Can we describe it with probability? The probability that the rating a user give to an item is

decomposed into a sum of products.

z is the latent variable

Probability that class z (can be seen as community in CF) would assign score r to item i. Mixing proportion

Page 9: Collaborative Filtering: Latent Variable Model

9

Probabilistic Latent Semantic Analysis Intuitive Graph Representation

Page 10: Collaborative Filtering: Latent Variable Model

10

Probabilistic Latent Semantic Analysis Model as a Gaussian distribution

Mixing proportion can be modeled as a categorical distribution

Page 11: Collaborative Filtering: Latent Variable Model

11

Probabilistic Latent Semantic Analysis To make predictions, we compute the expected rating

Page 12: Collaborative Filtering: Latent Variable Model

12

Probabilistic Latent Semantic Analysis Model parameters can be learnt by maximizing the

following log likelihood of observed data

This can be readily solved using EM algorithm

Page 13: Collaborative Filtering: Latent Variable Model

13

Probabilistic Latent Semantic Analysis Question 1:

how to learn the model parameters by EM algorithm?

Question 2: how to understand EM algorithm?

Page 14: Collaborative Filtering: Latent Variable Model

14

Other latent variable models Probabilistic latent preference analysis

(PLPA)

Reference: NN. Liu et al, Probabilistic Latent Preference

Analysis for Collaborative Filtering, CIKM’09

Page 15: Collaborative Filtering: Latent Variable Model

15

Outline Overview of CF approaches

Model based approach-latent variable model Probabilistic latent semantic analysis (PLSA) Other latent variable models

Summary

Page 16: Collaborative Filtering: Latent Variable Model

16

Summary CF is popular

Memory based method Advantages and shortcomings

Model based method Latent variable model

Probabilistic latent semantic analysis

Other latent variable models

Page 17: Collaborative Filtering: Latent Variable Model

17

Summary

Thank you !

Questions?

Page 18: Collaborative Filtering: Latent Variable Model

18

Reference Thomas Hofmann, Collaborative Filtering via Gaussian Probabilis

tic Latent Semantic Analysis, SIGIR 2003 Thomas Hofmann, Latent Semantic Models for Collaborative Filt

ering, In ACM Transactions on Information Systems, 2004 Abhinandan Das et al, Google News Personalization: Scalable On

line Collaborative Filtering, WWW 2007 NN. Liu et al, Probabilistic Latent Preference Analysis for Collabo

rative Filtering, CIKM’09 Xiaoyuan Su et al, A Survey of collaborative Filtering Techniques,

2009