Latent Friend Mining from Blog Data (ICDM’06)

Upload: calvin-jackson

Post on 18-Jan-2016

Outline

About the Blog
Problem statement
Three kinds of Approaches
Experiment
Conclusion

About the Blog

Our goal in this research is to develop methods for mining potential relationships among people. We call this problem “latent friend detection”.

Problem statement

Our goal in this paper is to find latent friends from the blog data.

In their entries, bloggers give basic information about themselves as well as much other interesting information. Taking MSN Spaces as an example, bloggers may post their favorite songs, sports, and pictures on their blogs.

Three kinds of Approaches (Method 1)

Cosine Similarity-based Method

A straightforward solution to the latent friend detection problem is to calculate the similarity between the contents of the entries from two bloggers.

Three kinds of Approaches (Method 1 cont.)

Cosine Similarity-based Method

$$\mathrm{sim}(i, j) = \frac{\sum_{k} tf_{ik}\, tf_{jk}}{\sqrt{\sum_{k} tf_{ik}^{2}}\; \sqrt{\sum_{k} tf_{jk}^{2}}}$$

where $tf_{ik}$ is the term frequency of term k in blogger i’s blog. Given a blogger i, after calculating the similarity between blogger i and every other blogger, we can sort the bloggers by similarity. The top bloggers in the list can then be recommended as blogger i’s latent friends.

Three kinds of Approaches (Method 1 cont.)

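Term k      1  2  3  4  ...  k
Blogger i   3  0  6  7  ...  2
Blogger j   1  5  4  2  ...  5

$$\mathrm{sim}(i, j) = \frac{(3 \times 1) + (0 \times 5) + (6 \times 4) + (7 \times 2) + \dots + (2 \times 5)}{\sqrt{(9 + 0 + 36 + 49 + \dots + 4) \times (1 + 25 + 16 + 4 + \dots + 25)}}$$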
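The cosine similarity and ranking steps of Method 1 can be sketched in a few lines of Python. This is a minimal illustration, not the authors' implementation: the function names `cosine_similarity` and `top_latent_friends`, the whitespace tokenization, and the toy blog texts are all assumptions made here. The last lines apply the same function to only the visible entries of the example vectors, leaving out the elided terms.

```python
import math
from collections import Counter


def cosine_similarity(tf_a, tf_b):
    """Cosine similarity between two term-frequency mappings (term -> count)."""
    dot = sum(count * tf_b.get(term, 0) for term, count in tf_a.items())
    norm_a = math.sqrt(sum(c * c for c in tf_a.values()))
    norm_b = math.sqrt(sum(c * c for c in tf_b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # a blogger with no terms is treated as similar to nobody
    return dot / (norm_a * norm_b)


def top_latent_friends(target, blogs, n=3):
    """Rank every other blogger by similarity to `target`, highest first."""
    # Assumption: naive lowercase + whitespace tokenization.
    target_tf = Counter(blogs[target].lower().split())
    scored = [
        (other, cosine_similarity(target_tf, Counter(text.lower().split())))
        for other, text in blogs.items()
        if other != target
    ]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:n]


# Hypothetical toy data, not taken from the paper.
blogs = {
    "alice": "football football music travel",
    "bob": "football music music cooking",
    "carol": "cooking recipes baking cooking",
}
print(top_latent_friends("alice", blogs, n=2))

# Visible entries of the example vectors only; the "..." terms are omitted.
visible_i = dict(enumerate([3, 0, 6, 7, 2]))
visible_j = dict(enumerate([1, 5, 4, 2, 5]))
print(round(cosine_similarity(visible_i, visible_j), 3))
```

Storing term frequencies as sparse mappings rather than dense vectors keeps the dot product proportional to the number of distinct terms a blogger actually uses, which matters when the vocabulary is large.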

Three kinds of Approaches (Method 2)

Topic Model based Method

We define the distance between blogger i and blogger j in terms of the topic distributions conditioned on each blogger, where T is the number of topics and θit is the probability of topic t conditioned on blogger i.
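The distance formula itself did not survive in this transcript. One common way to compare two topic distributions over T topics is the symmetric Kullback-Leibler divergence, and the sketch below uses it purely as an assumed stand-in; it should not be read as the definition the authors give. The function name `symmetric_kl`, the smoothing constant `eps`, and the example distributions are hypothetical.

```python
import math


def symmetric_kl(theta_i, theta_j, eps=1e-12):
    """Symmetric KL divergence between two topic distributions of length T."""
    if len(theta_i) != len(theta_j):
        raise ValueError("both bloggers need the same number of topics T")
    total = 0.0
    for p, q in zip(theta_i, theta_j):
        p = max(p, eps)  # guard against log(0) and division by zero
        q = max(q, eps)
        total += p * math.log(p / q) + q * math.log(q / p)
    return total


# Hypothetical topic distributions over T = 3 topics, one per blogger.
theta = {
    "alice": [0.7, 0.2, 0.1],
    "bob": [0.6, 0.3, 0.1],
    "carol": [0.1, 0.1, 0.8],
}
for other in ("bob", "carol"):
    print(other, symmetric_kl(theta["alice"], theta[other]))
```

A smaller value means the two bloggers spread their writing across topics in a more similar way, so candidates would be sorted in ascending order of distance, the reverse of the similarity ranking in Method 1.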

Three kinds of Approaches (Method 3)

Two-Level Similarity-based Method

Coarse Similarity Calculation

Lifestyle

Food: Chinese food, Western food

Three kinds of Approaches (Method 3 cont.)

Two-Level Similarity-based Method

Finer Similarity at Literal Level

Experiment

Conclusion

(1) The paper puts forward a new research problem: finding latent friends from blog data.

(2) A novel two-level similarity-based approach is proposed to solve the problem effectively and efficiently. It takes into account time-related content information as well as topic-distribution information.
