scientific paper recommendation emphasizing each researcher’s most recent research topic
DESCRIPTION
Scientific Paper Recommendation Emphasizing Each Researcher’s Most Recent Research Topic. Kazunari Sugiyama 8 th January, 2010. Introduction. The number of published scientific papers continues to grow. Users of digital library suffer from finding papers relevant to their information needs. - PowerPoint PPT PresentationTRANSCRIPT
![Page 1: Scientific Paper Recommendation Emphasizing Each Researcher’s Most Recent Research Topic](https://reader036.vdocument.in/reader036/viewer/2022070411/568148d3550346895db5ee84/html5/thumbnails/1.jpg)
Scientific Paper Recommendation Emphasizing Each Researcher’s
Most Recent Research Topic
Kazunari Sugiyama8th January, 2010
![Page 2: Scientific Paper Recommendation Emphasizing Each Researcher’s Most Recent Research Topic](https://reader036.vdocument.in/reader036/viewer/2022070411/568148d3550346895db5ee84/html5/thumbnails/2.jpg)
2
Introduction• The number of published scientific papers continues
to grow.• Users of digital library suffer from finding papers
relevant to their information needs.• Recommendation systems are promising approach
to address each user’s interest.– Mid-level or senior researchers
• Several different research interests based on several years experience
– Junior researchers• Quite small publication list (too short to construct user profile)
![Page 3: Scientific Paper Recommendation Emphasizing Each Researcher’s Most Recent Research Topic](https://reader036.vdocument.in/reader036/viewer/2022070411/568148d3550346895db5ee84/html5/thumbnails/3.jpg)
3
Related Work
• Improvement in Ranking of Digital Library– ISI impact factor (ISI IF)
• Papers with high impact and low impact are treated equally. • Its ranking are biased towards popularity.
– Improved approach• “Focused PageRank” [Sun and Giles, ECIR’07]• “FutureRank” [Sayyadi and Getoor, SIAM-Data Mining, ‘09]• Weighted PageRank, Y-factor (product of ISI IF and weighted
PageRank) [Bollen et al., Journal of Scientometrics ‘06]• “Scientific gems” [Chen et al., Journal of Informetrics ‘07]
![Page 4: Scientific Paper Recommendation Emphasizing Each Researcher’s Most Recent Research Topic](https://reader036.vdocument.in/reader036/viewer/2022070411/568148d3550346895db5ee84/html5/thumbnails/4.jpg)
4
Related Work
• Recommendation Systems in Digital Library– Recommend citations [McNee et al., CSCW’02]– Recommend papers by combining collaborative filtering
and content-based filtering [Torres et al., JCDL’04]– Recommend paper s ranking-oriented collaborative
filtering [Yang et al., JCDL’09]
![Page 5: Scientific Paper Recommendation Emphasizing Each Researcher’s Most Recent Research Topic](https://reader036.vdocument.in/reader036/viewer/2022070411/568148d3550346895db5ee84/html5/thumbnails/5.jpg)
5
Related Work
• Construction of Robust User Profile in Recommendation Systems– Content-based approach• Frequent patterns obtained by click-history [Kim et al., ICADL’08]• News recommender system [Das et al., WWW’07], [Chu and Park, WWW’09]• Long-term search history [Shen et al., SIGIR’05], [Tan et al., KDD’06], [White et
al., SIGIR’09]
![Page 6: Scientific Paper Recommendation Emphasizing Each Researcher’s Most Recent Research Topic](https://reader036.vdocument.in/reader036/viewer/2022070411/568148d3550346895db5ee84/html5/thumbnails/6.jpg)
6
Proposed Method
• System Overview• Construction of User Profile– Junior Researchers– Mid-level or Senior Researchers
• Construction of Feature Vectors for Candidate Papers to Recommend
![Page 7: Scientific Paper Recommendation Emphasizing Each Researcher’s Most Recent Research Topic](https://reader036.vdocument.in/reader036/viewer/2022070411/568148d3550346895db5ee84/html5/thumbnails/7.jpg)
7
System Overview
Researcher
Candidate papers to recommend
userP
(1) Construct user profile from each researcher’s past papers
(2) Compute similarity between
userP
(3) Recommend papers with high similarity
),,1( tjjrecp F
1recpF to jrecpF
and
![Page 8: Scientific Paper Recommendation Emphasizing Each Researcher’s Most Recent Research Topic](https://reader036.vdocument.in/reader036/viewer/2022070411/568148d3550346895db5ee84/html5/thumbnails/8.jpg)
8
Junior Researchers’ Published Papers
1p
(‘09)
1p
References
(‘06) (‘02) (‘07)
11 refp 21 refp lrefp 1
Relation between reference papersand 1p
[No published papers in the past]
(‘09)11 refpW 21 refpW lrefpW 1
![Page 9: Scientific Paper Recommendation Emphasizing Each Researcher’s Most Recent Research Topic](https://reader036.vdocument.in/reader036/viewer/2022070411/568148d3550346895db5ee84/html5/thumbnails/9.jpg)
9
Weighting Schemes for Junior Researchers’ Published Papers
• Linear Combination (LC)• Similarity between the most recent paper and
others (SIM)• Reciprocal of the difference between
published year of the most recent paper and that of other papers (RPY)
![Page 10: Scientific Paper Recommendation Emphasizing Each Researcher’s Most Recent Research Topic](https://reader036.vdocument.in/reader036/viewer/2022070411/568148d3550346895db5ee84/html5/thumbnails/10.jpg)
10
1p 2p ip np
(‘02) (‘03) (‘05) (‘09)
ip
References
(‘06) (‘07) (‘09)
ipcp 1ipcp 2
ik pcp
(‘03) (‘01) (‘04)
1refip 2refip lrefip
old new
Mid-level or senior researchers’ published papers
Relation between citation or referencepapers and ip
(‘05)1refipW
2refipW lrefipW
ipcpW 1 ipcpW 2
ipkcp
W
1npW 2npW inpW
![Page 11: Scientific Paper Recommendation Emphasizing Each Researcher’s Most Recent Research Topic](https://reader036.vdocument.in/reader036/viewer/2022070411/568148d3550346895db5ee84/html5/thumbnails/11.jpg)
11
Weighting Schemes for Mid-level or Senior Researchers’ Published Papers
• Linear Combination (LC)• Similarity between the most recent paper and
others (SIM)• Reciprocal of the difference between
published year of the most recent paper and that of other papers (RPY)
• Forgetting factor (FF)
![Page 12: Scientific Paper Recommendation Emphasizing Each Researcher’s Most Recent Research Topic](https://reader036.vdocument.in/reader036/viewer/2022070411/568148d3550346895db5ee84/html5/thumbnails/12.jpg)
12
System Overview
Researcher
Candidate papers to recommend
userP
(1) Construct user profile from each researcher’s past papers
(2) Compute similarity between
userP
(3) Recommend papers with high similarity
),,1( tjjrecp F
1recpF to jrecpF
and
TF-IDF
![Page 13: Scientific Paper Recommendation Emphasizing Each Researcher’s Most Recent Research Topic](https://reader036.vdocument.in/reader036/viewer/2022070411/568148d3550346895db5ee84/html5/thumbnails/13.jpg)
13
Experiment
• Experimental Data– DBLP papers for each researcher– ACL Anthology• 597 papers published in 2000 - 2006
![Page 14: Scientific Paper Recommendation Emphasizing Each Researcher’s Most Recent Research Topic](https://reader036.vdocument.in/reader036/viewer/2022070411/568148d3550346895db5ee84/html5/thumbnails/14.jpg)
14
Experiment
• Evaluation Measure– Normalized Discounted Cumulative Gain (NDCG)• NDCG@5, NDCG@10
– Mean Reciprocal Rank (MRR)
![Page 15: Scientific Paper Recommendation Emphasizing Each Researcher’s Most Recent Research Topic](https://reader036.vdocument.in/reader036/viewer/2022070411/568148d3550346895db5ee84/html5/thumbnails/15.jpg)
15
Experimental Results
• Junior Researchers• Mid-level or Senior Researchers
![Page 16: Scientific Paper Recommendation Emphasizing Each Researcher’s Most Recent Research Topic](https://reader036.vdocument.in/reader036/viewer/2022070411/568148d3550346895db5ee84/html5/thumbnails/16.jpg)
16
Recommendation Accuracy for Junior Researchers
![Page 17: Scientific Paper Recommendation Emphasizing Each Researcher’s Most Recent Research Topic](https://reader036.vdocument.in/reader036/viewer/2022070411/568148d3550346895db5ee84/html5/thumbnails/17.jpg)
17
[NDCG@5] [NDCG@10]
[MRR]
*
*
*
* : statistically significant for p < 0.05
![Page 18: Scientific Paper Recommendation Emphasizing Each Researcher’s Most Recent Research Topic](https://reader036.vdocument.in/reader036/viewer/2022070411/568148d3550346895db5ee84/html5/thumbnails/18.jpg)
18
Recommendation Accuracy for Mid-level or Senior Researchers
![Page 19: Scientific Paper Recommendation Emphasizing Each Researcher’s Most Recent Research Topic](https://reader036.vdocument.in/reader036/viewer/2022070411/568148d3550346895db5ee84/html5/thumbnails/19.jpg)
19
[NDCG@5] [NDCG@10]
[MRR]
**
*
*
** : statistically significant for p < 0.01
* : statistically significant for p < 0.05
![Page 20: Scientific Paper Recommendation Emphasizing Each Researcher’s Most Recent Research Topic](https://reader036.vdocument.in/reader036/viewer/2022070411/568148d3550346895db5ee84/html5/thumbnails/20.jpg)
20
[NDCG@5] [NDCG@10]
[MRR]
+
*
+ : statistically significant for p < 0.05
+
+
![Page 21: Scientific Paper Recommendation Emphasizing Each Researcher’s Most Recent Research Topic](https://reader036.vdocument.in/reader036/viewer/2022070411/568148d3550346895db5ee84/html5/thumbnails/21.jpg)
21
Conclusion• Recommendation system of scientific papers for junior
researchers, and mid-level or senior researchers• Junior researcher
– User profile constructed using the most recent paper and its pruned reference paper gives the best recommendation accuracy.• Threshold of pruning: 0.2• NDCG@5: 0.521, NDCG@10: 0.459, MRR: 0.624
• Mid-level or senior researcher– User profile constructed using papers published within 3 years and its pruned citation and reference papers gives the
best recommendation accuracy.• Threshold of pruning : 0.4• NDCG@5: 0.540, NDCG@10: 0.518, MRR: 0.812