Extracting Query Facets From Search Results

Date: 2013/08/20
Source: SIGIR’13
Authors: Weize Kong and James Allan
Advisor: Dr. Jia-ling, Koh
Speaker: Wei, Chang

OUTLINE
• Introduction
• Approach
• Experiment
• Conclusion

WHAT IS A QUERY FACET?

Definition (query facet): a set of coordinate terms, that is, terms that share a semantic relationship by being grouped under a common, more general relationship.

Example: a query facet (Mars rovers)

WHAT CAN WE DO WITH QUERY FACETS ?

• Flight type
  • Domestic
  • International

• Travel Class
  • First
  • Business
  • Economy

GOAL
Extract query facets from the top-k web search results $D = \{D_1, D_2, \ldots, D_k\}$.

OUTLINE
• Introduction
• Approach
  • Step 1: Extracting candidate lists
  • Step 2: Finding query facets from candidate lists
• Experiment
• Conclusion

PATTERN-BASED SEMANTIC CLASS EXTRACTION

Reference: Z. Dou, S. Hu, Y. Luo, R. Song, and J.-R. Wen. Finding dimensions for queries.

For example:
• Text: "There are many Mars rovers, such as Curiosity, Opportunity, and Spirit."
• HTML: <ul> <li>first class</li> <li>business class</li> <li>economy class</li> </ul>
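The two examples above can be turned into candidate lists with a few lines of Python. This is only a minimal sketch: the single "such as" pattern, the class name `ListExtractor`, and the helper names `lists_from_text` and `lists_from_html` are assumptions made for illustration, not the pattern set used in the referenced work.

```python
import re
from html.parser import HTMLParser

# Hypothetical lexical pattern: everything after "such as" up to the period.
SUCH_AS = re.compile(r"such as ([^.]+)")


def lists_from_text(sentence):
    """Return candidate lists found after 'such as' in a sentence."""
    lists = []
    for match in SUCH_AS.finditer(sentence):
        # Split on commas (optionally followed by and/or) or on a bare and/or.
        parts = re.split(r",\s*(?:and\s+|or\s+)?|\s+(?:and|or)\s+", match.group(1))
        items = [part.strip() for part in parts if part.strip()]
        if items:
            lists.append(items)
    return lists


class ListExtractor(HTMLParser):
    """Collect the <li> texts of each <ul>/<ol> as one candidate list."""

    def __init__(self):
        super().__init__()
        self.lists = []
        self._stack = []       # one open list per currently open <ul>/<ol>
        self._in_item = False

    def handle_starttag(self, tag, attrs):
        if tag in ("ul", "ol"):
            self._stack.append([])
        elif tag == "li" and self._stack:
            self._stack[-1].append("")
            self._in_item = True

    def handle_endtag(self, tag):
        if tag == "li":
            self._in_item = False
        elif tag in ("ul", "ol") and self._stack:
            self.lists.append(self._stack.pop())

    def handle_data(self, data):
        if self._in_item and self._stack and self._stack[-1]:
            self._stack[-1][-1] += data


def lists_from_html(html):
    """Return one candidate list per HTML list element."""
    parser = ListExtractor()
    parser.feed(html)
    parser.close()
    return [[item.strip() for item in items if item.strip()] for items in parser.lists]
```

Both helpers return a list of lists, so their outputs can be concatenated into one pool of candidate lists per search result.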

CANDIDATE LISTS

The candidate lists are usually noisy, and could be non-relevant to the issued query.

To address this problem, we use a supervised method.

• All list items are normalized by converting the text to lowercase and removing non-alphanumeric characters.

• Then, we remove stopwords and duplicate items in each list.

• Finally, we discard all lists that contain fewer than two items or more than 200 items.
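The cleaning steps above can be sketched as follows. The stopword set is a tiny hypothetical stand-in (the slides do not say which list was used), and keeping whitespace during normalization is an assumption so that multi-word items such as "first class" stay readable.

```python
import re

# Hypothetical, deliberately tiny stopword list; not the one used by the authors.
STOPWORDS = {"a", "an", "and", "the", "of", "or", "in", "to"}


def normalize_item(item):
    """Lowercase the item and drop non-alphanumeric characters."""
    text = item.lower()
    # Assumption: whitespace is kept so that multi-word items survive.
    text = re.sub(r"[^a-z0-9\s]", "", text)
    return " ".join(text.split())


def clean_lists(candidate_lists, min_items=2, max_items=200):
    """Normalize items, drop stopwords and duplicates, then filter by list size."""
    cleaned = []
    for items in candidate_lists:
        kept = []
        for item in items:
            norm = normalize_item(item)
            if norm and norm not in STOPWORDS and norm not in kept:
                kept.append(norm)
        # Discard lists with fewer than two or more than 200 items.
        if min_items <= len(kept) <= max_items:
            cleaned.append(kept)
    return cleaned
```

For instance, `clean_lists([["First Class!", "first class", "the"]])` returns an empty result, because only one distinct item is left after normalization and stopword removal.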

NOTE: WHAT IS A SUPERVISED METHOD

EXAMPLE:

LA-99 (Training Data)

         Quiz 1   Quiz 2   Quiz 3   Final Exam
John     A        B+       B-       B
Eric     A+       A        A+       A
Peter    B+       A-       A+       A+
Steve    A+       A+       B-       B+
Mark     C        A+       B+       B
Larry    B+       B+       B+       A

LA-100

         Quiz 1   Quiz 2   Quiz 3   Final Exam
David    A-       B+       A-       ?
James    B        A        A        ?

NOTE: WHAT IS SUPERVISED LEARNING

Training data (with features) → Training → Model
New data → Model → Prediction


PROBLEM DEFINITION

• Whether a list item is a facet term
• Whether a pair of list items is in the same query facet

FEATURES

LOGISTIC-BASED CONDITIONAL PROBABILITY DISTRIBUTIONS

PARAMETER ESTIMATION

The parameters are estimated by maximizing the log-likelihood with gradient descent (applied to the negative log-likelihood).
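As a rough illustration of likelihood-based parameter estimation, the sketch below fits a single logistic model by gradient ascent on the log-likelihood, which is equivalent to gradient descent on its negative. It is not the graphical model from the talk: the feature vectors, the labels, the learning rate, and the function names are all assumptions made for the example.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def predict(weights, x):
    """Probability that an item with feature vector x has label 1."""
    return sigmoid(sum(w * v for w, v in zip(weights, x)))


def log_likelihood(weights, features, labels):
    total = 0.0
    for x, y in zip(features, labels):
        p = predict(weights, x)
        total += y * math.log(p) + (1 - y) * math.log(1 - p)
    return total


def fit(features, labels, learning_rate=0.1, steps=500):
    """Gradient ascent on the log-likelihood of a logistic model."""
    weights = [0.0] * len(features[0])
    for _ in range(steps):
        grad = [0.0] * len(weights)
        for x, y in zip(features, labels):
            error = y - predict(weights, x)   # derivative of the log-likelihood w.r.t. the score
            for k, v in enumerate(x):
                grad[k] += error * v
        weights = [w + learning_rate * g for w, g in zip(weights, grad)]
    return weights


# Hypothetical toy data: [bias, one made-up feature]; label 1 = "is a facet term".
FEATURES = [[1, 0.9], [1, 0.8], [1, 0.4], [1, 0.6], [1, 0.2], [1, 0.1]]
LABELS = [1, 1, 1, 0, 0, 0]
WEIGHTS = fit(FEATURES, LABELS)
```

After fitting, `log_likelihood(WEIGHTS, FEATURES, LABELS)` is higher than the value at the all-zero starting point, and items with a larger feature value receive a higher predicted probability.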

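INFERENCE

Once training is finished, labels have to be inferred for new list items. The graphical model does not enforce the labeling to produce a strict partitioning of the facet terms. For example, when $z_{i,j} = 1$ and $z_{j,k} = 1$, we may still have $z_{i,k} = 0$, where $z_{i,j} = 1$ means that list items $t_i$ and $t_j$ are labeled as belonging to the same query facet.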
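The inconsistency described above can be detected mechanically. The sketch below assumes the pairwise labels are stored in a dictionary keyed by index pairs `(i, j)` with `i < j`; this storage format and the function name are hypothetical.

```python
from itertools import combinations


def partition_violations(pair_labels, n_items):
    """Return triples (i, j, k) whose pairwise labels cannot come from a strict partition.

    pair_labels maps (i, j) with i < j to 1 (same facet) or 0 (different facets);
    missing pairs are treated as 0.
    """
    def same(a, b):
        return pair_labels.get((min(a, b), max(a, b)), 0)

    violations = []
    for i, j, k in combinations(range(n_items), 3):
        # Exactly two "same facet" labels among the three pairs breaks transitivity.
        if same(i, j) + same(j, k) + same(i, k) == 2:
            violations.append((i, j, k))
    return violations
```

With labels `{(0, 1): 1, (1, 2): 1, (0, 2): 0}` the function reports the triple `(0, 1, 2)`; changing the last label to 1 makes the labeling consistent with a partition.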

REPHRASE THE OPTIMIZATION PROBLEM

The optimization target becomes $F^{*} = \arg\max_{F \in \mathcal{F}} P(F)$, where $P(F)$ is the likelihood of the labeling induced by $F$ under the trained model and $\mathcal{F}$ is the set of all possible query facet sets that can be generated from L with the strict partitioning constraint.

This optimization problem is NP-hard, which can be proved by a reduction from the Multiway Cut problem. Therefore, we propose two algorithms, QF-I and QF-J, to approximate the results.

QF-I
1. Select list items whose facet-term probability exceeds a minimum threshold as facet terms.
2.

RANKING QUERY FACETS

score for a query facet :

score for a facet term :

EXPERIMENT
• Evaluation
• Experiment Result

Using Top 10 query facets generated by different models.

EVALUATION METRICS

We use “∗” to distinguish between system-generated results and human-labeled results; the human-labeled results serve as ground truth.
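One simple way to compare system output against human labels is term-level precision, recall, and F1 over the pooled facet terms. The sketch below assumes each facet is given as a list of strings and ignores how the terms are grouped; the function name and the pooled-set definition are assumptions, not necessarily the exact metrics reported in the talk.

```python
def term_prf(system_facets, gold_facets):
    """Precision, recall and F1 of facet terms, pooled over all facets."""
    system_terms = set().union(*system_facets) if system_facets else set()
    gold_terms = set().union(*gold_facets) if gold_facets else set()
    correct = len(system_terms & gold_terms)
    precision = correct / len(system_terms) if system_terms else 0.0
    recall = correct / len(gold_terms) if gold_terms else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    return precision, recall, 2 * precision * recall / (precision + recall)


# Hypothetical system output and human labels for a flight-search query.
SYSTEM = [["first", "business"], ["domestic", "cheap"]]
GOLD = [["first", "business", "economy"], ["domestic", "international"]]
PRECISION, RECALL, F1 = term_prf(SYSTEM, GOLD)
```

Here three of the four system terms appear in the human labels, and three of the five human-labeled terms are recovered.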

CLUSTERING QUALITY

OVERALL QUALITY

fp-nDCG is weighted by
rp-nDCG is weighted by


FACET TERMS

CLUSTERING FACET TERMS

OVERALL


CONCLUSION

• We developed a supervised method based on a graphical model to recognize query facets from the noisy facet candidate lists extracted from the top-ranked search results.

• We proposed two algorithms for approximate inference on the graphical model.

• We designed a new evaluation metric for this task that combines recall and precision of facet terms with grouping quality.

• Experimental results showed that the supervised method significantly outperforms unsupervised methods, suggesting that query facet extraction can be effectively learned.
