beyond boolean queries ranked retrieval thus far, our queries have all been boolean. documents...

Beyond Boolean Queries

Upload: orion-reade

Post on 14-Dec-2015

218 views

Category:

Documents

1 download

Report

Download

Tags:

Embed Size (px):

TRANSCRIPT

Slide 1

Slide 2 Beyond Boolean Queries Slide 3 Ranked retrieval Thus far, our queries have all been Boolean. Documents either match or dont. Good for expert users with precise understanding of their needs and the collection. Also good for applications: Applications can easily consume 1000s of results. Writing the precise query that will produce the desired result is difficult for most users. Slide 4 Problem with Boolean search: feast or famine Boolean queries often result in either too few (=0) or too many (1000s) results. A query that is too broad yields hundreds of thousands of hits A query that is too narrow may yield no hits It takes a lot of skill to come up with a query that produces a manageable number of hits. AND gives too few; OR gives too many Slide 5 Ranked retrieval models Rather than a set of documents satisfying a query expression, in ranked retrieval models, the system returns an ordering over the (top) documents in the collection with respect to a query Free text queries: Rather than a query language of operators and expressions, the users query is just one or more words in a human language In principle, these are different options, but in practice, ranked retrieval models have normally been associated with free text queries and vice versa 4 Slide 6 Feast or famine: not a problem in ranked retrieval When a system produces a ranked result set, large result sets are not an issue IIndeed, the size of the result set is not an issue WWe just show the top k ( 10) results WWe dont overwhelm the user PPremise: the ranking algorithm works Slide 7 Scoring as the basis of ranked retrieval We wish to return, in order, the documents most likely to be useful to the searcher How can we rank-order the documents in the collection with respect to a query? Assign a score say in [0, 1] to each document This score measures how well document and query match. Slide 8 Query-document matching scores We need a way of assigning a score to a query/document pair Lets start with a one-term query If the query term does not occur in the document: score should be 0 The more frequent the query term in the document, the higher the score (should be) We will look at a number of alternatives for this. Slide 9 Take 1: Jaccard coefficient A commonly used measure of overlap of two sets A and B jaccard(A,B) = |A B| / |A B| jaccard(A,A) = 1 jaccard(A,B) = 0 if A B = 0 A and B dont have to be the same size. Always assigns a number between 0 and 1. Slide 10 Jaccard coefficient: Scoring example What is the query-document match score that the Jaccard coefficient computes for each of the two documents below? Query: ides of march Document 1: caesar died in march Document 2: the long march Slide 11 Jaccard Example done Query: ides of march Document 1: caesar died in march Document 2: the long march A = {ides, of, march} B1 = {caesar, died, in, march} B2 = {the, long, march}

Supporting Ranked Boolean Similarity Queries in …...ORTEGA ET AL.: SUPPORTING RANKED BOOLEAN SIMILARITY QUERIES IN MARS 907 Although the above straightforward adaptation of Boo-

Introduction to Information Retrieval - csuohio.edueecs.csuohio.edu/~sschung/CIS660/LectureNotes_IR_lecture...Introduction to Information Retrieval Ranked retrieval Thus far, our queries

Boolean Algebra – I. Outline Introduction Digital circuits Boolean Algebra Two-Valued Boolean Algebra Boolean Algebra Postulates Precedence

Introduction to Information Retrievalceit.aut.ac.ir/~amirkhani/summer-2012/IR-lecture1.pdf · 2012-07-29 · Introduction to Information Retrieval Sec. 1.3 Boolean queries: Exact

PV211: Introduction to Information Retrieval ...sojka/PV211/p01intro.pdf · Introduction History Boolean model Inverted index Processing Boolean queries Query optimization Course

Introducon*to*Informa)on*Retrieval* !! !! CS3245 Informa…kanmy/courses/3245_2015/w7.pdf · CS3245*–Informa)on*Retrieval*!! !! Problem!with!Boolean!search:! Feastor!Famine!! Boolean!queries!o^en!resultin!either!too!few!(=0)!or!

Beyond Boolean Queries

Introduction to Information Retrieval - cse.unsw.edu.aucs6714/18s2/lect/L6-tfidf.pdf · COMP6714: Information Retrieval & Web Search Ranked retrieval §Thus far, our queries have

Introduction to Information Retrievalsmbidoki.ir/courses/245_IR_932_Lecture2_InvertedIndex... · 2015. 3. 9. · Introduction to Information Retrieval Boolean queries: Exact match

A Graphical Filter/Flow Representation of Boolean Queries ... · fying Boolean queries. In an attempt to overcome this limitation, a graphical Filter/Flow representation of Boolean

Web Search Information Retrieval. 2 Boolean queries: Examples Simple queries involving relationships between terms and documents Simple queries involving

UKTO USER GUIDE · i) Boolean Search Accepts queries made with Boolean logic. Stemmed query terms are used to produce broader results, for example, searching “territorial” will