Information Retrieval. Lecture 7: Introduction to Information Retrieval (Manning et al. 2007), Chapter 17. For the MSc Computer Science Programme. Dell Zhang, Birkbeck, University of London.


Page 1: Information Retrieval

Information Retrieval

Lecture 7: Introduction to Information Retrieval (Manning et al. 2007)

Chapter 17

For the MSc Computer Science Programme

Dell Zhang, Birkbeck, University of London

Page 2: Information Retrieval

Yahoo! Hierarchy

http://dir.yahoo.com/science

[Figure: an excerpt of the Yahoo! directory tree. Top-level categories shown: agriculture, biology, physics, CS, space. Sample subcategories: dairy, crops, agronomy, forestry (under agriculture); botany, evolution, cell (under biology); magnetism, relativity (under physics); AI, HCI, courses (under CS); craft, missions (under space).]

Page 3: Information Retrieval

Hierarchical Clustering

Build a tree-like hierarchical taxonomy (dendrogram) from a set of unlabeled documents.

Divisive (top-down): start with all documents in a single cluster; eventually each document forms a cluster on its own. This can be done by recursive application of a (flat) partitional clustering algorithm, e.g., k-means with k=2 (bisecting k-means).

Agglomerative (bottom-up): start with each document as a single cluster; eventually all documents belong to the same cluster.
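The divisive route can be sketched in a few lines. This is a toy illustration on 1-D points with deterministic centroid initialisation, not the lecture's implementation; the names `two_means` and `bisecting_kmeans` and the data are made up.

```python
def two_means(points, iters=20):
    """Plain 2-means on a list of floats; returns two sub-lists."""
    c1, c2 = min(points), max(points)   # deterministic initial centroids
    for _ in range(iters):
        a = [p for p in points if abs(p - c1) <= abs(p - c2)]
        b = [p for p in points if abs(p - c1) > abs(p - c2)]
        if a:
            c1 = sum(a) / len(a)
        if b:
            c2 = sum(b) / len(b)
    return a, b

def bisecting_kmeans(points, k):
    """Repeatedly split the largest cluster until k clusters remain."""
    clusters = [list(points)]
    while len(clusters) < k:
        clusters.sort(key=len, reverse=True)  # pick the largest cluster
        a, b = two_means(clusters.pop(0))     # bisect it with 2-means
        clusters += [a, b]
    return clusters

print(bisecting_kmeans([1.0, 1.2, 1.1, 8.0, 8.1, 20.0], 3))
```

Each split is a flat 2-way partition, so the sequence of splits traces out a top-down binary hierarchy.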

Page 4: Information Retrieval

Dendrogram

Clustering is obtained by cutting the dendrogram at a desired level: each connected component forms a cluster.

The number of clusters k is not required in advance.
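Cutting the dendrogram at a level can be sketched with a union-find over the merge history: only merges at least as similar as the threshold are applied, and the surviving connected components are the clusters. The merge list below is invented toy data.

```python
def cut(n_docs, merges, threshold):
    """Apply only merges whose similarity is >= threshold; return clusters."""
    parent = list(range(n_docs))

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]   # path compression
            x = parent[x]
        return x

    for sim, i, j in merges:
        if sim >= threshold:                # the cut: ignore weak merges
            parent[find(i)] = find(j)
    groups = {}
    for d in range(n_docs):
        groups.setdefault(find(d), []).append(d)
    return sorted(groups.values())

# Toy merge history: (similarity, representative doc i, representative doc j)
merges = [(0.9, 0, 1), (0.8, 3, 4), (0.5, 2, 3), (0.2, 0, 2)]
print(cut(5, merges, 0.4))
print(cut(5, merges, 0.7))
```

Raising the threshold yields more, tighter clusters; no k has to be fixed in advance.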

Page 5: Information Retrieval

Dendrogram – Example

Clusters of News Stories: Reuters RCV1

Page 6: Information Retrieval

Dendrogram – Example

Clusters of Things that People Want: ZEBO

Page 7: Information Retrieval

HAC

Hierarchical Agglomerative Clustering starts with each document in a separate cluster, then repeats until only one cluster remains:

Among the current clusters, determine the pair ci and cj that are most similar (Single-Link, Complete-Link, etc.), then merge ci and cj into a single cluster.

The history of merging forms a binary tree, or hierarchy.
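The loop just described can be sketched directly. This is a naive O(n^3) version for clarity, using the single-link rule (maximum pairwise similarity) to compare clusters; the similarity matrix is toy data.

```python
def hac_single_link(sim):
    """sim: symmetric doc-doc similarity matrix. Returns the merge history."""
    clusters = [[i] for i in range(len(sim))]
    history = []
    while len(clusters) > 1:
        best = None
        for a in range(len(clusters)):
            for b in range(a + 1, len(clusters)):
                # single-link: strongest link between the two clusters
                s = max(sim[x][y] for x in clusters[a] for y in clusters[b])
                if best is None or s > best[0]:
                    best = (s, a, b)
        s, a, b = best
        history.append((s, clusters[a], clusters[b]))
        merged = clusters[a] + clusters[b]
        clusters = [c for k, c in enumerate(clusters) if k not in (a, b)]
        clusters.append(merged)
    return history

sim = [[1.0, 0.9, 0.1, 0.2],
       [0.9, 1.0, 0.3, 0.1],
       [0.1, 0.3, 1.0, 0.8],
       [0.2, 0.1, 0.8, 1.0]]
for s, ci, cj in hac_single_link(sim):
    print(round(s, 2), ci, cj)
```

The returned history is exactly the binary tree of merges: read top to bottom, it lists the dendrogram's internal nodes from the leaves upward.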

Page 8: Information Retrieval

Single-Link

The similarity between a pair of clusters is defined by the single strongest link (i.e., maximum cosine-similarity) between their members:

After merging ci and cj, the similarity of the resulting cluster to another cluster, ck, is:

sim(c_i, c_j) = max_{x ∈ c_i, y ∈ c_j} sim(x, y)

sim(c_i ∪ c_j, c_k) = max( sim(c_i, c_k), sim(c_j, c_k) )

Page 11: Information Retrieval

HAC – Example

[Figure: dendrogram built over five documents d1, d2, d3, d4, d5; the merge order is {d1,d2}, then {d4,d5}, then {d3} with {d4,d5} giving {d3,d4,d5}.]

As clusters agglomerate, the sequence of merges forms a dendrogram over the documents.

Page 12: Information Retrieval

HAC – Example Single-Link

Page 13: Information Retrieval

Take Home Message

Single-Link HAC Dendrogram

sim(c_i, c_j) = max_{x ∈ c_i, y ∈ c_j} sim(x, y)

sim(c_i ∪ c_j, c_k) = max( sim(c_i, c_k), sim(c_j, c_k) )