multimedia information retrieval · exercise sketch a block diagram showing how you would implement...

68
Multimedia Information Retrieval Prof Stefan Rüger Multimedia and Information Systems Knowledge Media Institute The Open University http://kmi.open.ac.uk/mmis

Upload: others

Post on 25-Apr-2020

3 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Multimedia Information Retrieval · Exercise Sketch a block diagram showing how you would implement a Multimedia Information Retrieval system for one of these scenarios: 1.Browsing

Multimedia Information Retrieval

Prof Stefan Rüger

Multimedia and Information SystemsKnowledge Media Institute

The Open Universityhttp://kmi.open.ac.uk/mmis

Page 2: Multimedia Information Retrieval · Exercise Sketch a block diagram showing how you would implement a Multimedia Information Retrieval system for one of these scenarios: 1.Browsing

Why content-based?

Actually, what is content-based search?

Is human thinking content-based?

Metadata annotation (text) is good but - - - -

Page 3: Multimedia Information Retrieval · Exercise Sketch a block diagram showing how you would implement a Multimedia Information Retrieval system for one of these scenarios: 1.Browsing

Multimedia Information Retrieval

1. What are multimedia queries?

2. Fingerprinting

3. Metadata & piggy-back retrieval

4. Automated image annotation

5 Visual content-based retrieval I (challenges + features)

6 Visual content-based retrieval II (distances)

7. Evaluation

8. Browsing, search and geography

Page 4: Multimedia Information Retrieval · Exercise Sketch a block diagram showing how you would implement a Multimedia Information Retrieval system for one of these scenarios: 1.Browsing
Page 5: Multimedia Information Retrieval · Exercise Sketch a block diagram showing how you would implement a Multimedia Information Retrieval system for one of these scenarios: 1.Browsing

The semantic gap

1m pixels with a spatial

colour distribution

faces & vase-like object

victory, triumph, ...

disappointment, ...

Page 6: Multimedia Information Retrieval · Exercise Sketch a block diagram showing how you would implement a Multimedia Information Retrieval system for one of these scenarios: 1.Browsing

Polysemy

Page 7: Multimedia Information Retrieval · Exercise Sketch a block diagram showing how you would implement a Multimedia Information Retrieval system for one of these scenarios: 1.Browsing

Scalability

Page 8: Multimedia Information Retrieval · Exercise Sketch a block diagram showing how you would implement a Multimedia Information Retrieval system for one of these scenarios: 1.Browsing

Features and distances

x

xx

x

o

Feature space

Page 9: Multimedia Information Retrieval · Exercise Sketch a block diagram showing how you would implement a Multimedia Information Retrieval system for one of these scenarios: 1.Browsing

Architecture

Page 10: Multimedia Information Retrieval · Exercise Sketch a block diagram showing how you would implement a Multimedia Information Retrieval system for one of these scenarios: 1.Browsing

Features

Visual Colour, texture, shape, edge detection, SIFT/SURF

AudioTemporalHow to describe the features? For people For computers

Page 11: Multimedia Information Retrieval · Exercise Sketch a block diagram showing how you would implement a Multimedia Information Retrieval system for one of these scenarios: 1.Browsing

Digital Images

Page 12: Multimedia Information Retrieval · Exercise Sketch a block diagram showing how you would implement a Multimedia Information Retrieval system for one of these scenarios: 1.Browsing

Content of an image

145 173 201 253 245 245153 151 213 251 247 247181 159 225 255 255 255165 149 173 141 93 97167 185 157 79 109 97121 187 161 97 117 115

Page 13: Multimedia Information Retrieval · Exercise Sketch a block diagram showing how you would implement a Multimedia Information Retrieval system for one of these scenarios: 1.Browsing

Histogram

1: 0 - 312: 32 - 633: 64 - 954: 96 – 1275: 128 – 1596: 160 – 1917: 192 - 2238: 224 – 255

1 2 3 4 5 6 7 80

0.05

0.1

0.15

0.2

0.25

0.3

0.35

0.4

0.45

0.5

Page 14: Multimedia Information Retrieval · Exercise Sketch a block diagram showing how you would implement a Multimedia Information Retrieval system for one of these scenarios: 1.Browsing

Colour

phenomenon of human perceptionthree-dimensional (RGB/CMY/HSB)spectral colour: pure light of one wavelength

spectral colours: wavelength (nm)

blue cyan green yellow red

Page 15: Multimedia Information Retrieval · Exercise Sketch a block diagram showing how you would implement a Multimedia Information Retrieval system for one of these scenarios: 1.Browsing

Colour

Page 16: Multimedia Information Retrieval · Exercise Sketch a block diagram showing how you would implement a Multimedia Information Retrieval system for one of these scenarios: 1.Browsing

http://blog.xkcd.com/2010/05/03/color-survey-results/

Page 17: Multimedia Information Retrieval · Exercise Sketch a block diagram showing how you would implement a Multimedia Information Retrieval system for one of these scenarios: 1.Browsing

Colour: subjective

http://eagereyes.org/blog/2011/you-only-see-colors-you-can-name

Page 18: Multimedia Information Retrieval · Exercise Sketch a block diagram showing how you would implement a Multimedia Information Retrieval system for one of these scenarios: 1.Browsing

Colour histogram

Page 19: Multimedia Information Retrieval · Exercise Sketch a block diagram showing how you would implement a Multimedia Information Retrieval system for one of these scenarios: 1.Browsing

Exercise

a) Sketch a 3D colour histogram for

b) Sketch 2D colour histograms per channelc) Which one is more informative? Why?

R G B 0 0 0 black255 0 0 red 0 255 0 green 0 0 255 blue 0 255 255 cyan255 0 255 magenta255 255 0 yellow255 255 255 white

Page 20: Multimedia Information Retrieval · Exercise Sketch a block diagram showing how you would implement a Multimedia Information Retrieval system for one of these scenarios: 1.Browsing

Other Colour Spaces

HSV, HSL, CIELAB/CIELUV

Page 21: Multimedia Information Retrieval · Exercise Sketch a block diagram showing how you would implement a Multimedia Information Retrieval system for one of these scenarios: 1.Browsing

HSB colour model

hue (0°-360°)spectral colour

saturation (0% - 100%)= spectral purity

brightness (0% - 100%)= energy or luminance

chromaticity = hue+saturation

Page 22: Multimedia Information Retrieval · Exercise Sketch a block diagram showing how you would implement a Multimedia Information Retrieval system for one of these scenarios: 1.Browsing

HSB colour model

Page 23: Multimedia Information Retrieval · Exercise Sketch a block diagram showing how you would implement a Multimedia Information Retrieval system for one of these scenarios: 1.Browsing

HSB model

disadvantage: hue coordinate is not continuous0 and 360 degrees have the same meaningbut there is a huge difference in terms of numeric

distanceexample:

red = (0,100%,50%) = (360,100%,50%)

advantage: it is more natural to describe colour changes “brighter blue”, “purer magenta”, etc

Page 24: Multimedia Information Retrieval · Exercise Sketch a block diagram showing how you would implement a Multimedia Information Retrieval system for one of these scenarios: 1.Browsing

Texture

coarseness contrast directionality

[Tamura et al, 1978]

Page 25: Multimedia Information Retrieval · Exercise Sketch a block diagram showing how you would implement a Multimedia Information Retrieval system for one of these scenarios: 1.Browsing

Texture histograms

Coarseness coNtrast Directionality

[with Howarth, IEE Vision, Image & Signal Proc 15(6) 2004; Howarth PhD thesis]

Page 26: Multimedia Information Retrieval · Exercise Sketch a block diagram showing how you would implement a Multimedia Information Retrieval system for one of these scenarios: 1.Browsing

Gabor filter

Query

Orientation

Sca

le

[with Howarth, CLEF 2004]

Page 27: Multimedia Information Retrieval · Exercise Sketch a block diagram showing how you would implement a Multimedia Information Retrieval system for one of these scenarios: 1.Browsing

Shape Analysis

shape = class of geometric objects invariant undertranslationscale (changes keeping the aspect ratio)rotations

information preserving description(for compression)

non-information preserving (for retrieval)boundary based (ignore interior)region based (boundary+interior)

Particularly important for drwings, eg, patent drawings

Page 28: Multimedia Information Retrieval · Exercise Sketch a block diagram showing how you would implement a Multimedia Information Retrieval system for one of these scenarios: 1.Browsing
Page 29: Multimedia Information Retrieval · Exercise Sketch a block diagram showing how you would implement a Multimedia Information Retrieval system for one of these scenarios: 1.Browsing
Page 30: Multimedia Information Retrieval · Exercise Sketch a block diagram showing how you would implement a Multimedia Information Retrieval system for one of these scenarios: 1.Browsing
Page 31: Multimedia Information Retrieval · Exercise Sketch a block diagram showing how you would implement a Multimedia Information Retrieval system for one of these scenarios: 1.Browsing
Page 32: Multimedia Information Retrieval · Exercise Sketch a block diagram showing how you would implement a Multimedia Information Retrieval system for one of these scenarios: 1.Browsing
Page 33: Multimedia Information Retrieval · Exercise Sketch a block diagram showing how you would implement a Multimedia Information Retrieval system for one of these scenarios: 1.Browsing
Page 34: Multimedia Information Retrieval · Exercise Sketch a block diagram showing how you would implement a Multimedia Information Retrieval system for one of these scenarios: 1.Browsing
Page 35: Multimedia Information Retrieval · Exercise Sketch a block diagram showing how you would implement a Multimedia Information Retrieval system for one of these scenarios: 1.Browsing
Page 36: Multimedia Information Retrieval · Exercise Sketch a block diagram showing how you would implement a Multimedia Information Retrieval system for one of these scenarios: 1.Browsing
Page 37: Multimedia Information Retrieval · Exercise Sketch a block diagram showing how you would implement a Multimedia Information Retrieval system for one of these scenarios: 1.Browsing
Page 38: Multimedia Information Retrieval · Exercise Sketch a block diagram showing how you would implement a Multimedia Information Retrieval system for one of these scenarios: 1.Browsing
Page 39: Multimedia Information Retrieval · Exercise Sketch a block diagram showing how you would implement a Multimedia Information Retrieval system for one of these scenarios: 1.Browsing
Page 40: Multimedia Information Retrieval · Exercise Sketch a block diagram showing how you would implement a Multimedia Information Retrieval system for one of these scenarios: 1.Browsing
Page 41: Multimedia Information Retrieval · Exercise Sketch a block diagram showing how you would implement a Multimedia Information Retrieval system for one of these scenarios: 1.Browsing
Page 42: Multimedia Information Retrieval · Exercise Sketch a block diagram showing how you would implement a Multimedia Information Retrieval system for one of these scenarios: 1.Browsing
Page 43: Multimedia Information Retrieval · Exercise Sketch a block diagram showing how you would implement a Multimedia Information Retrieval system for one of these scenarios: 1.Browsing
Page 44: Multimedia Information Retrieval · Exercise Sketch a block diagram showing how you would implement a Multimedia Information Retrieval system for one of these scenarios: 1.Browsing
Page 45: Multimedia Information Retrieval · Exercise Sketch a block diagram showing how you would implement a Multimedia Information Retrieval system for one of these scenarios: 1.Browsing
Page 46: Multimedia Information Retrieval · Exercise Sketch a block diagram showing how you would implement a Multimedia Information Retrieval system for one of these scenarios: 1.Browsing
Page 47: Multimedia Information Retrieval · Exercise Sketch a block diagram showing how you would implement a Multimedia Information Retrieval system for one of these scenarios: 1.Browsing
Page 48: Multimedia Information Retrieval · Exercise Sketch a block diagram showing how you would implement a Multimedia Information Retrieval system for one of these scenarios: 1.Browsing
Page 49: Multimedia Information Retrieval · Exercise Sketch a block diagram showing how you would implement a Multimedia Information Retrieval system for one of these scenarios: 1.Browsing

Shape Analysis

• boundary based• perimeter & area• corner points• circularity• chain codes

• region based (considering interior and holes, …)• not covered here

Page 50: Multimedia Information Retrieval · Exercise Sketch a block diagram showing how you would implement a Multimedia Information Retrieval system for one of these scenarios: 1.Browsing

Perimeter and area

parameterised curvex(t), y(t)

R

count pixels in areaboundary pixel count

vsR

Page 51: Multimedia Information Retrieval · Exercise Sketch a block diagram showing how you would implement a Multimedia Information Retrieval system for one of these scenarios: 1.Browsing

Circularity

A=area, P=perimeterT is 1 for a circleT is smaller than 1 for all other shapescircularity is aka compactness

R

Page 52: Multimedia Information Retrieval · Exercise Sketch a block diagram showing how you would implement a Multimedia Information Retrieval system for one of these scenarios: 1.Browsing

Convexity

ratio of perimeter of convex hull and the original curve

1 for convex shapes, less than 1 otherwise

convex hull

Page 53: Multimedia Information Retrieval · Exercise Sketch a block diagram showing how you would implement a Multimedia Information Retrieval system for one of these scenarios: 1.Browsing

Chain codes

contour description by a string of directionsFreeman chain code

f1f2f3...eight possible directions for each pixeltranslation invariant, not rotation invariant

3 2 1 4 0 5 6 7

= 210077654343

= 100776543432

= 007765434321

choose lowest numberfor closed contours

= 007765434321

Page 54: Multimedia Information Retrieval · Exercise Sketch a block diagram showing how you would implement a Multimedia Information Retrieval system for one of these scenarios: 1.Browsing

Difference chain codes

sequence of angles: ai = (fi+1- fi) mod 8

rotation invariant

04

0° 45° 90° 135° 180° 225° 270° 315°

765321

= = ... = 070777717777=

Page 55: Multimedia Information Retrieval · Exercise Sketch a block diagram showing how you would implement a Multimedia Information Retrieval system for one of these scenarios: 1.Browsing

Sound

Page 56: Multimedia Information Retrieval · Exercise Sketch a block diagram showing how you would implement a Multimedia Information Retrieval system for one of these scenarios: 1.Browsing

Audio Features

• Spectrogram– graph of frequencies/energy/time

• tempo, pitch, mode• See

Z Liu, Y Wang and T Chen (1998). Audio feature extraction and analysis for scene segmentation and classification. VLSI Signal Processing 20(1-2), 69-79.

Page 57: Multimedia Information Retrieval · Exercise Sketch a block diagram showing how you would implement a Multimedia Information Retrieval system for one of these scenarios: 1.Browsing

Histograms

Feature vectors → histograms145 173 201 253 245 245153 151 213 251 247 247181 159 225 255 255 255165 149 173 141 93 97167 185 157 79 109 97121 187 161 97 117 115

0

0.05

0.1

0.15

0.2

0.25

0.3

0.35

0.4

0.45

0.5

1 2 3 4 5 6 7 8

1: 0 - 312: 32 - 633: 64 - 954: 96 – 127

5: 128 – 1596: 160 – 1917: 192 - 2238: 224 – 255

Page 58: Multimedia Information Retrieval · Exercise Sketch a block diagram showing how you would implement a Multimedia Information Retrieval system for one of these scenarios: 1.Browsing

Central moments

Simple statistics Mean

Variance (squared standard deviation)

3rd central moment (skewness)

where w is image width and h is image height

Page 59: Multimedia Information Retrieval · Exercise Sketch a block diagram showing how you would implement a Multimedia Information Retrieval system for one of these scenarios: 1.Browsing

Moment features

Page 60: Multimedia Information Retrieval · Exercise Sketch a block diagram showing how you would implement a Multimedia Information Retrieval system for one of these scenarios: 1.Browsing

Moment features

Page 61: Multimedia Information Retrieval · Exercise Sketch a block diagram showing how you would implement a Multimedia Information Retrieval system for one of these scenarios: 1.Browsing

Global vs local

Global histogram also matches polar bears, marble floors, …

Page 62: Multimedia Information Retrieval · Exercise Sketch a block diagram showing how you would implement a Multimedia Information Retrieval system for one of these scenarios: 1.Browsing

Localisation

0

0.05

0.1

0.15

0.2

0.25

0.3

1 2 3 4 5 6 7 8

64% centre

36% border

Page 63: Multimedia Information Retrieval · Exercise Sketch a block diagram showing how you would implement a Multimedia Information Retrieval system for one of these scenarios: 1.Browsing

Tiled Histograms

0

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

1

1 2 3 4 5 6 7 8

0

0.1

0.2

0.3

0.4

0.5

0.6

0.7

1 2 3 4 5 6 7 8

0

0.05

0.1

0.15

0.2

0.25

0.3

0.35

0.4

1 2 3 4 5 6 7 8

0

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

1 2 3 4 5 6 7 8

0

0.1

0.2

0.3

0.4

0.5

0.6

1 2 3 4 5 6 7 8

0

0.05

0.1

0.15

0.2

0.25

0.3

0.35

0.4

0.45

0.5

1 2 3 4 5 6 7 8

Page 64: Multimedia Information Retrieval · Exercise Sketch a block diagram showing how you would implement a Multimedia Information Retrieval system for one of these scenarios: 1.Browsing

Segmentation

0

0.05

0.1

0.15

0.2

0.25

0.3

1 2 3 4 5 6 7 8

foreground

background

Page 65: Multimedia Information Retrieval · Exercise Sketch a block diagram showing how you would implement a Multimedia Information Retrieval system for one of these scenarios: 1.Browsing

Points of interest

Many PoI, ie, many feature vectors

Quantised feature vectors ≈ words

Bag of word model ≈ text retrieval

Page 66: Multimedia Information Retrieval · Exercise Sketch a block diagram showing how you would implement a Multimedia Information Retrieval system for one of these scenarios: 1.Browsing

Implementation

speed vs flexibility vs precisionProcess:

1. best abstracted representation of your media2. best method for calculating difference/similarity3. implement efficiently, considering responsiveness and

scalability

Page 67: Multimedia Information Retrieval · Exercise Sketch a block diagram showing how you would implement a Multimedia Information Retrieval system for one of these scenarios: 1.Browsing

Content-Based Image Retrieval

Which features are best for searching? Depends on the information need:Looking for sunset holiday pictures in your digital shoebox? Use colour histograms.Want to build a wallpaper customer database? Use colour + texture.Want to build a b/w sketch database for technical industrial designs? Use shape descriptors.Not sure which features are best for a query (eg, if you also have abstract features such as Fourier coefficients)? Deploy relevance feedback and let the system learn the relevant features for this query...

Page 68: Multimedia Information Retrieval · Exercise Sketch a block diagram showing how you would implement a Multimedia Information Retrieval system for one of these scenarios: 1.Browsing

Exercise

Sketch a block diagram showing how you would implement a Multimedia Information Retrieval system for one of these scenarios:

1. Browsing wallpaper patterns in a home decorator store2. Finding “interesting” photos in a personal collection of holiday snaps3. Managing industrial design pattern templates for a manufacturing

company

Think about: what types of features you might use what would the query be the user interface