quantum probabilities and quantum-inspired information retrieval

145
The Geometry of Information Retrieval Quantum Probabilities and Quantum-inspired IR Models Ingo Frommholz University of Bedfordshire Luton, UK [email protected] Herbstschule Information Retrieval Schloss Dagstuhl

Upload: ingo-frommholz

Post on 28-Aug-2014

2.390 views

Category:

Technology


1 download

DESCRIPTION

The tutorial about quantum probabilities and quantum-inspired information retrieval I held in October 2012 at the IR Autumn School of the German Information Retrieval Specialist Group at Schloss Dagstuhl, Germany.

TRANSCRIPT

Page 1: Quantum Probabilities and Quantum-inspired Information Retrieval

The Geometry of Information RetrievalQuantum Probabilities and Quantum-inspired IR Models

Ingo Frommholz

University of BedfordshireLuton, UK

[email protected]

Herbstschule Information RetrievalSchloss Dagstuhl

Page 2: Quantum Probabilities and Quantum-inspired Information Retrieval

Table of Contents

1 Introduction

2 Introduction to Quantum Probabilities

3 Quantum-inspired IR Models

4 Further Models and Conclusion

2 / 145

Page 3: Quantum Probabilities and Quantum-inspired Information Retrieval

Acknowledgements

Some parts (in particular the quantum probability one) are basedon the ECIR 2012 “Quantum Information Access and Retrieval”tutorial given by Benjamin Piwowarski and Massimo MelucciThis tutorial is worth checking out

It covers some aspect more thoroughlyIt discusses some further quantum-based IR models

See alsohttp://www.bpiwowar.net/2012/04/

slides-from-the-ecir12-quantum-information-access-and-retrieval-tutorial/

Direct download:http://www.bpiwowar.net/wp-content/uploads/2012/04/tutorial-handout.pdf

I’d like to thank Guido Zuccon for providing material about qPRP

3 / 145

Page 4: Quantum Probabilities and Quantum-inspired Information Retrieval

Introduction

IR and Geometry

Page 5: Quantum Probabilities and Quantum-inspired Information Retrieval

Introduction

IR and Geometry

IR Models and PrinciplesGeometry, Probability and Logics

LUP

pDatalog

VSM LSI

LM

PRPBM25iPRP Boolean

QMqPRP

QIA

5 / 145

Page 6: Quantum Probabilities and Quantum-inspired Information Retrieval

Introduction

IR and Geometry

A Language for IR

The geometry and mathematics behind quantum mechanics canbe seen as a ’language’ for expressing the different IR models[van Rijsbergen, 2004].

Combination of geometry, probability and logics

Leading to non-classical probability theory and logics

Potential unified framework for IR modelsApplications in areas outside physics emerging

Quantum Interaction symposia (e.g. [Song et al., 2011])

6 / 145

Page 7: Quantum Probabilities and Quantum-inspired Information Retrieval

Introduction

IR and Geometry

IR as Quantum System?An Analogy

Quantum System IR System

Particles, physical properties inHilbert spaces

Documents, relevance, informa-tion needs in Hilbert Spaces

System state uncertain Information need (IN) uncertainObservation changes systemstate

User interaction changes sys-tem state

Observations interfere (Heisen-berg)

Document relevance interferes

Combination of systems Combination of IN facets,polyrepresentation

7 / 145

Page 8: Quantum Probabilities and Quantum-inspired Information Retrieval

Introduction

IR and Geometry

Brief History of the Quantum Formalism

1890s Max Planck’s hypothesis: energy not continuous butcomes in quantas

1920s Born/Jordan: matrix reformulation of Heisenberg’s work

1930s Dirac/von Neumann: mathematical formulation ofquantum physics (Hilbert space), theory of quantummeasurement, quantum logics

8 / 145

Page 9: Quantum Probabilities and Quantum-inspired Information Retrieval

Introduction to Quantum ProbabilitiesQuantum FormalismPreliminaries: Hilbert Spaces and Inner Products

Complex NumbersHilbert SpacesOperators and ProjectorsTensor Spaces

Quantum ProbabilitiesQuantum and Classical ProbabilitiesDirac NotationEventsQuantum LogicsQuantum StatesPure StatesMixed StatesDensity OperatorsConclusion

Page 10: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum Probabilities Introduction

Quantum Formalism

Quantum Formalism

The quantum formalism is build on top of Hilbert spacesEach finite-dimensional vector space with an inner product is aHilbert space [Halmos, 1958]

We focus on finite-dimensional spaces here

A vector space is defined over a field K, e.g R or C

10 / 145

Page 11: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum Probabilities Introduction

Preliminaries: Hilbert Spaces and Inner Products

Complex Numbers

Complex Numbers

Complex number z ∈ C

z = a+ ib, a,b ∈ R, i2 =−1

Polar form: z =r(cos(φ)+ isin(φ)) = reiφ

with r ∈ R+,φ ∈ [0,2π]

Addition/Multiplication:z1 = a1+ ib1 = r1eiφ1 ,z2 = a2+ ib2 = r2eiφ2 :z1+z2 =(a1+a2)+ i(b1+b2)z1 ·z2 = r1r2ei(φ1+φ2)

11 / 145

Page 12: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum Probabilities Introduction

Preliminaries: Hilbert Spaces and Inner Products

Complex Numbers

Complex Numbers

Complex number z ∈ C

z = a+ ib, a,b ∈ R, i2 =−1

Polar form: z =r(cos(φ)+ isin(φ)) = reiφ

with r ∈ R+,φ ∈ [0,2π]

Complex conjugatez = a− ib = re−iφ

b = 0⇔ z ∈ R⇔ z = z

Absolute value|z|=

p

a2+b2 = r =p

zz|z|2 = zz

12 / 145

Page 13: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum Probabilities Introduction

Preliminaries: Hilbert Spaces and Inner Products

Hilbert Spaces

Vector Space

Vector Space

Set V of objects called vectors satisfyingAddition: ∀x,y ∈ V : x+y ∈ V and

Commutative: x+y = y+xAssociative: (x+y)+z = x+(y+z)Origin: ∃!0 ∈ V : x+0 = x ∀x ∈ VAdditive inverse: ∀x ∈ V ∃!−x with x+(−x) = ϕ

Multiplication by scalar: Let α ∈ K be a scalar and x ∈ V . Thenαx is the product of α and x with the properties

Associative: (αβ)x = α(βx)Distributive:

α(x+y) = αx+αy(α+β)x = αx+βx

13 / 145

Page 14: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum Probabilities Introduction

Preliminaries: Hilbert Spaces and Inner Products

Hilbert Spaces

Vector SpaceExample

Example: n-dimensional complex vector space Cn:

x =

x1...

xn

with xi ∈ C and

x+y =

x1+y1...

xn +yn

and αx =

αx1...

αxn

14 / 145

Page 15: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum Probabilities Introduction

Preliminaries: Hilbert Spaces and Inner Products

Hilbert Spaces

Vector SpaceLinear Combinations

Linear combination: y = c1x1+ . . .+cnxn

{x1, . . . ,xn} are linearly independent if

c1x1+ . . .+cnxn = 0 iff c1 = c2 = . . .= cn = 0

with 0 being the zero vector

15 / 145

Page 16: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum Probabilities Introduction

Preliminaries: Hilbert Spaces and Inner Products

Hilbert Spaces

Vector SpaceBasis

(Finite) Basis

A set of n linearly independent vectors B= {x1, . . . ,xn} form a(finite) basis of a vector space V if every vector in V is a linearcombination of vectors in B:

x = c1x1+ . . .+cnxn =∑

i

cixi

Example: Canonical basis in Rn (orthonormal basis)

e1 =

10.0

,e2 =

01.0

, . . . ,en =

00.1

with x =n∑

i=1

xiei

The number of elements in B is the dimension dim(V) of V16 / 145

Page 17: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum Probabilities Introduction

Preliminaries: Hilbert Spaces and Inner Products

Hilbert Spaces

Vector SpaceSubspace

Subspace

A non-empty subset V ′ of a vector space V is a subspace if along withevery pair x,y ∈ V ′, every linear combination αx+βy ∈ V ′.

A subspace is also a vector space

dim(V ′)≤ dim(V)For each x ∈ V ′, x−x = 0 ∈ V ′ (each subspace passes throughthe origin)

Example: Each 2-dimensional plane that passes through theorigin is a subspace of R3

17 / 145

Page 18: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum Probabilities Introduction

Preliminaries: Hilbert Spaces and Inner Products

Hilbert Spaces

Hilbert SpaceInner product/1

Hilbert space H: vector space with an inner product

(Complex) Inner Product

A function ⟨., .⟩ ∈ C with

Conjugate symmetry: ⟨x,y⟩= ⟨y,x⟩ (symmetric if ⟨., .⟩ ∈ R, inparticular ⟨x,x⟩ ∈ R!)

Linearity:⟨λy,x⟩= λ⟨x,y⟩= ⟨y,λx⟩⟨x+y,z⟩= ⟨x,z⟩+ ⟨y,z⟩Positive definite: ⟨x,x⟩ ≥ 0 and ⟨x,x⟩= 0 iff x = 0

for λ ∈ C and x,y,z ∈H

18 / 145

Page 19: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum Probabilities Introduction

Preliminaries: Hilbert Spaces and Inner Products

Hilbert Spaces

Hilbert SpaceInner product/2

Properties of the inner product:

Real case (quite obvious):

⟨x,αy1+βy2⟩= α⟨x,y1⟩+β⟨x,y2⟩Be careful in the complex case:

⟨x,αy1+βy2⟩ = ⟨αy1+βy2,x⟩= α⟨y1,x⟩+β⟨y2,x⟩= α⟨x,y1⟩+β⟨x,y2⟩

19 / 145

Page 20: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum Probabilities Introduction

Preliminaries: Hilbert Spaces and Inner Products

Hilbert Spaces

Hilbert SpaceInner product/3

There are many possible inner products, they need to makesense for the application

Inner product example (standard inner product)

⟨x,y⟩ = x†y =∑

i

x iyi

= xTy =∑

i

xiyi if real

with row vectors

x† = (x1, . . . ,xn) (adjoint)

xT = (x1, . . . ,xn) (transpose)

20 / 145

Page 21: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum Probabilities Introduction

Preliminaries: Hilbert Spaces and Inner Products

Hilbert Spaces

Hilbert SpaceNorm

Norm

||x||=p

⟨x ,x⟩is the norm of a vector x.

Geometric interpretation: length of the vector||x|| ∈ RStandard inner product:

||x||=s

n∑

i=1

x2i=Æ

x21+ . . .+x2

n

Vector x with ||x||= 1 is called a unit vector , e.g. x =

� 1p2

1p2

21 / 145

Page 22: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum Probabilities Introduction

Preliminaries: Hilbert Spaces and Inner Products

Hilbert Spaces

Hilbert SpaceOrthogonality

Orthogonality

Two vectors x and y are orthogonal if ⟨x,y⟩= 0

Example (R2):

x =

� 1p2

1p2

,y =

� 1p2

− 1p2

Example (C2):

x =

ii

,y =

i−i

22 / 145

Page 23: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum Probabilities Introduction

Preliminaries: Hilbert Spaces and Inner Products

Operators and Projectors

Towards Projectors

One of the most important operations for quantum probabilitiesare projectors

We need to learn about linear operators and their matrixrepresentation first (see also [van Rijsbergen, 2004, Chapter 4])

23 / 145

Page 24: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum Probabilities Introduction

Preliminaries: Hilbert Spaces and Inner Products

Operators and Projectors

Linear Operator

Basic operations in a Hilbert space H are performed by linearoperators (a special case of linear maps).

Linear Operator

A linear operator is a map f :H 7→H such that for any scalar λ ∈ Cwe have

f (x+λy) = f (x)+λf (y)

Examples: rotation, projection, scaling

Linear operators can be represented by a (square) matrix A

Applying A to vector x: Ax

24 / 145

Page 25: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum Probabilities Introduction

Preliminaries: Hilbert Spaces and Inner Products

Operators and Projectors

Operators as Matrices

yi =n∑

k=1

aik xk

25 / 145

Page 26: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum Probabilities Introduction

Preliminaries: Hilbert Spaces and Inner Products

Operators and Projectors

Product of TransformationsCommutativity

The product of two transformations A and B is defined by theeffect it has on the vector x

ABx means: Apply B to x then apply the result to A

ABx is usually (but not always) different from BAx

If ABx = BAx, that is AB = BA, A and B are said to becommutative (commuting)

They are non-commutative if AB 6= BA

26 / 145

Page 27: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum Probabilities Introduction

Preliminaries: Hilbert Spaces and Inner Products

Operators and Projectors

Product of TransformationsMatrix Multiplication

AB = C

cij =n∑

k=1

aik bkj

27 / 145

Page 28: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum Probabilities Introduction

Preliminaries: Hilbert Spaces and Inner Products

Operators and Projectors

Product of TransformationsMatrix Multiplication Example

1 23 4

��

3 45 6

=

13 1629 36

6=�

15 2223 34

=

3 45 6

��

1 23 4

non-communitative

28 / 145

Page 29: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum Probabilities Introduction

Preliminaries: Hilbert Spaces and Inner Products

Operators and Projectors

Adjoints

Adjoint and Self-Adjoint (Hermitian)

The adjoint of a linear operator (or matrix) A is an operator A† so that

⟨A†x,y⟩= ⟨x,Ay⟩An operator/matrix is self-adjoint (Hermitian) if A = A†.

When the scalars are complex: A† = AT (conjugate transpose)

In the real case: A† = AT   symmetric (AT = A) and self-adjointmatrices are the same

29 / 145

Page 30: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum Probabilities Introduction

Preliminaries: Hilbert Spaces and Inner Products

Operators and Projectors

AdjointsExample

Real case:�

a xu c

�†

=

a ux c

Complex case:

a+ ib x− iyu− iv c+ id

�†

=

a− ib u+ ivx + iy c− id

[van Rijsbergen, 2004, p. 55]

30 / 145

Page 31: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum Probabilities Introduction

Preliminaries: Hilbert Spaces and Inner Products

Operators and Projectors

Orthogonal Projector

Orthogonal Projector

An orthogonal projector P is an idempotent, self-adjoint linear operatorin H.

Idempotent: P = PP (P leaves its image unchanged)There is a one-to-one correspondence between orthogonalprojectors and subspaces

Each vector orthogonal to the subspace projects to 0

We use the notation P to denote the orthogonal projector (amatrix) that represents a subspace

31 / 145

Page 32: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum Probabilities Introduction

Preliminaries: Hilbert Spaces and Inner Products

Operators and Projectors

Orthogonal ProjectorExample

P

z

y = Py

x

Px

P =

1 0 00 1 00 0 0

z =

001

orthogonal to P,

hence Pz = 0

y =

010

contained in P,

so projection has no effect

32 / 145

Page 33: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum Probabilities Introduction

Preliminaries: Hilbert Spaces and Inner Products

Operators and Projectors

Special Projector: Ray

Ray

If x is a unit vector (||x||= 1), then xx† is an orthogonal projector ontothe 1-dimensional subspace defined by x. This projector is called a rayand denoted Px .

Example (R2):

e1 =

10

e1e†1=

10

1 0�

=

1 00 0

= Pe1

Pe1e1 =

1 00 0

��

10

=

10

Pe1

11

=

10

33 / 145

Page 34: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum Probabilities Introduction

Preliminaries: Hilbert Spaces and Inner Products

Operators and Projectors

Spectral Theorem

Spectral Theorem

To any self-adjoint matrix A on a finite-dimensional complex innerproduct space V there correspond real numbers α1, . . . ,αr andprojectors E1, . . . ,Er , r ≤ dim(V), so that

1 the αj are pairwise distinct;

2 the Ej are mutually orthogonal;

3∑r

j=1Ej = I (I is the identity matrix);

4 A =∑r

j=1αjEj

Orthogonality: Ei⊥Ej iff EiEj = EiEj = 0The αj are the distinct eigenvalues of AThe Ej are the subspaces generated by the eigenvectors

34 / 145

Page 35: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum Probabilities Introduction

Preliminaries: Hilbert Spaces and Inner Products

Operators and Projectors

Spectral TheoremSimple Example

A =

1 0 00 0 00 0 0

Eigenvalues α1 = 1 α2 = 0

A = α1

1 0 00 0 00 0 0

︸ ︷︷ ︸

E1

+α2

0 0 00 1 00 0 1

︸ ︷︷ ︸

E2

Eigenvector e1 =

100

: e1e†

1= E1

35 / 145

Page 36: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum Probabilities Introduction

Preliminaries: Hilbert Spaces and Inner Products

Tensor Spaces

Tensor Space

A number of Hilbert spaces H1, . . . ,Hn can be combined to acomposite tensor space H=H1⊗ . . .⊗Hn

Ifn

eij

o

is an orthonormal basis of Hi , then

i,j

eij

(the tensor product of all combination of basis vectors) is anorthonormal basis of the tensor space

36 / 145

Page 37: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum Probabilities Introduction

Preliminaries: Hilbert Spaces and Inner Products

Tensor Spaces

Tensor SpaceExample

C21⊗C2

2is a 4-dimensional space C4 with base vectors

¦

e11⊗e2

1,e1

1⊗e2

2,e1

2⊗e2

1,e1

2⊗e2

2

©

e1i

©

andn

e2j

o

base vectors of C21

and C22, respectively)

37 / 145

Page 38: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum Probabilities Introduction

Preliminaries: Hilbert Spaces and Inner Products

Tensor Spaces

Tensor SpaceProduct Operators

If A is an operator in H1 and B is an operator in H2, then A⊗B isan operator in H1⊗H2 and it is

(A⊗B)(a⊗b) = Aa⊗Bb

for a ∈H1, b ∈H2 and a⊗b ∈H1⊗H2.

A matrix representation for the tensor product is given by theKronecker product (see also [Nielsen and Chuang, 2000, p. 74])

38 / 145

Page 39: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum Probabilities Introduction

Quantum Probabilities

Quantum Probabilities

39 / 145

Page 40: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum Probabilities Introduction

Quantum Probabilities

Quantum Probabilities

40 / 145

Page 41: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum Probabilities Introduction

Quantum Probabilities

Quantum and Classical Probabilities

Quantum and Classical Probabilities

Quantum probabilities are used in quantum theory to describe thebehaviour of matter at atomic and subatomic scales

Quantum probabilities are a generalisation of classical probabilitytheory

41 / 145

Page 42: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum Probabilities Introduction

Quantum Probabilities

Quantum and Classical Probabilities

Quantum and Classical ProbabilitiesCorrespondance

Sample space Set Hilbert spaceAtomic event Element Ray

Event Subset SubspaceNull element Empty set Empty spaceMembership Indicator function Projector

Exclusiveness Empty intersection Empty intersection

42 / 145

Page 43: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum Probabilities Introduction

Quantum Probabilities

Dirac Notation

Dirac Notation

In many textbooks on quantum mechanics a different (and quitehandy) notation is used for vectors, the so-called Dirac notation(named after Paul Dirac).

Dirac Notation

A vector y in a Hilbert space H is represented by a |y⟩, a ket. A bra⟨x | denotes a linear functional (a map f :H 7→ K). Thus the bra(c)ket⟨x |y⟩ denotes the inner product ⟨x,y⟩= x†y = f (y).

⟨x |= (|x⟩)† (adjoint) if K = C

⟨x |= (|x⟩)T (transpose) if K = R

Bra linear: ⟨x |(α |y⟩+β |z⟩) = α⟨x |y⟩+β⟨x |z⟩

43 / 145

Page 44: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum Probabilities Introduction

Quantum Probabilities

Dirac Notation

Dyads

Dyads are a special class of operators

Outer product of a ket and a bra (a matrix!): |x⟩⟨y |In particular useful to describe (projectors onto) rays: Px = |x⟩⟨x |Example:

01

0 1�

=

0 00 1

44 / 145

Page 45: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum Probabilities Introduction

Quantum Probabilities

Dirac Notation

Notation

Vector x or |x⟩Adjoint x† or ⟨x | or A† (for matrices)

Inner product ⟨x,y⟩ or x†y or ⟨x |y⟩Projector S projects onto subspace S (and represents it)

Ray Px ray projector determined by |x⟩Standard probability PrQuantum probability bPr

45 / 145

Page 46: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum Probabilities Introduction

Quantum Probabilities

Events

Events

An event S in quantum probabilities is described by the subspaceS

Atomic events are 1-dimensional subspaces (rays)Combination of events (a glimpse into quantum logics):

Join ∨: spanning subspaceMeet ∧: biggest included subspaceComplement ⊥: orthogonal subspace

If S1 and S2 commute:S1∨S2 = S1+S2−S1S2S1∧S2 = S1S2S⊥ = I−S (I identity matrix)

46 / 145

Page 47: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum Probabilities Introduction

Quantum Probabilities

Quantum Logics

Quantum LogicViolation of distributive law

Let

|φ1⟩=�

10

|φ2⟩=�

1/p

21/p

2

|φ3⟩=�

01

Then

(Pφ1 ∧Pφ2)∨Pφ3 = Pφ3

but

(Pφ1 ∨Pφ3)∧ (Pφ2 ∨Pφ3) = I

|φ1⟩

Pφ1

|φ2⟩

Pφ2

|φ3⟩Pφ3

47 / 145

Page 48: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum Probabilities Introduction

Quantum Probabilities

Quantum Logics

The Classical Case

Quantum logics and probability reduces to classical logic andprobability if all measures are compatible

Compatibility basically means that the involved projectors arecommuting (so the order does not matter)

This happens when all event rays are orthogonal

48 / 145

Page 49: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum Probabilities Introduction

Quantum Probabilities

Quantum States

Quantum states

Quantum probabilities are induced by quantum states

A pure quantum state (or pure state) is represented by a unitvector |φ⟩ (the state vector) in the Hilbert space HA mixed (quantum) state is a probabilistic mixture of pure states

Both kinds of states can be described by probability densities (amatrix with trace 1)

49 / 145

Page 50: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum Probabilities Introduction

Quantum Probabilities

Pure States

Quantum Probabilities: Pure state

Quantum probability (pure state)

The probability of the event S given the state φ is the squared lengthof the projection onto the corresponding subspace S:

bPr(S|φ) = ||S |φ⟩||2

S

|φ⟩

S |φ⟩

Example: Qubit (simplest quantummechanical system)

Orthonormal basis |0⟩ and |1⟩State vector |φ⟩= α |0⟩+β |1⟩ withα,β ∈ C and |α|2+ |β|2 = 1

bPr(1|φ) = ||P1 |φ⟩||2= |||1⟩⟨1| |φ⟩||2= ||⟨1|φ⟩ |1⟩||2= (|⟨1|φ⟩| |||1⟩||)2

= |⟨1|φ⟩|2 = |β|2

|0⟩

|1⟩|φ⟩

We can learn from the qubit example

Qubit state |φ⟩=α |0⟩+β |1⟩ is a superposition of the states |0⟩and |1⟩α,β ∈ C are probability amplitudes (with bPr(0) = |α|2)

Classical bit can only be in state |0⟩ or |1⟩

Non-classical example

S1 and S2 are mutually exclusive events(biggest included subspace is 0)bPr(S1|φ)+ bPr(S2|φ)> 1!

S1

S2

|φ⟩

50 / 145

Page 51: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum Probabilities Introduction

Quantum Probabilities

Pure States

Vector Space ModelAn example “quantum” system

We are now ready to build our first “quantum” IR system

Vector space model: document d and query q normalised vectors|d⟩ and |q⟩ in term space Rn

|q⟩ state vector, Pd = |d⟩⟨d |bPr(d |q) = ||Pd |q⟩||2= |||d⟩ ⟨d |q⟩||2

= ||⟨d ,q⟩||2=cos2(θ)

= ||⟨q,d⟩||2= |||q⟩ ⟨q|d⟩||2=�

�Pq |d⟩�

2= bPr(q|d)

|d⟩ state vector, Pq = |q⟩⟨q| (dual view)

|d⟩ |q⟩

θ

51 / 145

Page 52: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum Probabilities Introduction

Quantum Probabilities

Pure States

Effect of Measurement

We have seen how we can express the probability of and event S

What happens if we observe or measure that an event occurs (forinstance the relevance of a document)?

The state needs to be updated

For a pure state φ this is just the normalised projection

�φ′�

= |φ⟩Â S =S |φ⟩||S |φ⟩||

S

|φ⟩

S |φ⟩

An immediate observation of the same event would not changethe state any more (S |φ′⟩= |φ′⟩, S idempotent!)bPr(S|φ′) = 1

52 / 145

Page 53: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum Probabilities Introduction

Quantum Probabilities

Pure States

Effect of Measurement

We have seen how we can express the probability of and event S

What happens if we observe or measure that an event occurs (forinstance the relevance of a document)?

The state needs to be updated

For a pure state φ this is just the normalised projection

�φ′�

= |φ⟩Â S =S |φ⟩||S |φ⟩||

S

|φ′⟩

An immediate observation of the same event would not changethe state any more (S |φ′⟩= |φ′⟩, S idempotent!)bPr(S|φ′) = 1

53 / 145

Page 54: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum Probabilities Introduction

Quantum Probabilities

Pure States

Order Effects

|a⟩

A

|b⟩

B

|x⟩

X

Quantum probabilities provide a theoryfor explaining order effects

Such effects appear when incompatiblemeasures are involved

We sketch this with a simple example

54 / 145

Page 55: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum Probabilities Introduction

Quantum Probabilities

Pure States

Order Effects

|a⟩

A

|b⟩

B

|x⟩

X

3 Events A,B,XA = |a⟩⟨a|B = |b⟩⟨b|X = |x⟩⟨x |All non-commutative!bPr(X |AB) 6= bPr(X |BA)The probability that we observe X isdifferent if we observed A then B or if weobserved B then A

[van Rijsbergen, 2004]: Determiningrelevance then aboutness is not thesame as determining aboutness thenrelevance

55 / 145

Page 56: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum Probabilities Introduction

Quantum Probabilities

Pure States

InterferenceThe Double Slit Experiment

(Taken from [Feynman, 1951])

Physical experiment thatmotivates interference

Some works (e.g.[Zuccon et al., 2009,Melucci, 2010b]) use thisanalogy for IR

Particle either passes slit 1 orslit 2 before it appearssomewhere on the screen

Probability Pr(x) that itappears at position x?

56 / 145

Page 57: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum Probabilities Introduction

Quantum Probabilities

Pure States

InterferenceThe Double Slit Experiment

(Taken from [Feynman, 1951]) Classical Kolmogorovianprobabilities:

Pr12(x) = Pr(x |slit 1 or slit 2)

= Pr(x |slit 1)+Pr(x |slit 2)

= Pr1(x)+Pr2(x)

But this is not what we observe!

57 / 145

Page 58: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum Probabilities Introduction

Quantum Probabilities

Pure States

InterferenceThe Double Slit Experiment

(Taken from [Feynman, 1951]) Quantum Probabilities: Useprobability amplitudes!

An amplitude ϕ is a complexnumber

Refer to qubit example: ϕcould be an inner productbPr(x) = |ϕ(x)|2

ϕ12 = ϕ(x |slit 1 or slit 2)

= ϕ(x |slit 1)+ϕ(x |slit 2)

= ϕ1+ϕ2

58 / 145

Page 59: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum Probabilities Introduction

Quantum Probabilities

Pure States

InterferenceThe Double Slit Experiment

(Taken from [Feynman, 1951]) Following [Zuccon, 2012, p. 80]

bPr12(x) = |ϕ12|2 = |ϕ1+ϕ2|2= |ϕ1|2+ |ϕ2|2+

ϕ1ϕ2+ϕ2ϕ1

= bPr1(x)+ bPr2(x)+

2 ·Æ

bPr1(x)Æ

bPr2(x)·cos(θ1−θ2)

= bPr1(x)+ bPr2(x)+ I12

with ϕ1 = r1eiθ1 and ϕ2 = r2eiθ2

59 / 145

Page 60: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum Probabilities Introduction

Quantum Probabilities

Pure States

InterferenceInterference Term

I12 = 2 ·p

bPr1(x)p

bPr2(x) ·cos(θ1−θ2) is called the interferenceterm

It also depends on the phase θ1−θ2 ofthe two complex numbers involved

I12 = 0⇔ cos(θ1−θ2) = 0⇔θ1−θ2 =π2 +kπ

(both numbers are perpendicular in the complex plane)

60 / 145

Page 61: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum Probabilities Introduction

Quantum Probabilities

Pure States

Composite SystemsTensor Spaces

Quantum systems (Hilbert spaces) can be combined using thetensor product (see also [Griffiths, 2002])

If |φi⟩ ∈Hi is the state of system i then

i

|φi⟩

is the state in the composite system⊗

i Hi (product state)

Let Si be a subspace (event) in Hi . Then

bPr

i

Si |⊗

i

φi

!

=∏

i

bPr(Si |φi) (1)

61 / 145

Page 62: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum Probabilities Introduction

Quantum Probabilities

Pure States

Composite Systems2 Qubit Example, Separable State

|0⟩

|1⟩|φ1⟩

|0⟩

|1⟩

|φ2⟩

Combining two qubits with

|φ1⟩= a1 |0⟩+a2 |1⟩|φ2⟩= b1 |0⟩+b2 |1⟩

State of composite system is the product state

|φ1⟩⊗ |φ2⟩= a1b1 |00⟩+a1b2 |01⟩+a2b1 |10⟩+a2b2 |11⟩(with, e.g., |01⟩= |0⟩⊗ |1⟩)If composite state is a product state, it is said to be separableBoth systems are independent – if we measure, say, |1⟩in the firstqubit1, we can still measure either |0⟩ or |1⟩ in the second one!Bivariate distribution with ai , bi as marginals [Busemeyer, 2012]

1Expressed by the subspace |10⟩⟨10|+ |11⟩⟨11|62 / 145

Page 63: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum Probabilities Introduction

Quantum Probabilities

Pure States

Composite SystemsEntanglement

There are states in a composite system that cannot be expressedas product states

For example in the 2 qubit system,

|φ⟩= 1p

2|00⟩+ 1

p2|11⟩

is such a non-separable state

The systems are not independent any more – if for instance wemeasure |0⟩ in the first qubit, this means the second qubit will bein state |0⟩!The composite system is in an entangled state

Equation 1 does not hold any more

63 / 145

Page 64: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum Probabilities Introduction

Quantum Probabilities

Mixed States

Mixed State

In general we don’t know the state ofthe system (or in IR we don’t knowwhat the user really wants)

We assume the system to be in acertain state with a certain(classical!) probability

S

|φ1⟩(Pr(φ1)) |φ2⟩(Pr(φ2))

|φ3⟩(Pr(φ3))

Mixed (Quantum) State

We assume the system is in a state φi with probability Pr(φi) so that∑

i Pr(φi) = 1. Then the probability of an event S is

bPr(S) =∑

i

Pr(φi) bPr(S|φi) =∑

i

Pr(φi) ||S |φi⟩||2

64 / 145

Page 65: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum Probabilities Introduction

Quantum Probabilities

Mixed States

The Effect of MeasurementMixed State

When observing/measuring S all state vectors are projected andrenormalised, resulting in a new state set {ψi}The probability of each new state ψi is computed as follows:

Pr(ψi |S) =∑

φ|ψi=|φ⟩ÂS

bPr(S|φ)Pr(φ)bPr(S)

φ|ψi = |φ⟩Â S means all vectors that have the same normalisedprojection |ψi⟩Conditional quantum probabilities:

bPr(S2|S1) =∑

ψ

bPr(S2|ψ)Pr(ψ|S1)

65 / 145

Page 66: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum Probabilities Introduction

Quantum Probabilities

Density Operators

Preliminaries: Trace

Trace

tr(T) =n∑

i=1

⟨ei |T |ei⟩

is known as the trace of T with {|ei⟩} as an orthonormal basis. It isequal to the sum of the diagonal elements of T.

Some important properties (see [van Rijsbergen, 2004, p. 79]):

Linearity: tr(αT1+βT2) = αtr(T1)+βtr(T2)

Cyclic permutation: e.g. tr(T1T2) = tr(T2T1)

trT† = tr(T)

tr(T)≥ 0 if T is positive definite

An operator T is of trace class if T is positive and its trace is finite66 / 145

Page 67: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum Probabilities Introduction

Quantum Probabilities

Density Operators

Trace and ProbabilityDensity Operator

bPr(S) =∑

i

Pr(φi) ||S |φi⟩||2

=∑

i

Pr(φi)⟨Sφi |Sφi⟩ Def. norm

=∑

i

Pr(φi)⟨φi |S |φi⟩ S self-adjoint, idempotent

=∑

i

Pr(φi) tr(S |φi⟩⟨φi |) [Nielsen and Chuang, 2000, p. 76]

= tr

S∑

i

Pr(φi) |φi⟩⟨φi |︸ ︷︷ ︸

Trace linearity

67 / 145

Page 68: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum Probabilities Introduction

Quantum Probabilities

Density Operators

Density Operator

We saw that bPr(S) = tr(Sρ)ρ is a density operator (usually a density matrix)

ρ encodes a quantum probability distribution (either mixed orpure)

Density Operator

A density operator ρ is a trace-class operator with tr(ρ) = 1.

Simple example (2 events with probability 1/2):

ρ=

12 00 1

2

68 / 145

Page 69: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum Probabilities Introduction

Quantum Probabilities

Density Operators

Gleason’s Theorem

Gleason’s Theorem

Let μ be any measure on the closed subspaces of a separable (real orcomplex) Hilbert space H of dimension of at least 3. There exists apositive self-adjoint operator T of trace class such that, for all closedsubspaces S of H,

μ(S) = tr(TS)

If μ is a probability measure, then tr(T) = 1, so T is a densityoperator.

69 / 145

Page 70: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum Probabilities Introduction

Quantum Probabilities

Density Operators

Gleason’s TheoremIn other words...

Following the Piwowarski/Melucci tutorial:

Distribution over a Hilbert Space

A distribution over a Hilbert space H is any functionbPr : S ⊆H 7→ [0,1] such that:

bPr(∅) = 0 and bPr(Pφ)≥ 0 ∀φ ∈H∑

ibPr(Pei ) = 1 for any basis {ei}

Gleason’s Theorem

To every probability distribution over a Hilbert space H (dimension≥ 3), there exists a unique density matrix ρ such that for any S ⊆H

bPr(S) = tr(ρS)

70 / 145

Page 71: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum Probabilities Introduction

Quantum Probabilities

Density Operators

What does this mean?

Gleason’s theorem provides a 1-to-1 relationship betweenquantum probability distributions and density operators

We can approximate density operators using spectral techniques,decompositions etc.

71 / 145

Page 72: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum Probabilities Introduction

Quantum Probabilities

Density Operators

Some Observations

If the system is in a pure state φ, the density operator isρ= |φ⟩⟨φ|  pure distributions are represented by projectorsDensity matrices are HermitianApplying the spectral theorem, we can decompose ρ:

ρ=∑

i

piEi

The pi ≥ 0 (with∑

i pi = 1 and pi ∈ R) are the eigenvalues andprobabilities associated to the events represented by theprojectors Ei

Example

ρ=

12 00 1

2

=1

2

1 00 0

+1

2

0 00 1

72 / 145

Page 73: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum Probabilities Introduction

Quantum Probabilities

Density Operators

Measurement and Conditional Probabilities

Update of density matrix after measuring/observing S1:

ρ′ =S1ρS1

tr(S1ρS1)=

S1ρS1

tr(ρS1)

Lüders’ Rule for conditional probabilities:

bPr(S2|S1) =tr(S1ρS1S2)

tr(ρS1)

If S1 and S2 are compatible, this reduces to classicalconditionalisation (see [Hughes, 1992, p. 224])

73 / 145

Page 74: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum Probabilities Introduction

Quantum Probabilities

Density Operators

Tensor product

Let ρi be the state of the system represented by Hi . Then

ρi ⊗ . . .⊗ρn

is the state of the composite system H1⊗ . . .⊗Hn

bPr(⊗

i Si |⊗

i ρi) =∏

ibPr(Si |ρi)

Note that there can be a state ρ in the composite system that isnot separable any more

74 / 145

Page 75: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum Probabilities Introduction

Quantum Probabilities

Conclusion

Quantum ProbabilitiesConclusion

Introduced quantum probabilities based on Hilbert Spaces

Some salient features: pure and mixed states, densities,measurement/observation, order effects, interference, compositesystems, entanglement

Quantum probabilities as a generalisation of classicalprobabilities

Now: How can we use this for information retrieval?

75 / 145

Page 76: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum-inspired IR Models

The QIA ModelQuantum Information AccessPolyprepresentation

Quantum Probability Ranking Principle

Page 77: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum Information Access (QIA)

Page 78: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum-inspired IR Models

The QIA Model

Notation Wrap-Up

Hilbert space: vector space with an inner product

Dirac Notation:|φ⟩ is a ket (a vector φ)⟨φ| is a bra (a transposed vector φT)⟨φ|ψ⟩ ∈ C is a bra(c)ket (inner product)|φ⟩⟨ψ| is a ketbra (a matrix)

A subspace S is represented by a projector (another matrix)

|φ⟩⟨φ| projector onto 1-dimensional subspace

Orthogonal projection of vector |φ⟩ onto S: S |φ⟩

78 / 145

Page 79: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum-inspired IR Models

The QIA Model

Quantum Information Access

Assumptions underlying QIA

IR system uncertain about user’s information need (IN)System view of the user’s IN becomes more and more specificthrough interaction

The IN may change from the user’s point of view

There is an IN Space, a Hilbert space

79 / 145

Page 80: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum-inspired IR Models

The QIA Model

Quantum Information Access

Information Need Space

|quantummechanics〉

INs as vectors: INvector |φ⟩Event “document d isrelevant” representedby subspace R

Probability ofrelevance: squaredlength of projectionPr(R|d ,φ) =||R |φ⟩ ||2Unit vector imposesrelevance distributionon subspaces (events)

80 / 145

Page 81: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum-inspired IR Models

The QIA Model

Quantum Information Access

Information Need Space

R

|quantummechanics〉

INs as vectors: INvector |φ⟩Event “document d isrelevant” representedby subspace R

Probability ofrelevance: squaredlength of projectionPr(R|d ,φ) =||R |φ⟩ ||2Unit vector imposesrelevance distributionon subspaces (events)

81 / 145

Page 82: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum-inspired IR Models

The QIA Model

Quantum Information Access

Information Need Space

R

|quantummechanics〉

INs as vectors: INvector |φ⟩Event “document d isrelevant” representedby subspace R

Probability ofrelevance: squaredlength of projectionPr(R|d ,φ) =||R |φ⟩ ||2Unit vector imposesrelevance distributionon subspaces (events)

82 / 145

Page 83: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum-inspired IR Models

The QIA Model

Quantum Information Access

System’s Uncertainty about User’s Intentions

R

p1

p2

p4

p3

p5

System uncertain aboutuser’s IN

Expressed by an ensemble Sof possible IN vectors (densityρ):

S ={(p1, |φ1⟩) , . . . ,(pn, |φn⟩)}Probability of relevance:

Pr(R|d ,S)=∑

i

pi ·Pr(R|d ,φi)︸ ︷︷ ︸

=||R|φ⟩||2

83 / 145

Page 84: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum-inspired IR Models

The QIA Model

Quantum Information Access

System’s Uncertainty about User’s Intentions

R

p1

p2

p4

p3

p5Dual representation usingdensity operator and tracefunction

ρ=∑

i pi · |φi⟩⟨φi |Pr(R|d ,S) = tr(ρR)

84 / 145

Page 85: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum-inspired IR Models

The QIA Model

Quantum Information Access

User Interaction and Feedback

R∗

|ϕ1〉

|ϕ2〉

|ϕ5〉

|ϕ3〉

Outcome of feedback: Queryand query reformulation, (clickon) relevant document, ...

Expressed as subspace

Project IN vectors ontodocument subspace

Document now getsprobability 1

System’s uncertaintydecreases

Also reflects changes ininformation needs

85 / 145

Page 86: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum-inspired IR Models

The QIA Model

Quantum Information Access

User Interaction and Feedback

R∗

|ϕ1〉

|ϕ2〉|ϕ4〉

|ϕ3〉

|ϕ5〉

Outcome of feedback: Queryand query reformulation, (clickon) relevant document, ...

Expressed as subspace

Project IN vectors ontodocument subspace

Document now getsprobability 1

System’s uncertaintydecreases

Also reflects changes ininformation needs

86 / 145

Page 87: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum-inspired IR Models

The QIA Model

Quantum Information Access

Textual RepresentationIN Space / Documents

|crash〉 (Term)

|car〉 (Term)

|jupiter〉 (Term)|jupiter crash〉

|car crash〉

R∗topic

|ϕ〉

IN space based on termspace

IN vectors made of documentfragments

Weighting scheme (e.g., tf,tf-idf,...)

Document is relevant to allINs found in its fragments

Document subspace Rspanned by IN vectors

No length normalisationnecessary

87 / 145

Page 88: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum-inspired IR Models

The QIA Model

Quantum Information Access

Single Query Term

Take all fragments vectors (INvectors) containing term t

This makes up ensemble St

88 / 145

Page 89: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum-inspired IR Models

The QIA Model

Quantum Information Access

Mixture

Mixture of all combinations ofterm fragments

Document must at leastsatisfy one term fragment

The more term fragments arecontained, the more relevanta document is

S(M) =∑n

i=1wiSti

wi is term weight

89 / 145

Page 90: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum-inspired IR Models

The QIA Model

Quantum Information Access

Mixture of Superposition

Superpose all combinations(e.g. 1p

2(|φ⟩+ |ψ⟩))

At least one query termfragment superposition mustbe contained

The more fragmentsuperpositions are contained,the more relevant a documentis

Indication that it works wellwith multi-term concepts (e.g.“digital libraries”)

90 / 145

Page 91: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum-inspired IR Models

The QIA Model

Quantum Information Access

Tensor product

Assumption: each termcovers an IN aspect

Tensor product of all fragmentvectors  combination of INaspects

Document must satisfy all INaspects

The more tensor products aresatisfied, the more relevant isthe document

91 / 145

Page 92: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum-inspired IR Models

The QIA Model

Quantum Information Access

What can it bring to IR?

Evaluation with several TREC collections[Piwowarski et al., 2010]

Tensor representation of query could compete with BM25

We don’t lose retrieval effectiveness in an ad hoc scenarioFramework is open for possible extension:

Different forms of interactions (query reformulations, relevancejudgements)  sessionsDiversity and noveltyStructured queries (Boolean; based on mixture, superposition andtensor)Polyrepresentation [Frommholz et al., 2010]

92 / 145

Page 93: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum-inspired IR Models

The QIA Model

Polyprepresentation

Book Store Example

93 / 145

Page 94: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum-inspired IR Models

The QIA Model

Polyprepresentation

Book Store Example

94 / 145

Page 95: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum-inspired IR Models

The QIA Model

Polyprepresentation

The Principle of PolyrepresentationBook Store Scenario

Content Author

RatingsComments

1 Get ranking for different representations2 Find the cognitive overlap

Based on different document representations, but also differentrepresentations of user’s information needHypothesis: cognitive overlap contains highly relevant documents(experiments support this)

95 / 145

Page 96: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum-inspired IR Models

The QIA Model

Polyprepresentation

How can we apply this in QIA?

Model single representations in a vector space (by example)AuthorsRatings

Combine the representations

96 / 145

Page 97: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum-inspired IR Models

The QIA Model

Polyprepresentation

Example: Author Space

Each author is a dimension

Non-orthogonal vectors:dependencies

Angle between vectorsreflects the degree ofdependency (90◦ =orthogonal = upright =independent)

Example: Jones and Smith(somehow) related, Smith andMiller not

97 / 145

Page 98: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum-inspired IR Models

The QIA Model

Polyprepresentation

Example: Author Space

Document by Smith and Miller

User seeks for documents byJones

Document retrieved due torelationship between Jonesand Smith

98 / 145

Page 99: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum-inspired IR Models

The QIA Model

Polyprepresentation

Rating Space

Example: rating scalegood/bad/average – each is adimension

“Average” rated bookrepresented by 2-dimensionalsubspace

User wants books which arerated good⇒ not relevant (|good⟩orthogonal)

Rrating

|good〉

|average〉

|bad〉

99 / 145

Page 100: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum-inspired IR Models

The QIA Model

Polyprepresentation

Combining the EvidenceTotal Cognitive Overlap and Tensors

Modelled different representations in vector space

Probabilities w.r.t. single representations

How do we express user’s IN w.r.t. all representations?

How do we get a cognitive overlap?

100 / 145

Page 101: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum-inspired IR Models

The QIA Model

Polyprepresentation

Combining the EvidenceTotal Cognitive Overlap and Tensors

Content Author

RatingsComments

Polyrepresentation space as tensor product (“⊗”) of singlespaces

Probability that document is in total cognitive overlap:Prpolyrep = Prcontent ·Prratings ·Prauthor ·Prcomments

101 / 145

Page 102: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum-inspired IR Models

The QIA Model

Polyprepresentation

Wishlist

Content Author

RatingsComments

Documents not relevant in one representation should not get avalue of 0Ignore selected representationsRelative importance of representations to user (mixing andweighting)

102 / 145

Page 103: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum-inspired IR Models

The QIA Model

Polyprepresentation

“Don’t care” dimension

Introduction of a “don’t care” dimensionPart of each document subspace  each document “satisfies” the don’t care “need”Example: Document by Smith, user doesn’t care about authorswith probability α

|smith〉

|jones〉

|∗〉α

Rauthor

α= 1 means representation is ignored at all103 / 145

Page 104: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum-inspired IR Models

The QIA Model

Polyprepresentation

Example

What the system assumes about the user’s IN:Seeks books either by Jones or by SmithLooks either for good books or doesn’t care about ratings

Assume a document d by Smith which is rated “bad”

Polyrepresentation space: 9-dimensional (3×3)

104 / 145

Page 105: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum-inspired IR Models

The QIA Model

Polyprepresentation

IN Vectors in Polyrepresentation SpaceHow do they look like and what do they mean?

Polyrepresentation space

Reflects all 4 possiblecombinations of INs w.r.t. singlerepresentations:

Smith/good: |smith⟩⊗ |good⟩Smith/dont’ care: |smith⟩⊗ |∗⟩Jones/good: |jones⟩⊗ |good⟩Jones/dont’ care: |jones⟩⊗ |∗⟩

105 / 145

Page 106: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum-inspired IR Models

The QIA Model

Polyprepresentation

Documents in Polyrepresentation Space

Represented as tensor product of single document subspaces

Here: 4-dimensional subspace (2×2)

106 / 145

Page 107: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum-inspired IR Models

The QIA Model

Polyprepresentation

Determining the Retrieval Weight

Why the system retrieves thebad book by Smith

107 / 145

Page 108: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum-inspired IR Models

The QIA Model

Polyprepresentation

Something left on the WishlistRelationships between Representations

System observes (interaction/feedback) user preferences:If book is by Smith, it has to be rated goodIf book is by Jones, don’t care about the ratings

System evolves to new state in polyrepresentation space (2combinations not allowed any more)

XX

108 / 145

Page 109: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum-inspired IR Models

The QIA Model

Polyprepresentation

What does that mean?

Only two assumed INcases left:

1 Smith/good2 Jones/don’t care

Cannot be expressed ascombination of singlerepresentations

Bad book by Smith onlyretrieved due torelationship to Jones!

109 / 145

Page 110: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum-inspired IR Models

The QIA Model

Polyprepresentation

QIA Summary

We showed how we can express polyrepresentation in a mathematicalframework inspired by quantum mechanics:

Presented IN space

Modelled different example representations

Combined representations in polyrepresentation space

110 / 145

Page 111: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum-inspired IR Models

The QIA Model

Polyprepresentation

QIA Conclusion

QIA frameworkUser’s IN as ensemble of vectorsDocuments as subspacesUser interaction and feedbackTerm space, query constructionCan compete in an ad hoc scenario

PolyrepresentationDifferent non-topical representations as subspacesPolyrepresentation space as tensor space to calculate cognitiveoverlap“Don’t care” dimension for weighting of representationsNon-separate states reflect interdependencies

111 / 145

Page 112: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum-inspired IR Models

The QIA Model

Polyprepresentation

QIA Extensions

Queries in sessions [Frommholz et al., 2011]Use geometry and projections to determine type of and handlefollow-up query (generalisation, information need drift,specialisation)

Summarisation [Piwowarski et al., 2012]QIA interpretation of LSA-based methods

Query algebra for the QIA framework [Caputo et al., 2011]

112 / 145

Page 113: Quantum Probabilities and Quantum-inspired Information Retrieval

The Quantum Probability Ranking Principle (qPRP) [Zuccon, 2012]

Page 114: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum-inspired IR Models

Quantum Probability Ranking Principle

Quantum Interference (again)

Recall the double slit experiment

Probability amplitudes instead ofprobabilities to model interference

Derived the interference term

I = ϕ1ϕ2+ϕ2ϕ1

= 2 ·Æ

bPr1(x)Æ

bPr2(x) ·cos(θ1−θ2)

to compute

bPr12(x) = bPr1(x)+ bPr2(x)+ I

114 / 145

Page 115: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum-inspired IR Models

Quantum Probability Ranking Principle

Probability Ranking Principle

Probability Ranking Principle (PRP): Rank according todecreasing Pr(R|d ,q)Which document to present next?

argmaxd∈B

(Pr(R|d ,q))

B: candidate documents not presented yet

Assumes relevance judgements are independent, nodependencies between documents

Potentially does not suit tasks considering novelty and diversitywell!

115 / 145

Page 116: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum-inspired IR Models

Quantum Probability Ranking Principle

Quantum Probability Ranking PrincipleDouble Slit Analogy

IR analogy to double slit experimentUser is “particle”Each slit corresponds to adocumentEvent that particle passes throughslit: user analyses documentScreen measures user satisfaction(proportional to bPr(R|d ,q))

Which document to present next(after user saw dA)?

argmaxdB∈B

bPr(R|dB,q)+ IdAdB

4.4 Ranking Documents within the Analogy

dB

dA

pdAdB

dA

pdAdB

......dB1dBn�1dBi

dB?

B

Figure 4.6: The IR analogous of the situation pictured in Figure 4.5.

pdAdB: there is no reason for which to expect that pdAdBi

= pdAdBj, 8dBi

, dBj2 B.

A question now arises: which document (slit) dB 2 B should be selected such

that, once coupled with document (slit) dA, the probability pdAdBis maximised?

Answering this question corresponds to define a criteria for selecting documents

such that, once coupled with the already ranked document, the likelihood of de-

livering maximum satisfaction to the user is maximised. In terms of the physical

experiment, this would correspond to selecting slits among the set of available

ones such that the likelihood of hitting the detector panel (at a particular loca-

tion) is maximised.

Di↵erent probability theories provide di↵erent answers to the previous ques-

tion. Therefore, the pair of documents that are obtained when using Kolmogoro-

vian probability theory to model our analogy may be di↵erent from those obtained

when using quantum probability theory. In the following we examine these two

88

(taken from [Zuccon, 2012])

116 / 145

Page 117: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum-inspired IR Models

Quantum Probability Ranking Principle

Quantum Probability Ranking PrincipleAssumptions

1 Ranking is performed sequentially

2 Empirical data is best described by quantum probabilities

3 Relevance of documents not assessed in isolation;documents that have been ranked before influence futurerelevance assessments

117 / 145

Page 118: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum-inspired IR Models

Quantum Probability Ranking Principle

Quantum Probability Ranking Principle (qPRP)

Let A (where A=∅ is also considered) be the set containing thedocuments that have been already retrieved until rank i−1 and let Bbe the set of candidate documents for being retrieved at rank i . Inorder to maximise its effectiveness, an Information Retrieval systemhas to rank document dB ∈ B at rank i if and only if

bPr(R|dB,q)+ IdAdB ≥ bPr(R|dC ,q)+ IdAdC ,

for any dC ∈ B. IdAdB is the sum of the quantum interference termsproduced considering all the pairs composed by dB and each alreadyretrieved document dA ∈A (similarly for IdAdC ).

118 / 145

Page 119: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum-inspired IR Models

Quantum Probability Ranking Principle

Quantum Probability Ranking Principle

Each document would be selected according to

argmaxdB∈B

bPr(R|dB,q)+∑

dA∈AIdAdB

!

Documents may interfere (at relevance level) with alreadypresented ones

Encodes dependencies between documents

119 / 145

Page 120: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum-inspired IR Models

Quantum Probability Ranking Principle

qPRP: Interference TermEstimation

Recall

IdAdB = ϕdAϕdB +ϕdBϕdA

= 2 ·Æ

bPr(R|dA,q)Æ

bPr(R|dB,q) ·cos(θAB)

(with cos(θAB) = cos(θdA−θdB))Interference governed by the phase difference θAB betweenϕdA ∈ C and ϕdB ∈ C

Destructive: cos(θAB)< 0Constructive: cos(θAB)> 0

Estimate IdAdB with similarity function fsim

cos(θAB)≈ βfsim(dA,dB)

β ∈ R: normalisation, sign of interference120 / 145

Page 121: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum-inspired IR Models

Quantum Probability Ranking Principle

qPRP: Evaluation

Different similarity functions applied in diversity task (TREC 6,7,8and ClueWeb B)

qPRP outperformed PRP and (sometimes) a Portfolio Theorysetting (but no parameter tuning required)

Kullback-Leibler divergence performed best

Pearson’s correlation coefficient seems most robust

121 / 145

Page 122: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum-inspired IR Models

Quantum Probability Ranking Principle

qPRP: EvaluationFurther Experiments

Comparison with PRP, Maximal Marginal relevance (MMR),Portfolio Theory (PT) and interactive PRP (iPRP)Ad hoc task

PRP best performing ranking approach when independenceassumption hold

Diversity taskPRP often outperformed by PT, iPRP and qPRP

122 / 145

Page 123: Quantum Probabilities and Quantum-inspired Information Retrieval

Quantum-inspired IR Models

Quantum Probability Ranking Principle

qPRP: Discussion

Kolmogorovian probabilities (in PRP) adequate when usingindependence assumption (ad hoc task)

Quantum probabilities seem good choice if documents are notindependent and interfere (diversity task)

qPRP reduces to PRP if phases are perpendicular(cos(θAB) = 0)

Integration into QIA framework possible

123 / 145

Page 124: Quantum Probabilities and Quantum-inspired Information Retrieval

Further Models and Conclusion

Further Work and SoftwareDiscussion and Conclusion

Page 125: Quantum Probabilities and Quantum-inspired Information Retrieval

Further Models and Conclusion

Further Work and Software

Further Selected Works

Quantum probability in context [Melucci, 2008]Effective query expansion with quantum interference[Melucci, 2010b]Semantics and meaning [Widdows, 2004]Entanglement and word meaning[Bruza et al., 2009b, Bruza et al., 2009a]Lattice structures and documents [Huertas-Rosero et al., 2009]Quantum theory and search [Arafat, 2007]Query expansion and query drift [Zhang et al., 2011]Document re-ranking [Zhao et al., 2011]Complex numbers in IR [Zuccon et al., 2011]DB+IR: Commutative Quantum Query Language [Schmitt, 2008]Further overview [Song et al., 2010]

125 / 145

Page 126: Quantum Probabilities and Quantum-inspired Information Retrieval

Further Models and Conclusion

Further Work and Software

Software

Kernel Quantum Probability API by Benjamin Piwowarskihttp://kqp.bpiwowar.net/

126 / 145

Page 127: Quantum Probabilities and Quantum-inspired Information Retrieval

Further Models and Conclusion

Further Work and Software

Other Resources

http://www.quantuminteraction.org/homeA collection of useful links and resources in the context of theQuantum Interaction series

http://www.mendeley.com/groups/496611/quantum-and-geometry-ir/Aiming at providing an updated list of quantum and geometry IRresearch papers

You may also follow me on Twitter: @iFromm

127 / 145

Page 128: Quantum Probabilities and Quantum-inspired Information Retrieval

Further Models and Conclusion

Discussion and Conclusion

DiscussionQuantum Theory and IR

The quantum formalism is a powerful ’language’ for IR – isn’t it?

We’ve seen some examples of quantum-inspired models (QIA,qPRP)

Quantum probabilities may give us a hint of what is wrong withexisting approaches (but not always!) [Piwowarski et al., 2012]

But there is criticism: “Ornamental but not useful” (Kantor, whohopes to be proven wrong) [Kantor, 2007]

128 / 145

Page 129: Quantum Probabilities and Quantum-inspired Information Retrieval

Further Models and Conclusion

Discussion and Conclusion

Conclusion

Quantum probabilities

2 quantum-inspired approaches: QIA and qPRP

Further approaches

129 / 145

Page 130: Quantum Probabilities and Quantum-inspired Information Retrieval

Questions?

Page 131: Quantum Probabilities and Quantum-inspired Information Retrieval

Bibliography

Bibliography I

Aerts, D. and Gabora, L. (2005).A theory of concepts and their combinations II: A Hilbert spacerepresentation.Kybernetes, 34(1/2):192–221.

Arafat, S. (2007).Foundations Research in Information Retrieval Inspired byQuantum Theory.Phd thesis, University of Glasgow.

131 / 145

Page 132: Quantum Probabilities and Quantum-inspired Information Retrieval

Bibliography

Bibliography II

Bruza, P., Kitto, K., Nelson, D., and McEvoy, C. (2009a).Extracting Spooky-Activation-at-a-Distance from Considerationsof Entanglement.In Proc. of the Third Quantum Interaction Symposium (QI 2009),volume 5494 of Lecture Notes in Computer Science, pages71–83, Berlin, Heidelberg. Springer Berlin Heidelberg.

Bruza, P., Kitto, K., Nelson, D., and McEvoy, C. (2009b).Is there something quantum-like about the human mental lexicon?

Journal of mathematical psychology, 53(5):363–377.

132 / 145

Page 133: Quantum Probabilities and Quantum-inspired Information Retrieval

Bibliography

Bibliography III

Busemeyer, J. R. (2012).Introduction to Quantum Probability for Social and BehavioralScientists.In Rudolph, L. and Valsiner, J., editors, Qualitative Mathematicsfor Social Sciences - Mathematical Models for Research onCultural Dynamics. Routledge.

Caputo, A., Piwowarski, B., and Lalmas, M. (2011).A Query Algebra for Quantum Information Retrieval.In Proceedings of the 2nd Italian Information Retrieval Workshop2011.

133 / 145

Page 134: Quantum Probabilities and Quantum-inspired Information Retrieval

Bibliography

Bibliography IV

Di Buccio, E., Melucci, M., and Song, D. (2011).Towards Predicting Relevance Using a Quantum-Like Framework.In Clough, P., Foley, C., Gurrin, C., Jones, G. J. F., Kraaij, W., Lee,H., and Mudoch, V., editors, Proceedings ECIR 2011, volume6611 of Lecture Notes in Computer Science, pages 755–758.Springer.

Feynman, R. P. (1951).The Concept of Probability in Quantum Mechanics.In Proc. Second Berkeley Symp. on Math. Statist. and Prob. (Univ.of Calif. Press, 1951), pages 533–541.

134 / 145

Page 135: Quantum Probabilities and Quantum-inspired Information Retrieval

Bibliography

Bibliography V

Frommholz, I., Larsen, B., Piwowarski, B., Lalmas, M., Ingwersen,P., and van Rijsbergen, K. (2010).Supporting Polyrepresentation in a Quantum-inspired GeometricalRetrieval Framework.In Proceedings of the 2010 Information Interaction in ContextSymposium, pages 115–124, New Brunswick. ACM.

Frommholz, I., Piwowarski, B., Lalmas, M., and van Rijsbergen, K.(2011).Processing Queries in Session in a Quantum-Inspired IRFramework.In Clough, P., Foley, C., Gurrin, C., Jones, G. J. F., Kraaij, W., Lee,H., and Mudoch, V., editors, Proceedings ECIR 2011, volume

135 / 145

Page 136: Quantum Probabilities and Quantum-inspired Information Retrieval

Bibliography

Bibliography VI

6611 of Lecture Notes in Computer Science, pages 751–754.Springer.

Griffiths, R. B. (2002).Consistent quantum theory.Cambridge University Press, Cambridge; New York.

Halmos, P. R. (1958).Finite-Dimensional Vector Spaces.Springer-Verlag New York Heidelberg Berlin.

Huertas-Rosero, A., Azzopardi, L., and van Rijsbergen, C. J.(2009).Eraser Lattices and Semantic Contents.In Proceedings Quantum Interaction 2009, pages 266–275.

136 / 145

Page 137: Quantum Probabilities and Quantum-inspired Information Retrieval

Bibliography

Bibliography VII

Hughes, R. (1992).The Structure and Interpretation of Quantum Mechanics.Harvard University Press.

Kantor, P. B. (2007).Keith van Rijsbergen, The Geometry of Information Retrieval.Information Retrieval, 10(4-5):485–489.

Melucci, M. (2008).A basis for information retrieval in context.ACM Transactions on Information Systems (TOIS), 26(3).

137 / 145

Page 138: Quantum Probabilities and Quantum-inspired Information Retrieval

Bibliography

Bibliography VIII

Melucci, M. (2010a).An Investigation of Quantum Interference in Information Retrieval.In Cunningham, H., Hanbury, A., and Rüger, S., editors,Proceedings of the First Information Retrieval Facility Conference(IRFC 2010), pages 136–151. Springer.

Melucci, M. (2010b).An Investigation of Quantum Interference in Information Retrieval.In Cunningham, H., Hanbury, A., and Rüger, S., editors,Proceedings of the First Information Retrieval Facility Conference(IRFC 2010), pages 136–151. Springer.

138 / 145

Page 139: Quantum Probabilities and Quantum-inspired Information Retrieval

Bibliography

Bibliography IX

Melucci, M. and Rijsbergen, K. V. (2011).Quantum Mechanics and Information Retrieval.In Melucci, M. and Baeza-Yates, R., editors, Advanced Topics inInformation Retrieval, volume 33 of The Information RetrievalSeries, chapter 6, pages 125–154. Springer Berlin Heidelberg,Berlin, Heidelberg.

Nielsen, M. A. and Chuang, I. L. (2000).Quantum Computation and Quantum Interaction.Cambridge University Press, Cambridge, UK.

139 / 145

Page 140: Quantum Probabilities and Quantum-inspired Information Retrieval

Bibliography

Bibliography X

Piwowarski, B., Amini, M.-R., and Lalmas, M. (2012).On using a Quantum Physics formalism for Multi-documentSummarisation.Journal of the American Society for Information Science andTechnology (JASIST).

Piwowarski, B., Frommholz, I., Lalmas, M., and Van Rijsbergen, K.(2010).What can Quantum Theory Bring to Information Retrieval?In Proc. 19th International Conference on Information andKnowledge Management, pages 59–68. Springer.

140 / 145

Page 141: Quantum Probabilities and Quantum-inspired Information Retrieval

Bibliography

Bibliography XI

Piwowarski, B. and Lalmas, M. (2009).A Quantum-based Model for Interactive Information Retrieval.In Proceedings of the 2nd International Conference on the Theoryof Information Retrieval (ICTIR 2009), Cambridge, UK.

Schmitt, I. (2008).QQL : A DB&IR Query Language.The VLDB Journal, 17(1):39–56.

Song, D., Lalmas, M., van Rijsbergen, K., Frommholz, I.,Piwowarski, B., Wang, J., Zhang, P., Zuccon, G., Bruza, P., Arafat,S., Azzopardi, L., Huertas-Rosero, A., Hou, Y., Melucci, M., andRüger, S. (2010).How Quantum Theory is Developing the Field of InformationRetrieval.

141 / 145

Page 142: Quantum Probabilities and Quantum-inspired Information Retrieval

Bibliography

Bibliography XII

In Proceedings Quantum Interaction 2010.

Song, D., Melucci, M., Frommholz, I., Zhang, P., Wang, L., andArafat, S., editors (2011).Quantum Interaction: 5th International Symposium, QI 2011,Aberdeen, UK, Revised Selected Papers.Springer Berlin Heidelberg, lecture no edition.

van Rijsbergen, C. J. (2004).The Geometry of Information Retrieval.Cambridge University Press, New York, NY, USA.

Widdows, D. (2004).Geometry and Meaning.Center for the Study of Language and Information, StanfordUniversity.

142 / 145

Page 143: Quantum Probabilities and Quantum-inspired Information Retrieval

Bibliography

Bibliography XIII

Zhang, P., Song, D., Zhao, X., and Hou, Y. (2011).Investigating Query-Drift Problem from a Novel Perspective ofPhoton Polarization.In Amati, G. and Crestani, F., editors, Advances in InformationRetrieval Theory - Proc. of ICTIR 2011, pages 332–336. Springer.

Zhao, X., Zhang, P., Song, D., and Hou, Y. (2011).A Novel Re-ranking Approach Inspired by QuantumMeasurement.In Clough, P., Foley, C., Gurrin, C., Jones, G. J. F., Kraaij, W., Lee,H., and Mudoch, V., editors, Proceedings ECIR 2011, volume6611 of Lecture Notes in Computer Science, pages 721–724.Springer.

143 / 145

Page 144: Quantum Probabilities and Quantum-inspired Information Retrieval

Bibliography

Bibliography XIV

Zuccon, G. (2012).Document Ranking with Quantum Probabilities.Phd thesis, University of Glasgow.

Zuccon, G., Azzopardi, L., and van Rijsbergen, K. (2009).The Quantum Probability Ranking Principle for InformationRetrieval.In Advances in Information Retrieval Theory - Proc. of ICTIR2009, volume 5766 of Lecture Notes in Computer Science, pages232–240, Berlin, Heidelberg. Springer Berlin Heidelberg.

144 / 145

Page 145: Quantum Probabilities and Quantum-inspired Information Retrieval

Bibliography

Bibliography XV

Zuccon, G., Piwowarski, B., and Azzopardi, L. (2011).On the use of Complex Numbers in Quantum Models forInformation Retrieval.In Amati, G. and Crestani, F., editors, Advances in InformationRetrieval Theory - Proc. of ICTIR 2011, pages 346–350. Springer.

145 / 145