IBM/Columbia team at TRECVID 2004
© 2004 IBM Corporation
About The Tradeoff Between Searching Time and Browsing Time in Interactive Search
IBM Research: A. Amir, J. O. Argillander, M. Berg, G. Iyengar, J. Kender, C-Y Lin, M. Naphade, A. Natsev, J. R. Smith, J. Tesic, G. Wu, R. Yan, D. Zhang
Columbia University: S-F. Chang, W. Hsu
Presented by Arnon Amir, arnon@almaden.ibm.com
NIST TRECVID 2004 Workshop, Gaithersburg, MD, 11/16/2004
IBM Search Systems – indexed data
We extend our thanks to:
– CLIPS-IMAG – reference shot list
– CMU – Video OCR and time alignment of CC and ASR
– LDC – Video data, closed captions
– LIMSI – ASR data
– NIST – TRECVID
[Diagram: the semantic ladder]
Multimodal query retrieval:
– Speech: combined ASR, CC, and phonetic
– Video OCR
– Visual concepts
– CBIR: color histograms, color correlograms, texture
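As an illustration only (the CBIR descriptors are named above but not specified), here is a minimal Python sketch of one of them, a quantized color histogram with histogram-intersection matching; the function names are hypothetical, not from the IBM system:

import numpy as np

def color_histogram(image, bins=8):
    # Quantize an RGB image (H x W x 3, uint8) into a normalized
    # bins^3-bucket color histogram -- a common CBIR descriptor.
    q = (image // (256 // bins)).reshape(-1, 3).astype(np.int64)
    idx = q[:, 0] * bins * bins + q[:, 1] * bins + q[:, 2]
    hist = np.bincount(idx, minlength=bins ** 3).astype(float)
    return hist / hist.sum()

def histogram_intersection(h1, h2):
    # Similarity in [0, 1]; 1.0 for identical distributions.
    return float(np.minimum(h1, h2).sum())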
System 1: Marvel (“TJW”)
Shot-based retrieval
Speech, CC
CBIR
Concepts, models, filters
Session
Vector operations, fusion
Shot aggregation
Internet-based GUI
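The slide does not say how Marvel's vector operations and fusion work; as a sketch of one common choice, here is weighted linear score fusion over min-max-normalized per-modality scores (the weights and structure are illustrative assumptions):

def fuse_scores(modality_scores, weights):
    # modality_scores: {modality_name: {shot_id: score}}
    # weights: {modality_name: float}
    # Min-max normalize each modality, then combine linearly.
    fused = {}
    for name, scores in modality_scores.items():
        lo, hi = min(scores.values()), max(scores.values())
        span = (hi - lo) or 1.0
        for shot, s in scores.items():
            fused[shot] = fused.get(shot, 0.0) + weights[name] * (s - lo) / span
    # Return shot ids as a single ranked list (highest fused score first).
    return sorted(fused, key=fused.get, reverse=True)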
System 2: Multimedia Mining (“ARC”)
Video+time retrieval
Speech, CC, phonetic
CBIR
Concepts, Video OCR
Session
Statistical ranking
Shot aggregation
Relevance Feedback
Internet-based GUI
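Which relevance-feedback method ARC uses is not stated here; as a hedged illustration, a classic Rocchio-style update over CBIR feature vectors would look like this:

import numpy as np

def rocchio_update(query_vec, relevant, nonrelevant,
                   alpha=1.0, beta=0.75, gamma=0.15):
    # Move the query vector toward the centroid of shots the user
    # marked relevant and away from the non-relevant centroid.
    q = alpha * np.asarray(query_vec, dtype=float)
    if len(relevant) > 0:
        q = q + beta * np.mean(relevant, axis=0)
    if len(nonrelevant) > 0:
        q = q - gamma * np.mean(nonrelevant, axis=0)
    return q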
System 3: Automatic Search
Shot-based retrieval
Speech, CC
CBIR
Concepts
Vector operations, fusion
Automatic query formulation
– No human in the loop, no GUI ☺
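How the automatic system turns a topic into a query is not detailed on this slide; a toy sketch of the text half of such a topic-to-query conversion follows (the stopword list is an illustrative assumption, not the system's):

# Hypothetical stopword list; the real system's is unknown.
STOPWORDS = {"find", "shots", "of", "the", "a", "an", "with", "or", "more", "one"}

def topic_to_text_query(topic_text):
    # Keep the non-stopword terms of the topic statement as query
    # terms for the ASR/CC index.
    terms = (t.strip(".,").lower() for t in topic_text.split())
    return [t for t in terms if t and t not in STOPWORDS]

# topic_to_text_query("Find shots of Boris Yeltsin") -> ['boris', 'yeltsin']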
A Perspective on TRECVID Search
What is Manual search?
What is Interactive search?
What is the importance of browsing?
What is the importance of shot elevation?
Manual Search
Input: Multimodal topic, textual + image/video samples
Task: Form the best MM query for the given topic
Measures system + human performance:
+ Human ability to form a good query
+ System discrimination
+ System generalization
Final search query is test-set independent, scalable
Interactive Search
Input: Multimodal topic, textual + image/video samples
Task: Get me as many hits as possible, now!
Measures system + human performance:
+ Human ability to form a good query
+ Human browsing skills
+ System search capabilities
+ System browsing capabilities
Final search result is test-set dependent, not scalable
Automatic Search
Samples should include only relevant shots/images (e.g., Dow Jones showing a rise)
Input: Multimodal topic, textual + image/video samples
Task: Acquire this Previously Unseen Semantic Concept
Measures system (!) performance:
– No human in the loop
– System topic->query conversion capability
– System learning capability
– System search capability
Final result is test-set independent, scalable
Interactive Search = a Combination of Search and Browse
E.g., searching for “sport” using multiple queries
A travel in video-shot space:
• Search: travel across the entire space, collect many shots
• Browse: local, in a neighborhood: storyboard, CBIR, etc.
[Diagram: a trajectory through shot space alternating queries and local browsing – Qry1: “basketball”, Qry2: “football”, Qry3: “swim dive”, Qry4: “hockey puck ice”, with a Browse step around each query’s results]
Search and Browse
                      Search           Browse
Collecting hits       Many, ranked     One at a time, unranked*
Hits                  Query matches    Anything the user can find
Starting point        Query            Result list/table
Contribution to MAP   ?                ?
Time                  Short            Long
Processor             Computer         Human
Extent                Global           Local, “show similar ones”
Browsing
Browsing, “find more like this”
Helps in query refinement
Find and pick new matches
Feedback, shot elevation
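Shot elevation, as used here, means moving the shots the user has marked correct to the head of the result list; a minimal sketch of that operation:

def elevate_shots(result_list, marked_correct, depth):
    # Move every shot marked correct within the top `depth` positions
    # to the head of the list; relative order elsewhere is preserved.
    head = [s for s in result_list[:depth] if s in marked_correct]
    head_set = set(head)
    rest = [s for s in result_list if s not in head_set]
    return head + rest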
Multimodal Query: sport~ & basketball & (N.B.A.# | NBA$) & 04.38333
(sport~ – concept; basketball – ASR/CC; N.B.A.# – phonetic; NBA$ – VOCR; 04.38333 – CBIR)
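Reading the suffix operators off this example (~ concept, bare term ASR/CC text, # phonetic, $ video OCR, numeric ID a CBIR example) – an inference from the slide's annotation, not a documented spec – a toy term classifier for the syntax might look like:

import re

def classify_term(term):
    # Suffix conventions inferred from the slide, not a documented spec.
    if term.endswith("~"):
        return ("concept", term[:-1])
    if term.endswith("#"):
        return ("phonetic", term[:-1])
    if term.endswith("$"):
        return ("vocr", term[:-1])
    if re.fullmatch(r"[\d.]+", term):
        return ("cbir_example", term)   # e.g. a keyframe/shot id
    return ("speech", term)             # plain ASR/CC text term

# classify_term("sport~")   -> ('concept', 'sport')
# classify_term("NBA$")     -> ('vocr', 'NBA')
# classify_term("04.38333") -> ('cbir_example', '04.38333')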
The impact of Shot Elevation on MAP
Simulated scenario: for each topic,
– Take the final Interactive query as the baseline list
– Elevate all the correct shots to the top, up to depth n
Plot MAP(n)
A random result list of length K (= 1000) would yield a linear expected AP(n) of

\mathrm{MAP}(n) \;=\; \frac{1}{GT}\left(1.0 \cdot p\,n \;+\; \sum_{i=n+1}^{K}\frac{p^{2}\,n + p^{2}\,(i-n)}{i}\right),
\qquad \text{where } p = \frac{GT}{K}

with GT the number of relevant (ground-truth) shots for the topic.
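Read literally, the summand simplifies, since p²n + p²(i−n) = p²i, so each term of the sum contributes exactly p², and the expected AP is indeed linear in n (a worked check, using GT = pK):

\begin{align*}
\mathrm{AP}(n) &= \frac{1}{GT}\Bigl(p\,n + \sum_{i=n+1}^{K} p^{2}\Bigr)
                = \frac{p\,n + p^{2}(K-n)}{pK} \\
               &= p + (1-p)\,\frac{n}{K},
\end{align*}

a straight line from AP(0) = p up to AP(K) = 1.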
Shot Elevation Impact on MAP – Simulation Result
[Plot: MAP(n) vs. browsing-and-marking depth n, n = 0…1000, MAP from 0 to 1]
• Above the linear baseline means the search results are better than random
• Labelling the top 100 gives more than 100% improvement in MAP
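The curve can be reproduced by simulation under the stated assumptions (random list of length K = 1000, all correct shots within depth n elevated); a sketch, with GT = 100 chosen arbitrarily:

import random

def average_precision(relevance):
    # AP of a binary relevance list, normalized by the number of
    # relevant items in the list.
    total = sum(relevance)
    hits, ap = 0, 0.0
    for rank, rel in enumerate(relevance, start=1):
        if rel:
            hits += 1
            ap += hits / rank
    return ap / total if total else 0.0

def elevated_ap(n, gt=100, k=1000, trials=200):
    # Expected AP after elevating all correct shots found in the
    # top n of a random length-k list containing gt relevant shots.
    total = 0.0
    for _ in range(trials):
        rel = [1] * gt + [0] * (k - gt)
        random.shuffle(rel)
        elevated = sorted(rel[:n], reverse=True) + rel[n:]
        total += average_precision(elevated)
    return total / trials

# elevated_ap(0) ~ 0.1 (= gt/k); elevated_ap(1000) = 1.0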
Shot Elevation Impact on Per-Topic AP
[Plot: AP(n) vs. browsing-and-marking depth n, n = 0…1000, one curve per topic]
Interactive Search Runs
Results can be much higher than for Manual search
Does it scale up for large data sets?
[Bar chart: MAP (0.00–0.40) of IBM.Interactive_1_ARC, IBM.Interactive_2_ARC, and IBM.Interactive_Marvel_TJW among all Interactive, Manual, and Automatic runs]
Manual Search Runs
Scalable: the same query may be run on large data sets
[Bar chart: MAP (0.00–0.40) of IBM.SpeechBaseline_ARC, IBM.Manual_ARC, IBM.Manual_Fusion_1_TJW, IBM.Manual_Marvel_TJW, and IBM.Manual_Fusion_2_TJW among all Interactive, Manual, and Automatic runs]
Automatic Search Runs
Automatic pilot runs:
– Results are in the ballpark of Manual search
[Bar chart: MAP (0.00–0.40) of IBM.Automatic_Fusion_TJW among all Interactive, Manual, and Automatic runs]
AP by Topics
[Two bar charts of average precision per topic, one for Manual runs and one for Interactive runs; series: Best IBM Run, 1st run, Max, Average, Median. Topics: Total MAP, Boris Yeltsin, Hockey, Sam Donaldson, Benjamin Netanyahu, Flood, Golf, Henry Hyde, Saddam Hussein, Tennis, Horses, Banners, Building on fire, Dog walk, Bicycles, Umbrellas, Bill Clinton, Keyboards, Wheelchair, Weapon firing, Stretcher, Peds+cars, Stairs, Capitol dome, Ski slalom]
Comparison – Interactive vs. Manual Runs
[Bar chart: per-topic AP of the Interactive, Manual multimodal, and Manual baseline runs over the same topics]
Topics are sorted in decreasing order of Manual AP
Conclusions
Browsing and shot elevation are essential in Interactive search
Relevance Feedback was found very helpful in query formulation
Automatic search – concept acquisition; the most objective, comparable to Manual, scales up
Manual search – less objective; scales up; speech is still the dominant component
Interactive search – very interactive, subjective; CBIR is very instrumental; may not scale up as well