TREC-2003 (CDVP TRECVID 2003 Team) - 1 -
Center for Digital Video Processing
C e n t e r f o r D I g I t a l V I d e o P r o c e s s I n g
CDVP & TRECVID-2003 Interactive Search Task Experiments
Paul Browne, Georgina Gaughan, Cathal Gurrin, Gareth J.F. Jones, Hyowon Lee, Sean Marlow,
Kieran Mc Donald, Noel Murphy, Noel E. O’Connor, Alan F. Smeaton, Jiamin Ye
Centre for Digital Video Processing
Dublin City University, Glasnevin, Dublin 9, Ireland
Contents
• Introduction
  – Físchlár Systems
• Interactive Search Experiment
  – System & Experiment Design
  – System Demonstration
  – Submitted Runs
• Findings
  – Comparing Systems Performance
  – User Observations
• Conclusions
Físchlár Demonstrator System
• A Digital Video Management System
  – Web-based, supports browsing and search
• Many different versions of the system
  – Underlying XML Architecture
    • XSL supporting display on multiple devices
• TREC2003 is our 3rd TRECVID Search Task
  – 2003: explored benefits of incorporating image and feedback into a text search process
  – 2002: explored benefits of incorporating features
  – 2001: examined different keyframe browsers
Interactive Search Experiment
• Testing whether a text/image search system incorporating ‘more like this’ feedback outperforms a text-only system.
• Developed two Físchlár systems:
  – Each highly interactive, with a keyframe browser and playback window
  – (1) Text-only search and retrieval
    • ASR (LIMSI) & CC Text
  – (2) Text & Image search incorporating a feedback mechanism
    • ASR & CC Text
    • Keyframe-keyframe similarity (image matching)
    • ‘more like this’ feedback
Experiment Set-up
• User experiments in a computer lab environment
  – We used the recommended mixing algorithm for searchers / topics
  – Number of Users: 16
    • Typical postgraduate students
    • No prior experience of using the system
  – Topics per User: 12 (6 per system)
  – Minutes per Topic: 7 (last year 4 mins)
  – Each topic evaluated 8 times, 4 times on each system… reduces the effect of user variability
  – Users were trained for 10 mins, then allowed two sample topics before the experiment
  – Coffee, cookies & headphones were provided
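The numbers above imply 24 topics (16 users × 12 topics, each topic searched 8 times). The following is a minimal sketch of one balanced assignment that satisfies those counts. It is not the recommended TRECVID mixing algorithm itself: the block layout, the offset between blocks and the names `assignments` and `coverage` are illustrative assumptions, and presentation order is ignored.

```python
from collections import Counter

N_USERS, N_TOPICS, BLOCK = 16, 24, 6
SYSTEMS = ("text-only", "text-image-feedback")

def assignments():
    """Return {user: [(topic, system), ...]} for a hypothetical balanced design."""
    # Four blocks of six topics each (assumed layout).
    blocks = [list(range(b * BLOCK, (b + 1) * BLOCK)) for b in range(N_TOPICS // BLOCK)]
    plan = {}
    for user in range(N_USERS):
        pos = user % 4  # position of the user inside a group of four
        text_block = blocks[pos]             # topics searched on the text-only system
        image_block = blocks[(pos + 2) % 4]  # a different block on the other system
        plan[user] = ([(t, SYSTEMS[0]) for t in text_block]
                      + [(t, SYSTEMS[1]) for t in image_block])
    return plan

def coverage(plan):
    """Count how often each (topic, system) pair is searched."""
    return Counter(pair for pairs in plan.values() for pair in pairs)

if __name__ == "__main__":
    plan = assignments()
    print(len(plan[0]), "topics per user")
    print(set(coverage(plan).values()), "searches per topic per system")
```

In this sketch each group of four users covers all 24 topics once on each system, so four groups give 4 searches per topic per system and 8 per topic overall.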
Experimental Setup
System Architecture
[Architecture diagram: Físchlár TREC System. Labels recoverable from the figure:]
• Web-based User Interface (Web Browser interface): Query Composition, Query Result Visualisation
• Web Application: XML Generator (Internal XML Data Generation), XSL definitions for presentation to the browser, HTML & SVG to be displayed
• Search System: Search Server, Indexing Tool, Ranking Function for Text, Ranking Function for Images, Combination
• Indexes: ASR Transcript Index, CC Transcript Index, Image Similarity Index
• Data: Search Test Collection Metadata (ASR & CC transcripts), MPEG-7 XML Video Descriptions
• Flow labels: Query, Ranked Result, Request for specific XSL to be used
• XML description, containing:
  – MPEG-7 description of the requested video
  – Query-related information
  – Scoring for each feature for each shot
  – Reference to matched shots
Two search options
• Text Search
  – Using conventional Search Engines (BM25)
  – Two employed, simple combination:
    • ASR Text
    • CC Text
      – Required alignment with the ASR text
• Image Search
  – Keyframe-keyframe or query image-keyframe similarity using:
    • 4 low-level visual features
      – 3 colour-based features and 1 edge-based feature
    • Combined to produce dissimilarity values, which were then normalised
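A minimal sketch of the two search options follows. It assumes the standard Okapi BM25 formula with default-style parameters (`k1`, `b`), a plain sum as the "simple combination" of ASR and CC scores, a plain sum of the four feature distances, and min-max normalisation. None of these specifics are stated on the slide, and all function names are hypothetical.

```python
import math
from collections import Counter

def bm25_scores(query_terms, docs, k1=1.2, b=0.75):
    """Score each tokenised document against the query with Okapi BM25."""
    n_docs = len(docs)
    avg_len = sum(len(d) for d in docs) / n_docs
    # Document frequency: number of documents containing each term.
    df = Counter(term for d in docs for term in set(d))
    scores = []
    for d in docs:
        tf = Counter(d)
        score = 0.0
        for term in query_terms:
            if term not in tf:
                continue
            idf = math.log(1 + (n_docs - df[term] + 0.5) / (df[term] + 0.5))
            norm = tf[term] + k1 * (1 - b + b * len(d) / avg_len)
            score += idf * tf[term] * (k1 + 1) / norm
        scores.append(score)
    return scores

def combine_text(asr_scores, cc_scores):
    """Assumed 'simple combination': add the per-shot ASR and CC scores."""
    return [a + c for a, c in zip(asr_scores, cc_scores)]

def min_max(values):
    """Rescale values to [0, 1]; a constant list maps to all zeros."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0.0] * len(values)
    return [(v - lo) / (hi - lo) for v in values]

def image_dissimilarity(per_feature_distances):
    """Combine per-feature distances (one list per feature), then normalise."""
    combined = [sum(column) for column in zip(*per_feature_distances)]
    return min_max(combined)

if __name__ == "__main__":
    asr = [["rocket", "launch"], ["baseball", "pitcher"], ["rocket", "rocket", "missile"]]
    cc = [["rocket"], ["pitcher", "throws"], ["missile", "launch"]]
    query = ["rocket"]
    print(combine_text(bm25_scores(query, asr), bm25_scores(query, cc)))
    # Three colour-based features and one edge-based feature, three keyframes.
    print(image_dissimilarity([[0, 1, 2], [0, 2, 4], [1, 1, 1], [0, 0, 3]]))
```

The order of operations in `image_dissimilarity` (combine first, normalise second) follows the wording of the slide; a system could equally normalise each feature before combining.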
User Interaction Differences
• User interaction is/can be different for the two systems:
[Diagram: in one system the User Query goes to Text Search only; in the other the User Query goes to Text Search and Image Search, with a Feedback Mechanism.]
Format of Results
• Results presented as Groups of Shots
  – Five sequential shots
  – Associated ASR text is also presented
  – Each shot contributes to the overall score of the group (0.08, 0.16, 0.5, 0.16, 0.08)
  – Top 100 groups of shots ranked and presented in pages of size 20
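The group scoring and paging above can be sketched as follows. The weights, the top-100 cut-off and the page size of 20 come from the slide. Centring a group on every shot, allowing overlapping groups and truncating groups at the start and end of a video are assumptions made for illustration, as are the function names.

```python
WEIGHTS = (0.08, 0.16, 0.5, 0.16, 0.08)  # contribution of each of the five shots

def group_scores(shot_scores):
    """Score the group centred on each shot as a weighted sum of five sequential shots."""
    half = len(WEIGHTS) // 2
    scores = []
    for centre in range(len(shot_scores)):
        total = 0.0
        for offset, weight in enumerate(WEIGHTS):
            idx = centre + offset - half
            # Assumption: shots outside the video simply contribute nothing.
            if 0 <= idx < len(shot_scores):
                total += weight * shot_scores[idx]
        scores.append(total)
    return scores

def ranked_pages(shot_scores, top_n=100, page_size=20):
    """Rank groups by score, keep the top_n, and split them into result pages."""
    scores = group_scores(shot_scores)
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)[:top_n]
    return [ranked[i:i + page_size] for i in range(0, len(ranked), page_size)]

if __name__ == "__main__":
    pages = ranked_pages([0.0, 0.0, 1.0, 0.0, 0.0])
    print(pages[0][0])  # index of the centre shot of the best group
```

Because the middle weight dominates, a strongly matching shot pulls its own group to the top while its neighbours still appear in lower-ranked groups.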
Feedback Mechanism
[Screenshot annotations:]
• Query panel: type in search term(s) and click on the Search button
• Search result: clicking on the Add to Query button below a keyframe adds that shot's content (text and image) to the Query panel; a subsequent search will use this shot along with the initial text term used
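The feedback step can be pictured with a small data-structure sketch. It assumes that an added shot contributes its ASR text to the text query and its keyframe to the image query. The `Shot` and `QueryPanel` classes and their fields are hypothetical and are not taken from the Físchlár code.

```python
from dataclasses import dataclass, field

@dataclass
class Shot:
    shot_id: str
    asr_text: str
    keyframe: str  # hypothetical keyframe identifier

@dataclass
class QueryPanel:
    text_terms: list = field(default_factory=list)   # initial text term(s) typed by the user
    added_shots: list = field(default_factory=list)  # shots added via "Add to Query"

    def add_to_query(self, shot):
        """Add a shot's content to the query, ignoring repeated clicks on the same shot."""
        if all(s.shot_id != shot.shot_id for s in self.added_shots):
            self.added_shots.append(shot)

    def text_query(self):
        """Initial terms plus the text of every added shot."""
        terms = list(self.text_terms)
        for shot in self.added_shots:
            terms.extend(shot.asr_text.lower().split())
        return terms

    def image_query(self):
        """Keyframes to match against the collection in the next search."""
        return [s.keyframe for s in self.added_shots]

if __name__ == "__main__":
    panel = QueryPanel(["rocket"])
    panel.add_to_query(Shot("s1", "Missile launch", "kf1"))
    print(panel.text_query(), panel.image_query())
```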
Demonstration
Text, Image & Feedback System Demonstration
Demonstration
Text-only System Demonstration
Submitted Runs
• Eight Runs in total
  – Text-only Interface
    • DCUTREC12a_1 – Combined results of first 4 users
    • DCUTREC12a_3 – Combined results of next 4 users
    • DCUTREC12a_5 – Combined results of next 4 users
    • DCUTREC12a_7 – Combined results of last 4 users
  – Text, Image & Feedback Interface
    • DCUTREC12b_2 – Combined results of first 4 users
    • DCUTREC12b_4 – Combined results of next 4 users
    • DCUTREC12b_6 – Combined results of next 4 users
    • DCUTREC12b_8 – Combined results of last 4 users
[Diagram: grid of Users 1 to 4 and Users 5 to 8 against topic blocks (Topic 0, 6, 12, 18, 24), with cells marked Text-only or Text, image & feedback.]
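The run naming above follows a regular pattern, which can be sketched as a small lookup. The `run_id` function is a hypothetical helper; it assumes users are numbered 1 to 16 in the order they were grouped, with "a" denoting the text-only interface and "b" the text, image & feedback interface, as in the list of runs.

```python
USERS_PER_RUN = 4

def run_id(user_number, system):
    """Return the run that a user's results on a given system were combined into."""
    if not 1 <= user_number <= 16:
        raise ValueError("user_number must be between 1 and 16")
    if system not in ("a", "b"):
        raise ValueError("system must be 'a' (text-only) or 'b' (text, image & feedback)")
    group = (user_number - 1) // USERS_PER_RUN  # 0 for the first 4 users, 3 for the last 4
    # Text-only runs take the odd suffixes, the other interface the even ones.
    suffix = 2 * group + (1 if system == "a" else 2)
    return f"DCUTREC12{system}_{suffix}"

if __name__ == "__main__":
    for user in (1, 5, 9, 13):
        print(user, run_id(user, "a"), run_id(user, "b"))
```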
Precision Recall graph
[Figure: precision (0 to 1) against recall (0.0 to 1.0) for the Text-only and Text & Image systems. Aggregation of all 4 runs for each system.]
Examining time
[Figure: recall (0 to 0.3) against time (minutes) for DCUTrec12a and DCUTrec12b, with a marker at 4 minutes.]
Recall over Topic
[Figure: recall (0 to 1) by topic number for the Text-only and Text & Image systems.]
Text, Image & Feedback Queries
Topic 102: Find shots from behind the pitcher in a baseball game as he throws a ball that the batter swings at
Topic 107: Find shots of a rocket or missile taking off. Simulations are acceptable
Text-only Queries
Topic 111: Find shots with a locomotive (and attached railroad cars if any) approaching the viewer
Topic 119: Find shots of Morgan Freeman
User Observations
• Average of 6 queries / topic (both systems)
  – 564 in total on the Text-only and 581 on Text, Image and Feedback
  – Of 581 Text, Image & Feedback Queries:
    • > 99% contain text and 81% contain an image
• When given the choice, users chose:
[Pie chart with categories Text-only, Mostly Text, 50-50, Mostly Image and Image-only; extracted values 4%, 51%, 8%, 12% and 25%, with the pairing of values to categories not preserved.]
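The "average of 6 queries / topic" figure can be checked with a short calculation. It assumes the average is taken over topic sessions, that is 16 users × 6 topics per system = 96 sessions on each system, which is an interpretation of the set-up described earlier rather than something the slide states. The names below are illustrative.

```python
USERS = 16
TOPICS_PER_USER_PER_SYSTEM = 6
SESSIONS_PER_SYSTEM = USERS * TOPICS_PER_USER_PER_SYSTEM  # 96 topic sessions per system

def queries_per_session(total_queries, sessions=SESSIONS_PER_SYSTEM):
    """Average number of queries issued per user-topic session."""
    return total_queries / sessions

if __name__ == "__main__":
    print(queries_per_session(564))            # Text-only: 5.875
    print(round(queries_per_session(581), 2))  # Text, Image & Feedback: 6.05
```

Both values round to the 6 queries per topic quoted above.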
Conclusions
• Both systems perform comparably
  – Text-only seems to be slightly better than the text, image and feedback system
  – But not by any significant amount
• Why is this the case?
  – Text-only is better…
  – Users more comfortable with text querying…
  – Query response time of the text, image and feedback system was slower than text-only…
    • By a few seconds only over the seven minutes.
• We still have more work to do on evaluating the user data gathered during the experiments
Thank You