UPC-UB-STP @ MediaEval 2015 Diversity Task: Iterative Reranking of Relevant Images
Aniol Lidon, Marc Bolaños, Markus Seidl, Xavier Giró-i-Nieto, Petia Radeva, Matthias Zeppelzauer
St. Pölten University of Applied Sciences
1. Ranking by relevance: a relevance score for each image is estimated using either visual or textual information.
2. Filtering of irrelevant images: only a percentage of the top-ranked images is kept for the later steps. Runs 1 to 3 keep the top 20%, while Run 5 keeps the top 15%.
3. Feature and distance computation: visual and/or textual features are extracted for each image, and the similarity between each pair of images is computed.
4. Reranking by diversity: an iterative algorithm selects the image most different from all previously selected ones. The iteration starts by adding the most relevant image as the first element of the reranked list.
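The diversity reranking step above can be sketched as a greedy max-min selection. This is a minimal sketch: the `dist` lookup structure and the tie-breaking order (preferring the more relevant image on ties) are implementation assumptions not stated in the poster.

```python
def rerank_by_diversity(ranked_ids, dist, k=20):
    """Greedily build a diverse list: seed with the most relevant image,
    then repeatedly add the candidate farthest from everything selected."""
    selected = [ranked_ids[0]]          # most relevant image seeds the list
    candidates = list(ranked_ids[1:])   # kept in relevance order (tie-break)
    while candidates and len(selected) < k:
        # max-min criterion: pick the candidate whose closest
        # already-selected image is farthest away
        best = max(candidates,
                   key=lambda c: min(dist[(c, s)] for s in selected))
        selected.append(best)
        candidates.remove(best)
    return selected
```

Here `dist` is any symmetric pairwise-distance lookup keyed by image-id pairs; the loop stops once the cutoff `k` (e.g. 20, matching the @20 evaluation metrics) is reached.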
Visual data for relevance
Textual data for relevance
Visual data for similarity
Textual data for similarity
The relevance CNN was built on HybridNet [1], a CNN trained on objects from the ImageNet dataset and scene categories from the Places dataset. HybridNet was fine-tuned on two classes, relevant and irrelevant, as labeled by human annotators.
The fully connected layer fc7 from a CNN trained on ImageNet and the fully connected layer fc8 from HybridNet were used as feature vectors.
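With the fc7/fc8 activations as feature vectors, pairwise visual similarity between images can be computed. The exact distance metric used for the visual features is not stated in this excerpt, so the cosine distance below is an assumption; a minimal NumPy sketch:

```python
import numpy as np

def pairwise_cosine_distance(feats):
    """feats: (n_images, dim) array of CNN activations (e.g. fc7/fc8).
    Returns an (n, n) matrix of cosine distances (0 = identical direction)."""
    norms = np.linalg.norm(feats, axis=1, keepdims=True)
    unit = feats / np.clip(norms, 1e-12, None)   # guard against zero vectors
    return 1.0 - unit @ unit.T
```

The resulting matrix can feed the diversity reranking step directly, with larger entries meaning more visually different images.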
Results (testset, overall):

Metric   Run 1 (Visual)   Run 2 (Text)   Run 3 (Multi)   Run 5 (Multi)
P@20     0.649            0.703          0.688           0.677
CR@20    0.413            0.378          0.422           0.405
F1@20    0.491            0.474          0.508           0.489
Run 3 uses the best combination of textual and visual data. Run 5 considers multimodal information for relevance and purely visual information for diversity.
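Run 3's multimodal combination implies some fusion of the visual and textual distances. The excerpt does not specify the fusion scheme, so the sketch below is illustrative only: the min-max normalization and the mixing weight `alpha` are assumptions.

```python
import numpy as np

def fuse_distances(d_visual, d_textual, alpha=0.5):
    """Late fusion of modality-specific distance matrices.
    alpha and min-max normalization are illustrative assumptions,
    not the scheme used in the actual runs."""
    def norm(d):
        rng = d.max() - d.min()
        return (d - d.min()) / rng if rng > 0 else d
    # rescale each modality to [0, 1] before mixing, so neither dominates
    return alpha * norm(d_visual) + (1 - alpha) * norm(d_textual)
```

Normalizing each matrix first keeps one modality's larger raw distances from swamping the other in the weighted sum.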
Textual relevance pipeline:
1. Textual query term model: remove undesired words, select the most representative terms, and build a histogram of terms.
2. Align image text to the query model: only matched terms are retained; retrieve their term frequencies and build the feature vector.
3. Compute TF-IDF weights.
4. Compare with the cosine metric: the cosine similarity between the image vector and the query model gives the relevance score used to rerank the original list.
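The steps above can be sketched with a small self-contained TF-IDF implementation. The whitespace tokenization and the absence of stop-word removal here are simplifications of the pipeline's "remove undesired words" and term-selection steps:

```python
import math
from collections import Counter

def tfidf_vectors(docs):
    """Build TF-IDF vectors over a shared vocabulary (illustrative sketch)."""
    tokenized = [d.lower().split() for d in docs]
    vocab = sorted({t for doc in tokenized for t in doc})
    n = len(tokenized)
    # document frequency and inverse document frequency per term
    df = {t: sum(t in doc for doc in tokenized) for t in vocab}
    idf = {t: math.log(n / df[t]) for t in vocab}
    vecs = []
    for doc in tokenized:
        tf = Counter(doc)                      # raw term frequencies
        vecs.append([tf[t] * idf[t] for t in vocab])
    return vocab, vecs

def cosine(u, v):
    """Cosine similarity between two vectors; 0 if either is all-zero."""
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv) if nu and nv else 0.0
```

Scoring each image's text against the query vector with `cosine` yields the relevance scores used for the initial textual ranking.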
References
[1] Zhou, B., Lapedriza, A., Xiao, J., Torralba, A., & Oliva, A. (2014). Learning deep features for scene recognition using Places database. In Advances in Neural Information Processing Systems (pp. 487-495).
Acknowledgements