mediaeval 2015 - but quesst 2015 system description

11

BUT QUESST 2015 System Description Miroslav Skácel, Igor Szöke Speech@FIT Faculty of Information Technology Brno University of Technology MediaEval QUESST 2015 workshop, September 14.-15. 2015, Wurzen

Upload: multimediaeval

Post on 20-Jan-2017

91 views

Category:

Education

1 download

Report

Download

Embed Size (px):

TRANSCRIPT

Page 1: MediaEval 2015 - BUT QUESST 2015 System Description

BUT QUESST 2015 System Description

Miroslav Skácel, Igor SzökeSpeech@FIT

Faculty of Information TechnologyBrno University of Technology

MediaEval QUESST 2015 workshop, September 14.-15. 2015, Wurzen

Page 2: MediaEval 2015 - BUT QUESST 2015 System Description

System overviewOur internal task was:

● to reuse some Atomic systems as we have● to incorporate bottlenecks● to calibrate and fuse● to cope with T2/T3 queries

We ended up with:● 4 Atomic systems● 3 QbE subsystems based on DTW● 4 languages (Czech, Portuguese, Russian and Spanish).

2

Page 3: MediaEval 2015 - BUT QUESST 2015 System Description

3

Page 4: MediaEval 2015 - BUT QUESST 2015 System Description

Atomic system● no adaptation on target data (SMVN, VTLN, …)● Artificial Neural Networks – to estimate bottlenecks ● bottlenecks – trained on GlobalPhone (GP) database

4

Page 5: MediaEval 2015 - BUT QUESST 2015 System Description

Subsystem

Neural network based features:● bottleneck features (30 dimensional)● No VTLN, No SMN/SVN

Query detector● based on Dynamic Time Warping (DTW)

5

Page 6: MediaEval 2015 - BUT QUESST 2015 System Description

DTW QbE subsystem● segmental DTW (query can start in any frame of utterance)● Voice Activity Detection (VAD) only on queries● Pearson product-moment correlation distance (dcorr)● slope limitation● online normalizing of the path● bottlenecks superior to posteriors

features dcorr in minCnxe (ALL)

SD CZ POST 0.984

SD HU POST 0.972

SD RU POST 0.952

GP CZ BN 0.853

GP PO BN 0.894

GP RU BN 0.893

GP SP BN 0.904

6

Page 7: MediaEval 2015 - BUT QUESST 2015 System Description

Slope limitation

7

Page 8: MediaEval 2015 - BUT QUESST 2015 System Description

Dealing with T2● query split into equal parts● each part searched in utterance separately● results averaged together● query split into 2 (denoted as 2w) and 3 (3w) parts

in late evaluation

8

Page 9: MediaEval 2015 - BUT QUESST 2015 System Description

Score normalization● raw detection scores normalized by length● the best detection per utterance-query pair selected● mode normalization performed

original mode norm.

9

Page 10: MediaEval 2015 - BUT QUESST 2015 System Description

Results

● posteriors do not work for this year dataset● slope limitation helps to control path shape● fea stack of more than 4 langs does not improve performance● mode norm is good for raw score normalization

● we will focus on denoising and dereverberation in next year

10

Page 11: MediaEval 2015 - BUT QUESST 2015 System Description

Thanks for your attention

MediaEval 2015 - RECOD@Placing Task of MediaEval 2015

Mediaeval Studies 2015 - Reading Walter of Châtillon's ...Mediaeval Studies 77 (2015): 81 –101.© Pontifical Institute of Mediaeval Studies. READING WALTER OF CHÂTILLON’S ALEXANDREIS

MediaEval 2015 - Overview of the MediaEval 2015 Drone Protect Task

MediaEval 2015 - Emotion in Music: Task Overview

MediaEval 2015 - Multimodal Person Discovery in Broadcast TV - poster

MediaEval 2015 - UNED-UV @ Retrieving Diverse Social Images Task - Poster

MediaEval 2015 - SAVA at MediaEval 2015: Search and Anchoring in Video Archives

MediaEval 2015 Drone Protect Task: Privacy Protection in …user.ceng.metu.edu.tr/~sciftci/pdfs/mediaeval2015.pdf · false coloring approach for Drone Protect Task of MediaEval 2015

MediaEval 2015 - Query by Example Search on Speech Task

Synchronization of Multi-User Event Media (SEM) at MediaEval 2014: Task Description, Datasets, and Evaluation

MediaEval 2015 - UNIZA System for the "Emotion in Music" task at MediaEval 2015

MediaEval 2015 - Multimodal-based Diversified Summarization in Social Image Retrieval - Poster

MediaEval 2015 - The Placing Task at MediaEval 2015

MediaEval 2015 - Affective Impact of Movies: Task Overview and Results

MediaEval 2015 - The C@merata Task at MediaEval 2015: Natural Language:Queries on Classical Music Scores

MediaEval 2015 - RFA at MediaEval 2015 Affective Impact of Movies Task: A Multimodal Approach

MediaEval 2016 - EUMSSI Team at the MediaEval Person Discovery Challenge

Mediaeval Mason

MediaEval 2015 - CERTH/CEA LIST at MediaEval Placing Task 2015

MediaEval 2015 - Synchronization of Multi-User Event Media at MediaEval 2015: Task Description, Datasets and Evaluation

MediaEval 2015 - EURECOM @ SAVA2015: Visual Features for Multimedia Search

Roman, Mediaeval and Post-Mediaeval Metalworking Debris at

MediaEval 2015 - GTM-UVigo Systems for the Query-by-Example Search on Speech Task at MediaEval 2015

MediaEval 2015 - The CERTH-UNITN Participation @ Verifying Multimedia Use 2015

Mediaeval Studies 2015 - Fragments from a Lost Court of ... · Mediaeval Studies 77 (2015): 183–201.© Pontifical Institute of Mediaeval Studies. FRAGMENTS FROM A LOST . COURT OF

MediaEval 2015 - UNIZA System for the "Emotion in Music" task at MediaEval 2015 - Poster

MediaEval 2015 - Multi-Scale Approaches to the MediaEval 2015 "Emotion in Music" Task

MediaEval 2015 - OHSU @ MediaEval 2015: Adapting Textual Techniques to Multimedia Search

Atelier VIF : Visualisation d’informations, Interaction ... · Multimodal person discovery in broadcast TV at MediaEval 2015. In Proceedings of MediaEval .-1-Analyse interactive

MediaEval 2015 - The NNI Query-by-Example System for MediaEval 2015

Fudan-Huawei at MediaEval 2015: Detecting Violent …zxwu.azurewebsites.net/MediaEval2015-Fudan-Huawei.pdfFudan-Huawei at MediaEval 2015: Detecting Violent Scenes and Affective Impact

MediaEval 2015 - JRS at Synchronization of Multi-user Event Media Task

MediaEval 2015 - UPC-UB-STP @ MediaEval 2015 Diversity Task: Iterative Reranking of Relevant Images - Poster

CERTH/CEA LIST at MediaEval Placing Task 2015

MediaEval 2015 - Short introduction of Odyssey 2016