CS533, Spring 2013
TRANSCRIPT
Tunc Gultekin
Eren Golge
Havva Gulay Gurbuz
Ahmet Iscen
Goal & Contribution
Motivation
Test Environment
New Approach
Error Rate Calculation
Error Rates
Conclusion
There are many evaluation measures; which one should we trust when comparing algorithms?
How do we interpret their results?
Are they stable?
A novel approach for quantifying errors of evaluation measures has been developed.
To compare the performance of different IR algorithms, experiments are run on test collections.
The relative performance of the algorithms is expressed with evaluation measures. ◦ Precision, recall...
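For concreteness, the two measures named above can be computed in a few lines. The document IDs below are hypothetical and only serve to illustrate the definitions:

```python
# Toy precision/recall sketch; the document IDs below are hypothetical.
retrieved = ["d1", "d2", "d3", "d4"]   # documents returned by a system
relevant = {"d1", "d3", "d5"}          # documents judged relevant

def precision(retrieved, relevant):
    """Fraction of retrieved documents that are relevant."""
    hits = sum(1 for d in retrieved if d in relevant)
    return hits / len(retrieved)

def recall(retrieved, relevant):
    """Fraction of relevant documents that were retrieved."""
    hits = sum(1 for d in retrieved if d in relevant)
    return hits / len(relevant)

print(precision(retrieved, relevant))  # 2 of 4 retrieved are relevant -> 0.5
print(recall(retrieved, relevant))     # 2 of 3 relevant were found
```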
The use of these measures relies on some rules of thumb: ◦ Experiments must use reasonable evaluation measures.
◦ Conclusions must be based on reasonable performance differences.
The meaning of "reasonable" varies from person to person.
An objective decision-making procedure is required.
Precision at a cutoff (P@n), Recall(1000), Precision at 0.5 Recall, R-Precision, and Average Precision were compared.
9 different IR algorithms from TREC-8 were used.
21 different query sets.
21 different query sets were run on 9 different IR algorithms.
An evaluation measure was chosen and a fuzziness* value was defined.
A query set was selected, and the mean of the evaluation measure over that set was computed for each of the 9 retrieval methods.
*a threshold that decides whether the difference between two measure values is large enough to be discriminative
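The per-query-set comparison described above might be sketched as follows. The proportional fuzziness rule (treating two means as tied when they differ by less than `fuzziness` times the larger mean) is an assumption of this sketch, not a detail stated on the slides:

```python
from statistics import mean

def compare_methods(scores_a, scores_b, fuzziness=0.05):
    """Compare two retrieval methods on one query set.

    scores_a / scores_b: per-query values of the chosen evaluation
    measure (hypothetical inputs). Returns 'A', 'B', or 'tie'.
    """
    a, b = mean(scores_a), mean(scores_b)
    # Tie if the difference is within `fuzziness` of the larger mean
    # (a proportional threshold; one plausible reading of "fuzziness").
    if abs(a - b) <= fuzziness * max(a, b):
        return "tie"
    return "A" if a > b else "B"

print(compare_methods([0.50, 0.60], [0.30, 0.40]))  # clear gap -> 'A'
print(compare_methods([0.50, 0.52], [0.51, 0.52]))  # within fuzziness -> 'tie'
```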
For each pair of retrieval methods, the better method was determined.
Another query set was selected, and the comparisons were repeated many times.
Results are presented in a 9x9 comparison matrix.
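The repeated pairwise comparisons could be tallied as below. The data layout (`method_scores[m][q]`) and the function name are illustrative assumptions; with 9 methods this yields 36 pairs, i.e. the off-diagonal cells of the 9x9 matrix:

```python
from itertools import combinations

def comparison_counts(method_scores, query_sets, fuzziness=0.05):
    """Tally pairwise win/tie counts over many query sets.

    method_scores[m][q]: score of method m on query q (hypothetical layout).
    query_sets: lists of query ids, one list per trial.
    Returns counts[(m1, m2)] = (m1 wins, m2 wins, ties).
    """
    counts = {}
    for m1, m2 in combinations(sorted(method_scores), 2):
        wins1 = wins2 = ties = 0
        for qs in query_sets:
            # Mean of the evaluation measure over this query set.
            a = sum(method_scores[m1][q] for q in qs) / len(qs)
            b = sum(method_scores[m2][q] for q in qs) / len(qs)
            if abs(a - b) <= fuzziness * max(a, b):
                ties += 1
            elif a > b:
                wins1 += 1
            else:
                wins2 += 1
        counts[(m1, m2)] = (wins1, wins2, ties)
    return counts
```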
For each cell of the matrix, the larger of the "A better than B" and "B better than A" counts was accepted as the correct answer; the smaller count was treated as errors.
The smaller counts of all cells were summed and divided by the total number of decisions. ◦ Error rate for the Average Precision matrix:
16 / 756 = 0.021 = 2.1%
Error* = min(A>B, B>A) / (A>B + B>A + A==B)
*if the error rate of a comparison matrix reaches ~25%, discrimination converges to randomness
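This error-rate formula reduces to a couple of lines. `cells` below is a hypothetical mapping from method pairs to (A>B, B>A, A==B) counts; the slides report 16 errors out of 756 decisions (2.1%) for the actual Average Precision matrix:

```python
def matrix_error_rate(cells):
    """cells[(m1, m2)] = (m1 wins, m2 wins, ties) for each method pair.

    Error rate = sum of minority counts / total number of decisions,
    i.e. min(A>B, B>A) / (A>B + B>A + A==B), summed over all cells.
    """
    errors = sum(min(w1, w2) for w1, w2, _ in cells.values())
    total = sum(w1 + w2 + t for w1, w2, t in cells.values())
    return errors / total

# Hypothetical counts for three method pairs, for illustration only.
demo = {("A", "B"): (9, 1, 0), ("A", "C"): (5, 3, 2), ("B", "C"): (8, 0, 2)}
print(matrix_error_rate(demo))  # (1 + 3 + 0) / 30
```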
Error rates of evaluation measures are inversely proportional to the topic set size.
Query sets should be chosen carefully. ◦ A query set may be biased.
Recall(1000) is very stable, but it is appropriate only for limited environments.
Average Precision is good for general purposes.
Precision at a cutoff level is appropriate for the web.