large scale visual recognition challenge (ilsvrc)...

26
Large Scale Visual Recognition Challenge (ILSVRC) 2017 Eunbyung Park UNC Chapel Hill Overview Wei Liu UNC Chapel Hill Olga Russakovsky CMU/Princeton Jia Deng Univ. of Michigan Fei-Fei Li Stanford Alex Berg UNC Chapel Hill

Upload: leanh

Post on 16-Jun-2018

243 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Large Scale Visual Recognition Challenge (ILSVRC) 2017image-net.org/challenges/talks_2017/ILSVRC2017_overview.pdf · Large Scale Visual Recognition Challenge (ILSVRC) 2017 ... CMU/Princeton

Large Scale Visual Recognition Challenge (ILSVRC) 2017

Eunbyung ParkUNC Chapel Hill

Overview

Wei LiuUNC Chapel Hill

Olga RussakovskyCMU/Princeton

Jia DengUniv. of Michigan

Fei-Fei LiStanford

Alex BergUNC Chapel Hill

Page 2: Large Scale Visual Recognition Challenge (ILSVRC) 2017image-net.org/challenges/talks_2017/ILSVRC2017_overview.pdf · Large Scale Visual Recognition Challenge (ILSVRC) 2017 ... CMU/Princeton

Agenda

1. Participation over the years

2. LOC+CLS Task – Results

3. DET Task– Results

4. VID Task – Results

Page 3: Large Scale Visual Recognition Challenge (ILSVRC) 2017image-net.org/challenges/talks_2017/ILSVRC2017_overview.pdf · Large Scale Visual Recognition Challenge (ILSVRC) 2017 ... CMU/Princeton

Participation in ILSVRC over the years

35

1529

81

123

157

172

115

2010 2011 2012 2013 2014 2015 2016 2017

The

nu

mb

er o

f En

trie

s

1 year 9 month

Page 4: Large Scale Visual Recognition Challenge (ILSVRC) 2017image-net.org/challenges/talks_2017/ILSVRC2017_overview.pdf · Large Scale Visual Recognition Challenge (ILSVRC) 2017 ... CMU/Princeton

ILSVRC Image Classification (CLS) TaskSteel drum

1000 object classes 1,431,167 images CLS-LOC

Page 5: Large Scale Visual Recognition Challenge (ILSVRC) 2017image-net.org/challenges/talks_2017/ILSVRC2017_overview.pdf · Large Scale Visual Recognition Challenge (ILSVRC) 2017 ... CMU/Princeton

ILSVRC Image Classification (CLS) TaskSteel drum

Page 6: Large Scale Visual Recognition Challenge (ILSVRC) 2017image-net.org/challenges/talks_2017/ILSVRC2017_overview.pdf · Large Scale Visual Recognition Challenge (ILSVRC) 2017 ... CMU/Princeton

ILSVRC Image Localization (LOC) TaskSteel drum

Page 7: Large Scale Visual Recognition Challenge (ILSVRC) 2017image-net.org/challenges/talks_2017/ILSVRC2017_overview.pdf · Large Scale Visual Recognition Challenge (ILSVRC) 2017 ... CMU/Princeton

ILSVRC Image Localization (LOC) TaskSteel drum Correct

Bad localization Bad classification

Page 8: Large Scale Visual Recognition Challenge (ILSVRC) 2017image-net.org/challenges/talks_2017/ILSVRC2017_overview.pdf · Large Scale Visual Recognition Challenge (ILSVRC) 2017 ... CMU/Princeton

ILSVRC Image Localization (LOC) TaskSteel drum Correct

Page 9: Large Scale Visual Recognition Challenge (ILSVRC) 2017image-net.org/challenges/talks_2017/ILSVRC2017_overview.pdf · Large Scale Visual Recognition Challenge (ILSVRC) 2017 ... CMU/Princeton

Classification Results (CLS)

0.280.26

0.16

0.12

0.07

0.036 0.03 0.0230

0.05

0.1

0.15

0.2

0.25

0.3

2010 2011 2012 2013 2014 2015 2016 2017

Cla

ssif

icat

ion

Err

or

16.7% ↓ 23.3% ↓

Page 10: Large Scale Visual Recognition Challenge (ILSVRC) 2017image-net.org/challenges/talks_2017/ILSVRC2017_overview.pdf · Large Scale Visual Recognition Challenge (ILSVRC) 2017 ... CMU/Princeton

Localization Results (LOC)Lo

caliz

atio

n E

rro

r

0.43

0.34

0.3

0.25

0.09 0.077 0.0620

0.1

0.2

0.3

0.4

0.5

2011 2012 2013 2014 2015 2016 2017

14.4% ↓ 19.5% ↓

Page 11: Large Scale Visual Recognition Challenge (ILSVRC) 2017image-net.org/challenges/talks_2017/ILSVRC2017_overview.pdf · Large Scale Visual Recognition Challenge (ILSVRC) 2017 ... CMU/Princeton

Team Name Error(%)

WMW 0.0225

Trimps-Soushen 0.0248

NUS-Qihoo_DPNs 0.0274

BDAT 0.0296

WMWJie Hu1 , Li Shen2 , Gang Sun1

1. Momenta2. Universify of Oxford

Trimps-SouchenXiaoteng Zhang, Zhengyan Ding, JianyingZhou, Jie Shao, Lin Mei The Third Research Institute of the Ministry of Public Security, P.R. China.

ILSVRC2017 CLS Results - ‘Provided’ Data

Page 12: Large Scale Visual Recognition Challenge (ILSVRC) 2017image-net.org/challenges/talks_2017/ILSVRC2017_overview.pdf · Large Scale Visual Recognition Challenge (ILSVRC) 2017 ... CMU/Princeton

Team Name Error(%)

NUS-Qihoo_DPNs 0.0271

BDAT 0.0300

BDATHui Shuai1, Zhenbo Yu1, Qingshan Liu1, Xiaotong Yuan1, Kaihua Zhang1, YishengZhu1, Guangcan Liu1, Jing Yang1, YuxiangZhou2, Jiankang Deng2

1. Nanjing University of Information Science & Technology2. Imperial College London

NUS-Qihoo_DPNsYunpeng Chen1, Huaxin Xiao1, Jianan Li1, Xuecheng Nie1, Xiaojie Jin1, Jianshu Li1, Jiashi Feng1, Jian Dong2, Shuicheng Yan2

1. NUS - National University of Singapore2. Qihoo 360

ILSVRC2017 CLS Results - ‘External’ Data

Page 13: Large Scale Visual Recognition Challenge (ILSVRC) 2017image-net.org/challenges/talks_2017/ILSVRC2017_overview.pdf · Large Scale Visual Recognition Challenge (ILSVRC) 2017 ... CMU/Princeton

ILSVRC2017 LOC Results - ‘Provided’ Data

Team Name Error(%)

NUS-Qihoo_DPNs 0.0623

Trimps-Soushen 0.0650

BDAT 0.0814

SIIT_KAIST-SKT 0.1290

NUS-Qihoo_DPNsYunpeng Chen1, Huaxin Xiao1, Jianan Li1, Xuecheng Nie1, Xiaojie Jin1, Jianshu Li1, Jiashi Feng1, Jian Dong2, Shuicheng Yan2

1. NUS - National University of Singapore2. Qihoo 360

Trimps-SouchenXiaoteng Zhang, Zhengyan Ding, JianyingZhou, Jie Shao, Lin Mei The Third Research Institute of the Ministry of Public Security, P.R. China.

Page 14: Large Scale Visual Recognition Challenge (ILSVRC) 2017image-net.org/challenges/talks_2017/ILSVRC2017_overview.pdf · Large Scale Visual Recognition Challenge (ILSVRC) 2017 ... CMU/Princeton

Team Name Error(%)

NUS-Qihoo_DPNs 0.0619

BDAT 0.0875

BDATHui Shuai1, Zhenbo Yu1, Qingshan Liu1, Xiaotong Yuan1, Kaihua Zhang1, YishengZhu1, Guangcan Liu1, Jing Yang1, YuxiangZhou2, Jiankang Deng2

1. Nanjing University of Information Science & Technology2. Imperial College London

NUS-Qihoo_DPNsYunpeng Chen1, Huaxin Xiao1, Jianan Li1, Xuecheng Nie1, Xiaojie Jin1, Jianshu Li1, Jiashi Feng1, Jian Dong2, Shuicheng Yan2

1. NUS - National University of Singapore2. Qihoo 360

ILSVRC2017 LOC Results - ‘External’ Data

Page 15: Large Scale Visual Recognition Challenge (ILSVRC) 2017image-net.org/challenges/talks_2017/ILSVRC2017_overview.pdf · Large Scale Visual Recognition Challenge (ILSVRC) 2017 ... CMU/Princeton

ILSVRC Object Detection (DET) Task

200 object classes 578,482 images DET

Page 16: Large Scale Visual Recognition Challenge (ILSVRC) 2017image-net.org/challenges/talks_2017/ILSVRC2017_overview.pdf · Large Scale Visual Recognition Challenge (ILSVRC) 2017 ... CMU/Princeton

ILSVRC Object Detection (DET) Task

This year: 5,500 new test images with bounding boxes fully annotated

Boxes are correct if IoU > 0.5

Average Precision

IoU =

Recall

Prec

isio

n Area under Precision Recall Curves

0

1

1

Page 17: Large Scale Visual Recognition Challenge (ILSVRC) 2017image-net.org/challenges/talks_2017/ILSVRC2017_overview.pdf · Large Scale Visual Recognition Challenge (ILSVRC) 2017 ... CMU/Princeton

Detection Results (DET)M

ean

Ave

rage

Pre

cisi

on

(mA

P)

0.23

0.44

0.620.66

0.73

0

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

2013 2014 2015 2016 2017

Page 18: Large Scale Visual Recognition Challenge (ILSVRC) 2017image-net.org/challenges/talks_2017/ILSVRC2017_overview.pdf · Large Scale Visual Recognition Challenge (ILSVRC) 2017 ... CMU/Princeton

ILSVRC2017 DET Results - ‘Provided’ Data

Team Name#category

wonmAP(%)

BDAT 85 0.732

NUS-Qihoo_DPNs 9 0.657

VIST 10 0.593

KAISTNIA_ETRI 1 0.610

BDATHui Shuai1, Zhenbo Yu1, Qingshan Liu1, Xiaotong Yuan1, Kaihua Zhang1, YishengZhu1, Guangcan Liu1, Jing Yang1, YuxiangZhou2, Jiankang Deng2

1. Nanjing University of Information Science & Technology2. Imperial College London

NUS-Qihoo_DPNsYunpeng Chen1, Huaxin Xiao1, Jianan Li1, Xuecheng Nie1, Xiaojie Jin1, Jianshu Li1, Jiashi Feng1, Jian Dong2, Shuicheng Yan2

1. NUS - National University of Singapore2. Qihoo 360

Page 19: Large Scale Visual Recognition Challenge (ILSVRC) 2017image-net.org/challenges/talks_2017/ILSVRC2017_overview.pdf · Large Scale Visual Recognition Challenge (ILSVRC) 2017 ... CMU/Princeton

ILSVRC2017 DET Results - ‘External’ Data

Team Name#category

wonmAP(%)

BDAT 128 0.732

NUS-Qihoo_DPNs 14 0.658

BDATHui Shuai1, Zhenbo Yu1, Qingshan Liu1, Xiaotong Yuan1, Kaihua Zhang1, YishengZhu1, Guangcan Liu1, Jing Yang1, YuxiangZhou2, Jiankang Deng2

1. Nanjing University of Information Science & Technology2. Imperial College London

NUS-Qihoo_DPNsYunpeng Chen1, Huaxin Xiao1, Jianan Li1, Xuecheng Nie1, Xiaojie Jin1, Jianshu Li1, Jiashi Feng1, Jian Dong2, Shuicheng Yan2

1. NUS - National University of Singapore2. Qihoo 360

Page 20: Large Scale Visual Recognition Challenge (ILSVRC) 2017image-net.org/challenges/talks_2017/ILSVRC2017_overview.pdf · Large Scale Visual Recognition Challenge (ILSVRC) 2017 ... CMU/Princeton

Object Detection from Video(VID) Task

Allows evaluation of generic object detectionin cluttered videos at scale

Fully annotated 30 object classes across 7,314 snippets

Page 21: Large Scale Visual Recognition Challenge (ILSVRC) 2017image-net.org/challenges/talks_2017/ILSVRC2017_overview.pdf · Large Scale Visual Recognition Challenge (ILSVRC) 2017 ... CMU/Princeton

Object Detection from Video(VID) Task

This year: 1,036 new snippets distributed into train, val, test set.

• Algorithms outputs a list of bounding box detections with confidences

• A detection is considered correct if intersection over union(IoU) overlap with ground truth > 0.5

• Evaluated by average precision per object class

• Winner of challenge is the team that wins the most object categories

Evaluation modeled after PASCAL VOC:

Page 22: Large Scale Visual Recognition Challenge (ILSVRC) 2017image-net.org/challenges/talks_2017/ILSVRC2017_overview.pdf · Large Scale Visual Recognition Challenge (ILSVRC) 2017 ... CMU/Princeton

Object Detection from Video(VID) Task

This year: 1,036 new snippets distributed into train, val, test set.

• Algorithms outputs a list of bounding box detections with confidences and tracklet ID.

• Tracklets are sorted by the mean confidence.

• A tracklet is considered correct if intersection over union(IoU) overlap with ground truth tracklet > 0.5.

• Evaluation by average precision per class. Final score is an average over different thresholds.

• Winner of challenge is the team that has highest score.

Evaluation taking tracking into account:

Page 23: Large Scale Visual Recognition Challenge (ILSVRC) 2017image-net.org/challenges/talks_2017/ILSVRC2017_overview.pdf · Large Scale Visual Recognition Challenge (ILSVRC) 2017 ... CMU/Princeton

Video Detection Results (VID)M

ean

Ave

rage

Pre

cisi

on

(mA

P)

0.68

0.81 0.82

0

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

1

2015 2016 2017

W/O Tracking

0.545

0.641

0

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

1

2016 2017

W/ Tracking

Page 24: Large Scale Visual Recognition Challenge (ILSVRC) 2017image-net.org/challenges/talks_2017/ILSVRC2017_overview.pdf · Large Scale Visual Recognition Challenge (ILSVRC) 2017 ... CMU/Princeton

ILSVRC2017 VID Results - ‘Provided’ Data

Team Name#category

wonmAP(%)

mAP(%)tracking

IC&USYD 15 0.817 0.641

NUS-Qihoo-UIUC_DPNs

(VID)3 0.758 0.545

THU-CAS 0 0.730 0.512

IC&USYDJiankang Deng1, Yuxiang Zhou1, Baosheng Yu2, Zhe Chen2, StefanosZafeiriou1, Dacheng Tao2, 1. Imperial College London2. University of Sydney

NUS-Qihoo-UIUC_DPNs(VID)Yunchao Wei1, Mengdan Zhang1, JiananLi1, Yunpeng Chen1, Jiashi Feng1, Jian Dong2, Shuicheng Yan2, Honghui Shi3

1. National University of Singapore2. Qihoo 3603. University of Illinois Urbana-Champaign

Page 25: Large Scale Visual Recognition Challenge (ILSVRC) 2017image-net.org/challenges/talks_2017/ILSVRC2017_overview.pdf · Large Scale Visual Recognition Challenge (ILSVRC) 2017 ... CMU/Princeton

ILSVRC2017 VID Results - ‘External’ Data

Team Name#category

wonmAP(%)

mAP(%)tracking

IC&USYD 24 0.820 0.643

NUS-Qihoo-UIUC_DPNs

(VID)3 0.761 0.550

IC&USYDJiankang Deng1, Yuxiang Zhou1, Baosheng Yu2, Zhe Chen2, StefanosZafeiriou1, Dacheng Tao2

1. Imperial College London2. University of Sydney

NUS-Qihoo-UIUC_DPNs(VID)Yunchao Wei1, Mengdan Zhang1, JiananLi1, Yunpeng Chen1, Jiashi Feng1, Jian Dong2, Shuicheng Yan2, Honghui Shi3

1. National University of Singapore2. Qihoo 3603. University of Illinois Urbana-Champaign

Page 26: Large Scale Visual Recognition Challenge (ILSVRC) 2017image-net.org/challenges/talks_2017/ILSVRC2017_overview.pdf · Large Scale Visual Recognition Challenge (ILSVRC) 2017 ... CMU/Princeton

Coming Presentations!

1. Jie Hu(Team: WMW, Momenta): Squeeze-and-Excitation Networks

2. Yunpeng Chen(Team: NUS-Qihoo_DPNs, NUS): Dual Path Networks and its Applications

3. Short presentations of winning entries: NUS-Qihoo-UIUC_DPNs (VID), DeepView(ETRI), MIL_UT, SIIT_KAIST-SKT, KAISTNIA_ETRI