large scale visual recognition challenge (ilsvrc) 2013: detection spotlights

10
Large Scale Visual Recognition Challenge (ILSVRC) 2013: Detection spotlights

Upload: branxton

Post on 23-Feb-2016

117 views

Category:

Documents


0 download

DESCRIPTION

Large Scale Visual Recognition Challenge (ILSVRC) 2013: Detection spotlights. Toronto A team. ILSVRC 2013 Spotlight. Latent Hierarchical Model with GPU Inference for Object Detection Yukun Zhu, Jun Zhu, Alan Yuille UCLA Computer Vision Lab. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Large Scale Visual Recognition Challenge (ILSVRC) 2013: Detection spotlights

Large Scale Visual Recognition Challenge (ILSVRC) 2013:

Detection spotlights

Page 2: Large Scale Visual Recognition Challenge (ILSVRC) 2013: Detection spotlights

Toronto A team

Page 3: Large Scale Visual Recognition Challenge (ILSVRC) 2013: Detection spotlights
Page 4: Large Scale Visual Recognition Challenge (ILSVRC) 2013: Detection spotlights

ICCV’2013

Sydney, Australia

Latent Hierarchical Model with GPU Inference for Object Detection

Yukun Zhu, Jun Zhu, Alan Yuille UCLA Computer Vision Lab

ILSVRC 2013 Spotlight

Thank L. Zhu, Y. Chen, A. Yuille and W. Freeman for the work “Latent hierarchical structural learning for object detection”in CVPR 2010.

Page 5: Large Scale Visual Recognition Challenge (ILSVRC) 2013: Detection spotlights

Root-Part Configuration

Model for HorseModel for Car

Hierarchical Model

Latent Hierarchical Model with GPU Inference for Object Detection

Page 6: Large Scale Visual Recognition Challenge (ILSVRC) 2013: Detection spotlights

Latent Hierarchical Model with GPU Inference for Object Detection

• The latent hierarchical model encoding holistic object and parts w.r.t. viewpoint variations

• Support richer appearance features: HOG, color, etc.

• Fast training with incremental concave-convex procedure (iCCCP) algorithm

• Quick model inference via GPU (CUDA) implementation

Page 7: Large Scale Visual Recognition Challenge (ILSVRC) 2013: Detection spotlights

[1] Felzenszwalb P, McAllester D, Ramanan D, “A discriminatively trained, multiscale, deformable part model,” Computer Vision and Pattern Recognition, 2008. CVPR 2008. IEEE Conference on. IEEE, 2008: 1-8.[2] Felzenszwalb P F, Girshick R B, McAllester D, “Cascade object detection with deformable part models,” Computer vision and pattern recognition (CVPR), 2010 IEEE conference on. IEEE, 2010: 2241-2248.

Latent Hierarchical Model with GPU Inference for Object Detection

Page 8: Large Scale Visual Recognition Challenge (ILSVRC) 2013: Detection spotlights

ILSVRC2013 Task 1: Detection

Team name: DeltaMembers: Che-Rung Lee, Hwann-Tzong Chen, Hao-Ping Kang, Tzu-Wei Huang, Ci-Hong Deng, Hao-Che KaoNational Tsing Hua University

Page 9: Large Scale Visual Recognition Challenge (ILSVRC) 2013: Detection spotlights

Generic Object Detector

ConvNet Multiclass Classifier

~ 15 proposals per image

each proposal gets one of the (200+backgrounds) class-labels

Multiclass classifier: cuda-convnet [Krizhevsky et al.] Training: 590,000 bounding boxes, 3 days using 2 GPUs0.5 error rate for classifying the validation bounding boxes

Generic object detector: “What is an object” + salient region segmentation 0.28 mAP on the validation images (ignoring class labels)

Overall: 0.057 mAP on validation data, 0.06 mAP on test data

Page 10: Large Scale Visual Recognition Challenge (ILSVRC) 2013: Detection spotlights

8:30 Classification&localization

10:30 Detection

Noon Discussion panel

14:00 Invited talk by Vittorio Ferrari: Auto-annotation and self-assessment in ImageNet

14:40 Fine-Grained Challenge 2013

Agenda

http://www.image-net.org/challenges/LSVRC/2013/iccv2013

8:50 9:05 9:20 9:35 9:50 Spotlights

10:50 11:10 11:30 11:40Spotlights