combining efficient object localization and image classification

31
Combining efficient object localization and image classi cation H. Harzallah, F. Jurie and C. Schmid LEAR, INRIA Grenoble, LJK

Upload: alcina

Post on 05-Jan-2016

25 views

Category:

Documents


0 download

DESCRIPTION

Combining efficient object localization and image classification. H. Harzallah, F. Jurie and C. Schmid LEAR, INRIA Grenoble, LJK. Tasks. Image classification: assigning labels to the image. Car: present Cow: present Bike: not present Horse: not present …. Cow. Car. Tasks. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Combining efficient object localization  and image classification

Combining efficient object localization and image classification

H. Harzallah, F. Jurie and C. Schmid

LEAR, INRIA Grenoble, LJK

Page 2: Combining efficient object localization  and image classification

• Image classification: assigning labels to the image

Tasks

Car: presentCow: presentBike: not presentHorse: not present…

Page 3: Combining efficient object localization  and image classification

• Image classification: assigning labels to the image

Tasks

Car: presentCow: presentBike: not presentHorse: not present…

• Object localization: define the location and the category

Car CowLocatio

n

Category

Page 4: Combining efficient object localization  and image classification

Contributions

• Object class localization method

• Combining image classification and object localization

Localization++ Classification-- Localization-- Classification++

Page 5: Combining efficient object localization  and image classification

Overview

• Related work and datasets

• Efficient object localization– Experimental results

• Combining image classification and localization– Experimental results

• Conclusion

Page 6: Combining efficient object localization  and image classification

Related work

• Object localization– Sliding window [Dalal06] [Rowley95]– Implicit shape model [Leibe04]– SVM classifiers [Chum07] [Ferrari08]– Cascade of classifiers [Viola01] [Vedaldi09]

• Context information– Combination of context sources [Divvala09]– Graphical model of events in images [Li07]– Local segmentation + global classification [Shotton08] [Heitz08]

Page 7: Combining efficient object localization  and image classification

PASCAL VOC dataset

• PASCAL VOC dataset 2007 and 2008

• Two tasks : classification and localization

• Fixed train/test set-up for the 20 object classes

• Standard evaluation measure – Area of overlap as detection matching criterion– Average precision for performance evaluation

Page 8: Combining efficient object localization  and image classification

Overview

• Related work and datasets

• Efficient object localization– Experimental results

• Combining image classification and localization– Experimental results

• Conclusion

Page 9: Combining efficient object localization  and image classification

Efficient object localization

• Sliding window based approach

• Image representation• Combination of features• Extensive parameters evaluation

Robust image representation

• Efficient search strategy

Page 10: Combining efficient object localization  and image classification

Image representation

• Combination of 2 image representations

• Histogram Oriented Gradient– Gradient based features– Integral Histograms

• Bag of Features– SIFT features extracted densely + k-means clustering– Pyramidal representation of the sliding windows– One histogram per tile Histogram

Histogram

Histogram

Histogram

Histogram Histogram

Page 11: Combining efficient object localization  and image classification

Efficient search strategy

• Reduce search complexity– Sliding windows: huge number of candidate windows– Cascades: pros/cons

• Two stage cascade:– Filtering classifier with a linear SVM

• Low computational cost• Evaluation: capacity of rejecting negative windows

– Scoring classifier with a non-linear SVM• Χ2 kernel with a channel combination [Zhang07]• Significant increase of performance

Page 12: Combining efficient object localization  and image classification

Efficiency of the 2 stage localization• Performance w. resp. to nbr of windows selected by the linear SVM

(mAP on Pascal 2007)

• Sliding windows: 100k candidate windows• A small number of windows are enough after filtering

Page 13: Combining efficient object localization  and image classification

Localization performance: aeroplane

Method AP

X2, HOG+BOF 33.8

X2, BOF 29.8

X2, HOG 18.4

Linear, HOG 10.0

Page 14: Combining efficient object localization  and image classification

Localization performance: car

Method AP

X2, HOG+BOF 50.4

X2, BOF 42.3

X2, HOG 47.5

Linear, HOG 33.9

Page 15: Combining efficient object localization  and image classification

Localization performance

• Mean Average Precision on all 20 classes

• PASCAL 2007 dataset

Method mAP

Linear, HOG 14.6

Linear, BOF 15.0

Linear, HOG+BOF 17.6

X2, HOG 21.9

X2, BOF 23.1

X2, HOG+BOF 26.3

Page 16: Combining efficient object localization  and image classification

Localization examples: correct localizations

Car

Sofa

Bicycle

Horse

Page 17: Combining efficient object localization  and image classification

Localization examples: false positives

Car

Sofa

Bicycle

Horse

Page 18: Combining efficient object localization  and image classification

Localization examples: missed objects

Car

Sofa

Bicycle

Horse

Page 19: Combining efficient object localization  and image classification

Overview

• Related work and datasets

• Efficient object localization– Experimental results

• Combining image classification and localization– Experimental results

• Conclusion

Page 20: Combining efficient object localization  and image classification

• Image classification & localization use a different information

Combination: key points

• For many TP only one has a high score• Truncated objects: hard for the

detector• Small objects: ok for the detector but

not for the classifier using global information

Page 21: Combining efficient object localization  and image classification

• Input: classification ( Si ) and localization ( Sw ) scores

• Output: probability that object is present

• Suppose that classification and localization outputs are independent:

Combination model

Page 22: Combining efficient object localization  and image classification

• For each modality (classification/detection): notion of detectability P(Di) for classifier and P(Dw) for detector

• Encodes the ability to detect presence of the objects

• Assuming that the classifier/detector outputs conditional probabilities: P(O|Di,Si) and P(O|Dw,Sw)

Combination model

Page 23: Combining efficient object localization  and image classification

• P (O |Si) = P(Di) × P(O|Si, Di) + P(¬Di) × P(O|Si,¬Di)

• P (O |Sw) = P(Dw) × P(O|Sw, Dw) + P(¬Dw) × P(O|Si,¬Dw)

• Final probability:

• Handle both cases:– Object detectable by two modalities– Object detectable by only one modality

Combination model

Page 24: Combining efficient object localization  and image classification

• P(O|¬Di,Si) and P(O|¬Di,Si) : constant value

• Sw = classification by localization: highest localization score

• Priors P(Di) and P(Dw) class dependant

Combination model

Page 25: Combining efficient object localization  and image classification

Combination experimental setup

• Image classifier : INRIA_flat classifier– SVM classifier Χ2 kernel using multiple feature channels [Zhang07]– Excellent results in PASCAL 2008 challenge

• Detector : as described previously

• Experimental validation on PASCAL VOC 2007

• Comparison to the state of the art on PASCAL VOC 2008

Page 26: Combining efficient object localization  and image classification

Experimental results : gain obtained

• Classification

• Localization0

2

4

6

8

1 0

1 2

B o t t le P la n t T V C a r C o w

G a i n i n A P

0

1

2

3

4

5

6

7

8

C o w S h e e p B o t t le S o f a C a t

G a i n i n A P

Method mAP

Base Classifier 60.1

Our Combination 63.5

Method mAP

Base Detector 26.3

Our Combination 28.9

Page 27: Combining efficient object localization  and image classification

• Correct but low score localization• High classification score score increased after combination

Experimental results

Car localization

Page 28: Combining efficient object localization  and image classification

• High classification score• No localization score decreased after combination

Experimental results

Car classification

Page 29: Combining efficient object localization  and image classification

• Based on blind evaluation on PASCAL VOC 2008• Classification

– Best on 12 classes out of 20

• Localization

– Best on 11 classes out of 20

Comparison to the state of the art

Method mAP

Lear_flat 53.8

Lear_shotgun 54.5

SurreyUvA_SRKDA 54.9

UvA_TreeSFS 54.3

Our method (based on Lear_flat) 57.7

Method mAP

CASIA_Det 12.7

MPI_struct 10.4

UoCTTIUCI 22.8

Our method 22.7

Page 30: Combining efficient object localization  and image classification

Conclusion

• Efficient localization method

• Successful combination of classification and localization

• State of the art performance on both tasks

Page 31: Combining efficient object localization  and image classification

Thank you