insect species recognition using sparse...

Post on 29-Mar-2019

217 Views

Category:

Documents

0 Downloads

Preview:

Click to see full reader

TRANSCRIPT

Insect Species Recognition using Sparse Representation

An Lu Student1

alu@nlpr.ia.ac.cn

Xinwen Hou Prof1

xwhou@nlpr.ia.ac.cn

Xiaolin Chen Prof2

xlchen@ioz.ac.cn

Chenglin Liu Prof1

liucl@nlpr.ia.ac.cn

1 Institute of AutomationChinese Academy of SciencesBeijing, China

2 Institute of ZoologyChinese Academy of SciencesBeijing, China

Insect species recognition is a typical application of image categorizationand object recognition. In this paper, we propose an insect species recog-nition method based on class specific sparse representation. On obtainingthe vector representation of image via sparse coding of patches, an SVMclassifier is used to classify the image into species. We propose two classspecific sparse representation methods under weakly supervised learningto discriminate insect species which have substantial similarity to eachother. Experimental results show that the proposed methods perform wellin insect species recognition and outperform the state-of-the-art methodson generic image categorization.

Our work is motivated by the sparse coding spatial pyramid matching(ScSPM) model for generic image categorization [3], sparse representa-tion based classification (SRC) algorithm for face recognition [2] and im-age restoration method [1]. All of the three methods are based on sparserepresentation. Comparing to these methods, we calculate bases of eachclass to obtain the class specific sparse representations in the strategies ofminimal reconstruction residual and sparsity of local features.

Our bases construction method is based on [3]. However their methodis more suitable for generic object image datasets which have more dis-tinction between classes than that of insect species. So motivated by thework of [2], we adopt a class specific bases construction strategy. Theholistic optimization problem can be formulated as:

minBi,S(i)

Ni

∑ni=1‖x(i)ni −Bis

(i)ni ‖

22 +λ‖s(i)ni ‖1, f or i = 1,2, . . . ,C

s.t. ‖bi,ki‖2 ≤ c, ∀ki = 1,2, . . . ,Ki

(1)

For each class we calculate its own basis matrix by an iterative process.Firstly we randomly initialize basis matrix Bi to calculate the new sparsecodes s(i)ni for input vector x(i)ni for each class i :

mins(i)ni

‖x(i)ni −Bis(i)ni ‖

22 +λ‖s(i)ni ‖1 (2)

Then we fix sparse codes S and solve the following optimization problemwith constraint:

minBi

Ni

∑ni=1‖x(i)ni −Bis

(i)ni ‖

22

s.t. ‖bi,ki‖2 ≤ c, ∀ki = 1,2, . . . ,Ki

(3)

At the same time, for any new input vector xnew, we can get C coefficientvectors (sparse codes) s(i)new respectively to each basis matrix by solvingthe optimization problem:

mins(i)new

‖xnew−Bis(i)new‖2

2 +λ‖s(i)new‖1, f or i = 1,2, . . . ,C (4)

We proposed two strategies to concatenate the C coefficient vectors intoone sparse vector to represent the original input vector. The first one wecall it minimal residual class specific sparse representation (MRCSSR).That means we take the coefficient vector which minimizes the residualof reconstruction as its original value and other vector as zero.

p = argmini‖xnew−Bs(i)new‖2

snew = [0,0, . . . ,0︸ ︷︷ ︸K1

,0,0, . . . ,0︸ ︷︷ ︸K2

, . . . ,(s(p)new)

T︸ ︷︷ ︸K p

, . . . ,0,0, . . . ,0︸ ︷︷ ︸KC

]T(5)

Figure 1: Example of our Tephritidae dataset

Sub-dataset Whole Head Thorax Abdomen WingSpecies 19 19 20 17 14Images 152 143 151 144 103

Table 1: Number of species and images in each sub-dataset.

The second strategy we call it sparsest class specific sparse representation(SCSSR). That means we take the coefficient vector which is sparsest torepresent the original feature and other vector as zero. Here we take L0norm to evaluate the sparsity of the coefficient vectors.

p = argmini‖s(i)new‖0

snew = [0,0, . . . ,0︸ ︷︷ ︸K1

,0,0, . . . ,0︸ ︷︷ ︸K2

, . . . ,(s(p)new)

T︸ ︷︷ ︸K p

, . . . ,0,0, . . . ,0︸ ︷︷ ︸KC

]T(6)

After calculating the sparse representation of each input vectors, any pool-ing method such as averaging or max pooling [3] can combine thesesparse codes of input vectors belonging to the same sample together toobtain the final feature vectors. Then any learning method such as neuralnetworks or SVM is competent for the recognition task.

Our Tephritidae dataset is composed of 3 genera and 20 species. Eachspecimen is taken one photograph respectively of its whole body, head,thorax, abdomen and wing (as shown in Fig.1). So we divide the wholedataset into 5 sub-dataset according to different part of specimen.Table. 1shows the number of species and photographs of the 5 sub-datasets.

Our experiments on the Tephritidae dataset and Caltech101 datasetdemonstrated the effectiveness of our methods. We believe that construct-ing a basis matrix for each class will take more discriminative informationinto the final sparse representation and both the minimal residual and thesparsest strategy remove some noise among other similar classes and re-main the information which is utmost expressive for the true classes.

[1] J. Mairal, M. Elad, and G. Sapiro. Sparse representation for colorimage restoration. IEEE Trans. IP, 17(1):53–69, 2008.

[2] J. Wright, A.Y. Yang, A. Ganesh, S.S. Sastry, and Y. Ma. Robustface recognition via sparse representation. IEEE Trans. PAMI, 31(2):210–227, 2009.

[3] J. Yang, K. Yu, Y.H. Gong, and T. Huang. Linear spatial pyramidmatching using sparse coding for image classification. In CVPR,2009.

top related