cosine transformed row and column mean content based … · engineering) pimpri chinchwad college...

International Journal of Computer Applications (0975 – 8887)

Volume 112 – No 13, February 2015

10

Cosine Transformed Row and Column Mean Content

based Image Retrieval using Higher Energy Coefficients

with Image Tiling and Assorted Similarity Measures

Sudeep Thepade, Ph.D. Professor

Pimpri Chinchwad College of Engg.

Savitrbai Phule Pune University

Anil Lohar Professor

Pimpri Chinchwad College of Engg.

Savitrbai Phule Pune University

Payal Dhakate M.E Student(Computer

Engineering) Pimpri Chinchwad College of

Engg. Savitrbai Phule PuneUniversity

ABSTRACT

Content based image retrieval is an application of computer

vision technique used to solve the problem of searching

images in large databases. The paper introduces three

different techniques tested with two different datasets in order

to compare the retrieval efficiency and accuracy. The feature

vector is formed using Color and texture information

extracted. The color information extraction process includes

separation of image into Red, Green and Blue planes. Then

each plane is divided into 4 blocks and for each block row

mean vectors are calculated. This system uses Cosine

transform to generate the feature vectors of the query and

database images. Cosine transform is applied over a row mean

vector of each block separately, which gives a set of feature

vector of size 15elementsin the first technique. In the second

technique first 60 coefficients are considered for feature

vector formation. In the third technique after extraction of

Red, Green and Blue components, they are divided into four

parts. From each part separate row mean is calculated and

discrete cosine transform is applied on it and from each

component taking 5 values each block. For each color

component. The total feature vector of size 60 is created.

Euclidean, Minkowski and Absolute difference are used as

similarity measures to compare the image features for image

retrieval in proposed CBIR techniques. Two standard datasets

namely COIL and Wang are used for the experimentation

purpose. COIL dataset consist of 500 processed images in 10

different categories and Wang’s dataset consist of 1000

unprocessed images in 10 different categories. Average

Precision and Recall values are considered for checking

performance of all three techniques.

Keyword Image Retrieval, Energy Coefficients, CBIR, Cosine

Transform

1. INTRODUCTION Huge amount of images are being readily available to users

because of astonishing advancements in color imaging

technologies[7]. Now a days there is a rapid growth of image

databases in almost every field like medical science,

multimedia, geographical information system, photography,

Journalism, etc., An effective and efficient technique is

required to process the images . The images can be extracted

on the basis of shape, size, color etc. The numbers of images

are available from variety of sources such as digital camera,

digital video and scanners [1]. The CBIR i. e. Content based

image retrieval is an application of computer vision where

relatively similar images are retrieved from the large database

on the basis of their content such as texture, color, shape and

spatial layout[6].

This paper focuses on color and texture information of the

image. Because of its easy and fast computation, the color

feature has widely been used in CBIR systems,. Color is also

an intuitive feature and plays an important role in image

matching. The extraction of color features from images

depends on an understanding of the theory of color and the

representation of color in digital images[5]. We are focusing

on color and texture information of image. First we are

separating the image into R, G, B planes and then

decomposing the image plane into 4 blocks and applying

Cosine transform over row mean vectors of each block of it

[10]. This paper is organized in various sections. Section II

gives literature survey. Section III elaborates proposed CBIR

Techniques. Section IV explains the experimental

environment along with platform, test bed, performance

measure and similarity measures; Section V contains results

with the discussion.

2. LITERATURE SURVEY Cosine Transform(DCT) made up of cosine functions which

can take over half the interval and dividing this interval into N

equal parts and sampling each function at the centre of these

parts [2], then we get the DCT matrix which is formed by

arranging these sequences row wise or column wise[8]. The

discrete cosine transform is related to the discrete Fourier

transform. It is a separable linear transformation means the

two-dimensional transform is equivalent to a one-dimensional

DCT performed along a single dimension followed by a one-

dimensional DCT in the other dimension [4]. Also DCT

expresses a sequence of finitely many data points in terms of a

sum of cosine functions oscillating at different frequencies

[9]. DCTs are important in various applications of science and

engineering, from lossy compression of audio (e.g. MP3) and

images (e.g. JPEG). The texture filter functions provide a

statistical view of texture based on the image histogram[4].

F[K,I]=

𝒇 𝒎, 𝒏 𝒂 𝒌 𝒂(𝒍)𝒄𝒐𝒔[(𝟐𝒎+𝟏)𝒌𝝅

𝟐𝑴]𝒏𝑵−𝟏

𝒏=𝟎𝑴−𝟏𝒎=𝟎 𝑪𝑶𝑺[

(𝟐𝒏+𝟏)𝒍𝝅

𝟐𝑵]

Where, 𝑎 𝑘 =

1

𝑁 , 𝑓𝑜𝑟 𝑘 = 0

2

𝑁, 𝑓𝑜𝑟 𝑘 = 1,2, … . 𝑁 − 1

(1)



11

3. PROPOSED CONTENT BASED

IMAGE RETERIEVAL (CBIR)

TECHNIQUE The paper proposes about three image retrieval techniques

which are experimented on two different datasets to compute

the retrieval efficiency as well as accuracy of each individual

technique. Each technique generates a feature vector which is

based on color and texture information extracted from an

image and used for retrieval.

Following three techniques are used to generate feature vector

of an image. Calculated feature vectors are stored at

secondary storage called as feature vector table which will be

used further for computation of precision and recall values.

3.1 Single Tile Five Coefficients Step 1: Each individual plane of an image i.e. Red, Green and

Blue are captured separately and considered as a tile.

Figure 1. Sample Image with Color Components

Step 2: Row mean of respective row from each tile is

calculated and stored at a row mean

vector[2][11][12][13][14][15][16][17].

TABLE I Row mean of sample image matrix

144 215 …… 578

(144+215+…+578)/n

458 536 …… 125 (458+356+…+125)/n

465 258 …… 158 (465+258+…+158)/n

425 136 ….. 236 (425+136+...+236)/n

Step 3: Discrete cosine transform is applied on a row mean

vector of each tile to obtain higher Energy Coefficients.

Step 4: First five high energy coefficients per tile are

considered as a feature vector. In this way final feature vector

of an image is formed by combining first five high energy

coefficients from each red, green and blue tile. So that the

feature vector size will become 15. In a similar fashion feature

vector of all the images from both the datasets i.e. COIL and

Wang’s are calculated and stored at feature vector table.

3.2 Single Tile Twenty Coefficient Step 1: Each individual plane of an image i.e. Red, Green and


Step 2: Row mean of respective row from each tile is

calculated and stored at a row mean vector.



Step 4: Now to consider feature vector we are considering

first twenty high energy coefficients per tile. Here final

feature vector of an image is formed by combining first

twenty high energy coefficients from each red, green and blue

tile due to which feature vector size become sixty. Calculated

feature vectors are stored at feature vector table.

3.3 Four Tile Five Coefficients Per Tile Step 1: Each individual plane of an image i.e. Red, Green and


Step 2: Divide each tile into four sub tiles.

Figure 2. Four tiles per Color Components

Step 3: Row mean of each sub tile is calculated and stored at

its respective row mean vector.



Figure 3.Transformed tiles

Step 5: For computing feature vector now we are considering

first five high energy coefficients from each four computed

row mean vector per sub tile. In this way for a plane we are

getting twenty high energy coefficients from four sub tiles. So

final feature vector size will become sixty[twenty coefficients

per plane] Feature vector for both the COIL and Wang’s

datasets are computed and stored at respective feature vector

table.

20 coefficient of red Plane

20 coefficient of blue plane

20 coefficient of green plane

Figure 4.Feature vector formation

4. EXPERIMENTAL ENVIRONMENT The implementation of the discussed CBIR techniques is done

in MATLAB 7.0 and experimented on to standard datasets

namely Wang and COIL. Sample images from both the

datasets are show below in figure 6 and figure 7. All these

techniques are applied on two types of images that are

processed images and unprocessed images. Wang’s dataset

consist of all unprocessed images having different

background, whereas COIL database consist of processed

images having only black background. First dataset is of

Wang’s standard 1000 image database which consists of 1000

images divided into 10 different categories and each category

having 100 images in it. The second database is of

Columbia’s COIL dataset. This dataset consist of all object

images in black backgrounds that are divided into 10 different

categories and each category having 50 images in it. This

paper is proposed three content based image retrieval

comparing all three techniques experimented on two standard

datasets i.e. Wang and COIL with their respective similarity

measurement criteria values.

R1 R2

R3 R4

R3 R4

B3

B4

G1 G2

G3 G4

G3 G4

B1 B2

B3 B4

B3 B4

3

T1 T2

T3 T4

DCT

4

1 2



12

Figure 6. Sample images from Wang dataset

In case of COIL dataset all three techniques are applied on

500 images of different sizes that are divided into 10 ategories

and all images are in .png format.

Figure 5. Sample images from COIL dataset

While in case of Wang dataset consist of 1000 images in 10

different categories. Each category having 100 images in it

and all are in .jpeg format[12][18].

4.1 Similarity Measure Various similarity comparison distances are used to generate

the similarity measures. Some of them are absolute difference,

Sorensen and Euclidean distance. Explanation for each one is

given below.

4.1.1 Absolute Difference The absolute difference of two real numbers x, y is given by

|x − y|. It describes the distance on the real line between the

points corresponding to x and y. It is a special case of the Lp

distance for all 1 ≤ p ≤ ∞ and is the standard metric used for

both i.e. Q the set of rational numbers and their completion,

the R set of real numbers.

As with any metric, the metric properties hold:

|x − y| ≥ 0, since absolute value is always non-negative.

Also it having three properties.

|x − y| = 0 if and only if x = y.

|x − y| = |y − x| (symmetry or commutativity).

|x − z| ≤ |x − y| + |y − z| (triangle inequality); in the case of

the absolute difference, equality holds if and only if x ≤ y ≤ z.

4.1.2 Sorensen Distance The Sorensen used for comparing the similarity of two

samples. It was developed by the botanists Thorvaldsen

Sorensen and Lee Raymond Dice and published in 1948 and

1945 respectively. The original formula was intended to be

applied to presence or absence of the data which is given by

𝑄𝑆 =2𝐶

𝐴+𝐵 (2)

Where

A, B are the number of species in the samples A and B,

respectively.

C is the number of species sharing by the two samples.

QS is the quotient of similarity and it ranges from 0 to 1.

4.1.3 Euclidean Distance/Minkowski The Euclidean distance or Euclidean metric is the ordinary

distance between two points that one would measure with a

ruler, which is given by using Pythagorean formula. Euclidean

space becomes a metric space by the use of these formula.

Earlier literature refers to the metric as Pythagorean metric.

The Euclidean distance between points p and q can be define

as is the length of the line segment connecting them point p

and q ( ).In Cartesian coordinates, if p = (p1, p2,..., pn)

and q = (q1, q2,..., qn) are two points the Euclidean distance

between them is calculated by the following formula.

𝑑 𝑞. 𝑝

= 𝑞1 − 𝑝1 2 + 𝑞2 − 𝑝2

2 + ⋯ . . + 𝑞𝑛 − 𝑝𝑛 2

= (𝑞𝑖− 𝑞𝑖)

2𝑛𝑖=1

(3)

4.2 Performance Measure 4.2.1 Precision/ Recall [3][12] To check the effectiveness of the proposed CBIR techniques,

precision and recall are used as statistical comparison

parameters. The definitions of these two measures are given

by following equations.

𝑃𝑟𝑒𝑐𝑖𝑠𝑖𝑜𝑛

=𝑁𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑟𝑒𝑙𝑒𝑣𝑎𝑛𝑡 𝑖𝑚𝑎𝑔𝑒𝑠 𝑟𝑒𝑡𝑟𝑖𝑒𝑣𝑒𝑑

𝑁𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑟𝑒𝑙𝑒𝑣𝑎𝑛𝑡 𝑖𝑚𝑎𝑔𝑒𝑠 𝑟𝑒𝑡𝑟𝑖𝑒𝑣𝑒𝑑

(4)

𝑅𝑒𝑐𝑎𝑙𝑙

= 𝑁𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑟𝑒𝑙𝑒𝑣𝑎𝑛𝑡 𝑖𝑚𝑎𝑔𝑒𝑠 𝑟𝑒𝑡𝑟𝑖𝑒𝑣𝑒𝑑

𝑇𝑜𝑡𝑎𝑙 𝑛𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑟𝑒𝑙𝑒𝑣𝑎𝑛𝑡 𝑖𝑚𝑎𝑔𝑒𝑠 𝑖𝑛 𝑑𝑎𝑡𝑎𝑎𝑠𝑒

(5)

5. RESULTS AND DISCUSSION Below table II shows cross-over point values of all three

techniques when they applied on Wang’s dataset. Sorensen

gives highest cross-over point values as 0.5080 from the

technique one tile twenty coefficient [In this technique feature

vector size become sixty] where as Euclidean gives second

and third highest cross over point coefficient as 0.5064 and

0.5008 from the technique one tile five coefficient[In this

technique feature vector size become twenty] and four tiles

twenty coefficients respectively

http://en.wikipedia.org/wiki/Real_number

http://en.wikipedia.org/wiki/Lp_space

http://en.wikipedia.org/wiki/Lp_space

http://en.wikipedia.org/wiki/Metric_space

http://en.wikipedia.org/wiki/Rational_number

http://en.wikipedia.org/wiki/Commutativity

http://en.wikipedia.org/wiki/Triangle_inequality

http://en.wikipedia.org/wiki/Sample_%28statistics%29

http://en.wikipedia.org/wiki/Botanist

http://en.wikipedia.org/wiki/Thorvald_S%C3%B8rensen



http://en.wikipedia.org/w/index.php?title=Lee_Raymond_Dice&action=edit&redlink=1

http://en.wikipedia.org/wiki/Distance

http://en.wikipedia.org/wiki/Pythagorean_theorem

http://en.wikipedia.org/wiki/Metric_space

http://en.wikipedia.org/wiki/Line_segment

http://en.wikipedia.org/wiki/Cartesian_coordinates



13

TABLE II Average precision of one tile five coefficients,

one tile twenty values and four tile twenty

coefficients

Similarity

measures

One tile

five coefficients

One tile

twenty coefficients

four tile

twenty coefficients

Absolute

difference 0.5004 0.4956 0.4944

Sorensen 0.5052 0.5080 0.5000

Euclidean

\Minkowski 0.5064 0.4972 0.5008

TABLE III Average precision of one tile five

coefficients, one tile twenty values and four tile

twenty coefficients

Similarity

measures

One tile

five coefficients

One tile

twenty coefficients

four tile

twenty coefficients

Absolute

difference 0.8404 0.9072 0.6436

Sorensen 0.8360 0.9100 0.9632

Euclidean

\Minkowski 0.8090 0.8964 0.9156

Above table III shows cross-over point values of all three

techniques when they applied on Sorensen gives first and

second highest cross over point values as 0.9632 and 0.9100

from four tiles twenty coefficients and one tile twenty

coefficients [in both case feature vector size is sixty] followed

by absolute difference at third highest as 0.8404 from one tile

five coefficients [feature vector size twenty]

Figure 7.shows percentage precision with different

similarity measures, for all three techniques applied on

Wang’s datasets.

Figure 8.shows percentage precision with different

similarity measures, for all three techniques applied on

Coil datasets.

TABLE IV Similarity measures with one tile and different

values

Similarity

measures

Datasets

Average Row Mean Column Mean

Absolute

difference

0.7970 0.4968 0.8284 0.5040 0.65655

Sorensen

Distance

0.9030 0.5044 0.9045 0.5002 0.7030

Euclidean

\Minkowsi

Distance

0.8736 0.5014 0.8408 0.4984 0.67855

Average

0.8578 0.5008 0.8579 0.5008

Table IV. shows different similarity measures i.e. Absolute

difference, Sorensen and Euclidean distance along with

different average precision values having one tile five values,

one tile twenty values and four tile five values for Wang and

COIL datasets. For one tile 5 values Euclidean distance shows

highest values. For COIL it is 0.8096 and for Wang 0.5064.

and value for COIL dataset and Absolute difference shows

highest value for Wang dataset. Sorensen shows highest result

of average of all COIL and Wang dataset it is 0.903 and

0.5046. If we take overall percent precision Sorensen gives

good results as compared to the other techniques

6. CONCLUSIONS The content based image retrieval has become very vital

because of the numerous applications needing it. The paper

presents the performance improvement if image retrieval

techniques using Cosine transform using higher energy Coefficients as well as assorted similarity measures. The

proposed technique is experimented with the help of two

standard image datasets. In all three CBIR methods are

proposed with three similarity measures as Sorensen distance,

Absolute difference And Euclidean distance. In all the

0.49

0.495

0.5

0.505

0.51

One tile five coefficients

One tile twenty

coefficients

four tile twenty

coefficients

Pe

rce

nt

Pre

cisi

on

Absolute difference Sorensen Distance



14

Sorensen distance has given better performance with one tile

twenty coefficients both in Wang dataset as well as COIL

dataset in both row mean and column mean content based

image retrieval. The absolute difference gives least precision

among the three similarity measures considered. Also the

tiling does not help in performance boosting.

7. REFERENCES [1] Dr. H. B. Kekre, Sudeep D. Thepade, Tanuja K. Sarode,

Vashali Suryawanshi, “Image Retrieval using Texture

Features extracted from GLCM, LBG and KPE”,

International Journal of Computer Theory and

Engineering, Vol. 2, No. 5, October, 2010, pp1793-8201.

[2] H. B. kekre, Kavita Sonawane, “Retrieval of Images

Using DCT and DCT Wavelet Over Image Blocks”,

International Journal of Advanced Computer Science

and Applications, Vol. 2, No. 10, 2011.

[3] Dr. H. B. Kekre, Sudeep D. Thepade, Varun K. Banura,

“Amelioration of Color Averaging Based Image

Retrieval Techniques using Even and Odd parts of

Images”, International Journal of Engineering Science

and Technology Vol. 2(9), 2010, 4238-4246.

[4] J. Bridget Nirmala, S.Gowri, “A Content based CT Lung

Image Retrieval by DCT Matrix and Feature Vector

Technique”, IJCSI International Journal of Computer

Science Issues, Vol. 9, Issue 2, No 2, March 2012

[5] Manimala Singh, K.Hemachandran, “Content Based

Image Retrieval using Color and Texture”, Signal &

Image Processing : An International Journal, 3, Vol

No.1, February 2012 .

[6] Yixin chen, member IEEE, james z. Wang, member

IEEE, and robertkrovetz clue, ”Cluster-Based Retrieval

Of Images By UnsupervisedLearning”, IEEE

Transactions On Image Processing, Vol.14, No. 8,August

2005.

[7] Qasim Iqbal And J. K. Aggarwal, Cires, “A System For

Content-BasedRetrieval In Digital Image Libraries”,

Seventh International ConferenceOn Control,

Automation, Robotics And Vision (Icarcv’02), Dec

2002,Singapore.

[8] S. Santini and R. Jain, “Similarity measures”, IEEE

Trans. Pattern Anal.Mach. Intell., vol., 2005, in press.

[9] E. de Ves, A. Ruedin, D. Acevedo, X. Benavent, and L.

Seijas, “A New Wavelet-Based Texture Descriptor

forImage Retrieval‖” , CAIP 2007, LNCS 4673, pp. 895–

902, 2007, Springer-Verlag Berlin Heidelberg 2007.

[10] H.B.Kekre, Dhirendra Mishra, “DCT-DST Plane

sectorization of Rowwise Transformed color Images in

CBIR” , International Journal of Engineering Science

and Technology, Vol. 2 (12), 2010, 7234-7244.

[11] J. Bridget Nirmala and S.Gowri, “A Content based CT

Lung Image Retrieval by DCT Matrix and Feature

Vector Technique”, International Journal of Computer

Science Issues, Vol. 9, Issue 2, No 2, March 2012 ISSN

(Online): 1694-0814.

[12] Ajinkya P. Nilawar, “Image Retrieval Using Gradient

operators”, Journal on Recent and Innovation trends in

Computing and Communication, ISSN 23218169, Vol.2

Issue 1.

[13] H.B.Kekre, Tanuja Sarode, Sudeep D. Thepade, “DCT

Applied to Row Mean and Column Vectors in

Fingerprint Identification”, In Proceedings of Int. Conf.

on Computer Networks and Security , 27-28 Sept. 2008,

VIT, Pune.

[14] H. B.Kekre, Sudeep D. Thepade, “Image Blending in

Vista Creation using Kekre's LUV Color Space”, SPIT-

IEEEColloquium and Int. Conference, SPIT, Andheri,

Mumbai, 04-05 Feb 2008.

[15] H.B.Kekre, Sudeep D. Thepade, “Boosting Block

Truncation Coding using Kekre’s LUV Color Space for

Image Retrieval”, WASET Int.

[16] H.B.Kekre, Sudeep D. Thepade, “Image Retrieval using

Augmented Block Truncation Coding Techniques”,

ACM Int. Conf. ICAC3-09, 23-24 Jan 2009, FCRCE,

Mumbai. Is uploaded on ACM portal.

[17] H.B.Kekre, Sudeep D. Thepade, “Color Traits Transfer

to Grayscale Images”, In Proc.of IEEE First International

Conference on Emerging Trends in Engg. & Technology,

(ICETET-08).

[18] Dr.H.B.Kekre, Sudeep D. Thepade, Shobhit W., Miti, K

Styajit”Image retrieval with shape features extracted

using gradient operators and slope magnitude technique

with BTC”, Intrenational Journal Of compyter

Applications, Vol. 6, Number 8, pp.28-33, Sept2010.

IJCATM : www.ijcaonline.org

cosine transformed row and column mean content based … · engineering) pimpri chinchwad college...

Documents