dimo dimov, iit-bas harmonic and wavelet transform...

23
Harmonic and Wavelet Transform Harmonic and Wavelet Transform Combination for Fast Content Based Combination for Fast Content Based Image Retrieval Image Retrieval by Dimo T. Dimov by Dimo T. Dimov Institute of Information Technologies Institute of Information Technologies Bulgarian Academy of Sciences Bulgarian Academy of Sciences dtdim dtdim@iinf iinf .bas.bg .bas.bg a presentation at the Conference on "PDE Methods in Applied Mathematics and Image Processing" "PDE Methods in Applied Mathematics and Image Processing" (Sep. 7 Sep. 7-10, 2004, Sunny Beach, Bulgaria 10, 2004, Sunny Beach, Bulgaria ) Copyright, 2004 © IIT - BAS. 2 Abstract Abstract Content Based Image Retrieval (CBIR) is a relatively new area of Informatics and covers methods, techniques, and approaches for automatic (or automated) retrieval of images by content description based on simple features as color, texture, shape, movement, etc., as well as on structures over them. Each CBIR system comprises a Database of Images (IDB), for which it should provide content-based image access methods being enough fast and reliable from a user viewpoint. From the other hand, the CBIR can be considered also a pattern recognition approach using a large dictionary of image examples, where the accents are put on images preprocessing to adapt the appropriate recognition technique. A new heuristic approach to fast and reliable CBIR will be presented, namely arranged as a combination of Fourier transform and wavelet transform for images considered in a polar coordinate system. Experimental results will be also committed in comparison with other CBIR approaches experimented by the EIRS (Experimental Image Retrieval System of IIT) on a real IDB of hallmark images. Obviously, the approach proposed can be considered based on a linear 2D transform for global processing of initial images. The approach combination with differential filtering techniques of first and second order for possible image contouring will be also brought to discussion due to the Conference topic. Key words Key words Content Based Image Retrieval (CBIR), fast and reliable CBIR, Fourier’s and wavelets’ transform of images.

Upload: vanlien

Post on 17-Apr-2018

218 views

Category:

Documents


4 download

TRANSCRIPT

Dimo Dimov, IIT-BAS 17/09/2004

My expose at the Conf.on PDE in AM & IP,Sep. 7-10, 2004, Sunny Beach, Bulgaria 1

Harmonic and Wavelet TransformHarmonic and Wavelet TransformCombination for Fast Content Based Combination for Fast Content Based

Image RetrievalImage Retrieval

by Dimo T. Dimovby Dimo T. DimovInstitute of Information TechnologiesInstitute of Information Technologies

Bulgarian Academy of SciencesBulgarian Academy of Sciencesdtdimdtdim@@iinfiinf.bas.bg.bas.bg

a presentation at the Conference on"PDE Methods in Applied Mathematics and Image Processing""PDE Methods in Applied Mathematics and Image Processing"

((Sep. 7Sep. 7--10, 2004, Sunny Beach, Bulgaria10, 2004, Sunny Beach, Bulgaria ))

Copyright, 2004 © IIT - BAS.

2

AbstractAbstract Content Based Image Retrieval (CBIR) is a relatively new area ofInformatics and covers methods, techniques, and approaches for automatic (or automated) retrieval of images by content description based on simple features as color, texture, shape, movement, etc., as well as on structures over them. Each CBIR system comprises a Database of Images (IDB), for which it should provide content-based image access methods being enough fast and reliable from a user viewpoint. From the other hand, the CBIR can be considered also a pattern recognition approach using a large dictionary of image examples, where the accents are put on images preprocessing to adapt the appropriate recognition technique.

A new heuristic approach to fast and reliable CBIR will be presented, namely arranged as a combination of Fourier transform and wavelet transform for images considered in a polar coordinate system. Experimental results will be also committed in comparison with other CBIR approaches experimented by the EIRS (Experimental Image Retrieval System of IIT) on a real IDB of hallmarkimages.

Obviously, the approach proposed can be considered based on a linear 2D transform for global processing of initial images. The approach combination with differential filtering techniques of first and second order for possible image contouring will be also brought to discussion due to the Conference topic.

Key wordsKey words Content Based Image Retrieval (CBIR), fast and reliable CBIR, Fourier’s and wavelets’ transform of images.

Dimo Dimov, IIT-BAS 17/09/2004

My expose at the Conf.on PDE in AM & IP,Sep. 7-10, 2004, Sunny Beach, Bulgaria 2

3

ContentsContents slide No./46slide No./46

• Introduction 4CBIR (Content Based Image Retrieval) and IDB (Image Data Bases)

• The EIRS (Experimental Image Retrieval System) 8Images of interest, IDB of EIRS, and a demo of EIRSThe EIRS in brief ( 3 basic techniques proposed )

• This talk aims 20the most promising case by a 2D Fourier Transform (2DFT) of images,extensions: by a simple Polar Map (SPM) + a 2DFT, andby a SPM + combined FTWT (= FT (on angles) + WT (on distances) )Experiments. Result analysis

• Near future work 32- Possible application in the Bulgarian Patent Office (PORB) practice- Restrictions of the proposed techniques. Current problems to solve.

• Conclusions 37• References. Acknowledgements 39• Some extra pages 41

4

IntroductionIntroduction•• CBIR (Content Based Image Retrieval)CBIR (Content Based Image Retrieval)•• Image Data Bases (IDB)Image Data Bases (IDB)•• SomeSome examples (examples (GEMINI, GEMINI, QBIC (IBM), CORE,QBIC (IBM), CORE,

ARTISAN) of the “early” CBIRARTISAN) of the “early” CBIR•• How they work:How they work:- color, tone or texture histograms, directional histograms

of sub- objects, elastic constellations of them, etc..- full shape representation (segmentation troubles,

problems of noise)•• Search in Search in IDBsIDBs (sequential or indexed access)(sequential or indexed access)•• EIRS (the Experimental Image Retrieval System of IIT)EIRS (the Experimental Image Retrieval System of IIT)

Dimo Dimov, IIT-BAS 17/09/2004

My expose at the Conf.on PDE in AM & IP,Sep. 7-10, 2004, Sunny Beach, Bulgaria 3

5

The CBIR state of the art in brief• CBIR – the retrieval of images on the bases of features

automatically derived from the images themselves.

• CBIR draws many of its methods from the field of image analysis and processing, computer vision, statistics, and pattern recognition.

• An ordinal CBIR system includes a database (IDB) for keeping image data and for maintaining the typical user requests, a graphic user interface for request wording and result visualization, and suitable indexing techniques for the feature vectors treasuring.

• CBIR systems examples: TRADEMARK (1992), QBIC (1993), Photobook (1994), ARTISAN (1999), Picture-Finder (2002),…

cf. [Eakins], [Stanchev], [Smeulders et al], [Vasconselos et all], [Zaniolo et all].

6

The CBIR state of the art in briefThe CBIR state of the art in brief (continue 1)• 3 levels of CBIR [Eakins]:

- by primitive features - color, texture, shape (representative points and their histograms), etc., e.g. “all pictures containing yellow stars in a ring <?>” - by attributes or logical features, involving some degree of inference among objects in the picture: e.g. ”a train crossing abridge <?>”-by abstract attribute; e.g. ”all pictures illustrating pageantry”

• The future CBIR levels 2 & 3 (will) rely on AI areas of Informatics: knowledge representation, syntactic/semantic approaches, machine learning, etc..

• CBIR level 1 <=> “early” CBIR, and it seems everybody are stilltherein [Eakins], [Smeulders et al].

Dimo Dimov, IIT-BAS 17/09/2004

My expose at the Conf.on PDE in AM & IP,Sep. 7-10, 2004, Sunny Beach, Bulgaria 4

7

The CBIR state of the art in briefThe CBIR state of the art in brief (continue 2)

• Possible applications of the “primitive” CBIR:-for large amount of given class images: trademarks, postmarks, stamps, fingerprints, industrial components, etc..-Accent on the CBIR system’s interaction with the user: search by association, by example, by category; Vienna convention in the practice of patent offices, assay offices, etc.. -Image DB: extension over conventional DBs and/or DB Management Systems (DBMS).-The problem of effectiveness (access speed and noise tolerance).

8

The EIRS (Experimental Image Retrieval System)The EIRS (Experimental Image Retrieval System)The iThe imagesmages of interestof interest

((EExamples the EIRS methods are designed for)xamples the EIRS methods are designed for)An example of a complete hallmark, parts (from left to right)

represent the sponsor’s mark, the mark for sterling silver, the London Assay Office mark and the date mark for 1976, cf. [A booklet about Hallmarks on gold, silver & platinum, the Assay Office Publication Dept., London., 10p.]

Dimo Dimov, IIT-BAS 17/09/2004

My expose at the Conf.on PDE in AM & IP,Sep. 7-10, 2004, Sunny Beach, Bulgaria 5

9

The iThe imagesmages of interestof interest((Other examplesOther examples suitable)suitable)

Examples of Bulgarian hallmarks on old silverware, used as maker’s marks in 1907-1919. Numbers below the images of Tsarina’s crown represent silver standards, cf. [Int. Hallmarks].

10

IDB of EIRSIDB of EIRS: the first 29 images of hallmarks − from the 4000 ones already loaded into the test IDB (as they are semantically ordered in [Int. Hallmarks]):

Dimo Dimov, IIT-BAS 17/09/2004

My expose at the Conf.on PDE in AM & IP,Sep. 7-10, 2004, Sunny Beach, Bulgaria 6

11

The IDB of EIRS simple logical schemeThe IDB of EIRS simple logical scheme

Extra Description Records

User Data Fields Text Description

General Description RecordsSystem field(s) User Data Fields (F1÷Fm)

Fast Search Data

Image files(BMP, JPG, etc.)

Extended text files

12

The EIRS system ( The EIRS system ( a a DEMODEMO ))

• If the Demo is not possible show next 4 slides:

•• EIRS: the IDB browserEIRS: the IDB browser

•• EIRS: EIRS: the Searchthe Search--engineengine start menustart menu

•• EIRS: EIRS: the Search engine the Search engine (result table I)(result table I)

•• EIRS: EIRS: the Search engine (result table II)the Search engine (result table II)

Dimo Dimov, IIT-BAS 17/09/2004

My expose at the Conf.on PDE in AM & IP,Sep. 7-10, 2004, Sunny Beach, Bulgaria 7

13

EIRS: the IDB browserEIRS: the IDB browser

14

EIRS: EIRS: the Search-engine start menu

Dimo Dimov, IIT-BAS 17/09/2004

My expose at the Conf.on PDE in AM & IP,Sep. 7-10, 2004, Sunny Beach, Bulgaria 8

15

EIRS: EIRS: the Search engine (result table I)

16

EIRS: EIRS: the Search engine (result table II)

Dimo Dimov, IIT-BAS 17/09/2004

My expose at the Conf.on PDE in AM & IP,Sep. 7-10, 2004, Sunny Beach, Bulgaria 9

17

The EIRS system in briefThe EIRS system in brief• A view on a retrieval experiment

• 3 fast CBIR techniques proposed:- T1: ToCFT using a 1/D Fourier’s transform to the image tree of

contours. - T2: 2DWT using a 2/D wavelet transform.- T3: 2DFT using a 2/D Fourier’s transform (to be extended hereinafter).

• Common principles for the 3 techniques:- The search content is the input image itself or a sketch of it- The most essential image data are automatically extracted and arranged in a key string of a fixed length- The fast access is performed on this key data using conventional index access methods of the DBMS of current use.

18

All fast access methods work using the convention of a “key” andAll fast access methods work using the convention of a “key” and give the give the search result as follows:search result as follows:

where “(i) (ii)” means “(i) is the greatest value less than or equal to (ii)”.

p(record)object retrieved theof key value the(record)object searched theof key value thep

p

• An intuitive definition: The image keyimage key or simply the keykeyshould consist of the most essential information of given image, structured in descending order into a one-dimensional (1/D) array of fixed length.

Intuitive Requirements to the image key performance

Dimo Dimov, IIT-BAS 17/09/2004

My expose at the Conf.on PDE in AM & IP,Sep. 7-10, 2004, Sunny Beach, Bulgaria 10

19

Requirements to the image key definition:Requirements to the image key definition:• The key should consist of the (most) essential information of the image.

• This key information should be of significantly smaller volume than the image itself.

• The key information volume should be written in a 1/D array of fixed length.

• The key information should be structured in a way that essential parts to appear in frontier positions.

• The image key is not obligatory to be unique, i.e. for a given key length there can be more than one image to correspond.

• All DB images of one and the same key should be considered as equivalent images.

• The key length should be chosen in such a way that the image key to be enough informative one (from user viewpoint).

20

This talk aim:This talk aim:•• Brief comparison of the 3 basic techniques of EIRS for fast and Brief comparison of the 3 basic techniques of EIRS for fast and reliable CBIR.reliable CBIR.

•• Accent on the rotation invarianceAccent on the rotation invariance −− against possible accidental rotation of input against possible accidental rotation of input images to search. images to search.

•• The other type of invariance against translation, scaling, mirroThe other type of invariance against translation, scaling, mirroring and/or ring and/or illumination irregularities can be considered resolved illumination irregularities can be considered resolved –– by preliminary evaluation by preliminary evaluation and/or compensation. This approach is considered bad practice toand/or compensation. This approach is considered bad practice to realize rotation realize rotation invariance, cf. also [Reiss]. Internal integration of rotation iinvariance, cf. also [Reiss]. Internal integration of rotation invariance within EIRS nvariance within EIRS key derivation techniques is necessary.key derivation techniques is necessary.

•• Integration of rotation invariance within EIRS techniques:Integration of rotation invariance within EIRS techniques:

-- in T1 (in T1 (ToCFT) –– resolved by the definition of 1DFT applied to resolved by the definition of 1DFT applied to ToCToC (the tree of (the tree of contours). But it has noisy troubles with very close couples of contours). But it has noisy troubles with very close couples of contours.contours.

-- in T2 (2DWTin T2 (2DWT) –– principal difficulties for integration because of the 2DWT principal difficulties for integration because of the 2DWT definition (nevertheless the good properties for noise tolerancedefinition (nevertheless the good properties for noise tolerance organization here).organization here).

-- in T3 (2DFT)in T3 (2DFT) –– the most promising for both the rotation invariance integrationthe most promising for both the rotation invariance integrationand the good properties for noise tolerance organization. and the good properties for noise tolerance organization.

Dimo Dimov, IIT-BAS 17/09/2004

My expose at the Conf.on PDE in AM & IP,Sep. 7-10, 2004, Sunny Beach, Bulgaria 11

21

An original mark image.

EIRS: Image key generationEIRS: Image key generation((a visual comparison)a visual comparison)

Its ToCFT key(i.e. by a 1D Fourier’s transform on the image’s Tree of Contours).

Its 2DWT key (i.e. by a 2D Wavelets’ transform on the whole image).

Its 2DFT key(i.e. by a 2D Fourier’s transform

on whole image).

22

(T1): ToCFC:ImageImage ::-->> ToC (the Tree of image Contours) ToC (the Tree of image Contours) ::-->>

::--> 1DFT to ToC > 1DFT to ToC ::-->>::--> a heuristic arrangement of the > a heuristic arrangement of the

frequencfrequency y modulamodula ::-->>::--> IDB key> IDB key

Pic.frame

About the 3 techniques of EIRS:

(T2): 2DWT:ImageImage ::--> 2DWT of the image (WT package,> 2DWT of the image (WT package,

a quada quad--tree of decomposition) tree of decomposition) ::-->>::--> consider only the leafs > consider only the leafs ::-->>::--> or> ordder them by “importance” er them by “importance” ::-->>::--> IDB key> IDB key

the root (the image itself)

the leafs

(T(T33): ): 2DFT2DFT::ImageImage ::--> 2DFT of the image > 2DFT of the image ::-->>

::--> scan appropriately > scan appropriately the 2d frequency space the 2d frequency space ::-->>

::--> IDB key> IDB key

ωx

ωy

Dimo Dimov, IIT-BAS 17/09/2004

My expose at the Conf.on PDE in AM & IP,Sep. 7-10, 2004, Sunny Beach, Bulgaria 12

23

(a) An original mark image.

EIRS/T1:: the ToCFT method to image key generation:the ToCFT method to image key generation:

(c) The resulting ToCFT key

(b) All contours of the image -> the image Tree of Contours (ToC)

24

(a) An original mark image.

EIRS/T2:EIRS/T2: the 2DWT method to image key generation:the 2DWT method to image key generation:

(c) The resulting 2DWT key

(b) 2D Wavelets’ transform on the binarized image.

Dimo Dimov, IIT-BAS 17/09/2004

My expose at the Conf.on PDE in AM & IP,Sep. 7-10, 2004, Sunny Beach, Bulgaria 13

25

(a) An original mark image.

EIRS/T3:EIRS/T3: the 2DFT method to image key generation:the 2DFT method to image key generation:

(c) The resulting 2DFT key

(b) 2D Fourier’s transform on the binarized image (a logarithmic scalefor the modula-spectrum).

26

Image geometric transforms’ influence on IERS/T3 approach(2DFT and its extensions)

2DFT: 2,,;)()exp()())(()( =∈−== ∫ ndjggG nTTTTT Eωxxωxxxω F

or dxdyyxjyxgG yxyx ))(exp(),(),( ωωωω +−= ∫∫or dxdyeyxgG yx yxj

yx ∫∫ +−= )(),(),( ωωωω

Translation:

Scale:

Rotation:

))(())~(~(i.e.

.~,)~(~)~exp()()~(~~,)()~(~

TT

TTTT

TT

gg

GjGG

gg

xFxF

ωωωω∆ωω

∆xxxx

=

=−=⇔

+=⇔

))(())~(~(i.e.

.~,)~(~)()~(~0

0,~,)()~(~

1 TT

TTT

y

xTTT

gg

GGG

ss

gg

xFSxF

ωSωωSωω

SxSxxx

−=

==⇔

==⇔

)(()~(~(i.e.

~,)~(~)()~(~cossinsincos

,~,)()~(~

TT

TTTT

TTT

gg

GGG

gg

xFxF

ωRωωωω

RxRxxx

=

==⇔

−==⇔

αααα

Modula-spectrumis invariant.

Modula-spectrumvastly depends on scale matrix.

Modula-spectrumis rotated as theoriginal image is.

Dimo Dimov, IIT-BAS 17/09/2004

My expose at the Conf.on PDE in AM & IP,Sep. 7-10, 2004, Sunny Beach, Bulgaria 14

27

(a) An original mark image.

EIRS/T3:EIRS/T3: an extension: an extension: a simple Polar Map (SPM) of the image before it’s 2DFT:a simple Polar Map (SPM) of the image before it’s 2DFT:

(c) The resulting SPM-2DFT key

(b1) Preliminary SPM of the binarized image, and(b2) it’s 2D Fourier’s transform (a logarithmic scale for modula-spectrum).

28

(a) An original mark image.

EIRS/T3:EIRS/T3: next extension (still under development):next extension (still under development):= = SPM + FTWT, FTWT = SPM + FTWT, FTWT = FT on angles (horizontally), get modula

and then a WT on distances (vertically).

(!) Zoom of the Spectra essential part(linear scale for visualization)

(b1) Simple Polar Map of the binarized image, and(b2) it’s FTWT (a logarithmic scale for visualization).

Dimo Dimov, IIT-BAS 17/09/2004

My expose at the Conf.on PDE in AM & IP,Sep. 7-10, 2004, Sunny Beach, Bulgaria 15

29

(a) 2DFT of the original image.

(b) Simple Polar Map and 2DFT (!)promising experiments,cf. also [Reiss], and [Zhang & Lu].

(c) Simple Polar Map and FTWT (?)(a heuristics to be experimentedand proved)

EIRS/T3:EIRS/T3: A comparison of the essential A comparison of the essential spectrum areas for the above 3 approaches:spectrum areas for the above 3 approaches:

30

Experiments (notes to results’tables)Experiments (notes to results’tables)(1) Pure access time: it includes the time for image key generation as well as the time for fault-tolerant search of the generated key among all IDB-image keys preliminary loaded into the main memory. It can be theoretically evaluated to about ~ log2|IDB|.

The full access time (for CPU Intel Celeron 333MHz, MM SDRAM 192MB, HDD IDE-47 20GB ) is about 3÷5s for the main test IDB (3937 images) and less that a 1s for the extra test IDB (146 images). Note that 2 extra seconds are necessary for the EIRS to visualize the results. (2) IDB load time: it can represent the time for a conventional sequential access to IDB, by the averaged formulae ~ |IDB|.(3) Number of bad experiments: For each image of the given IDB a retrieval experiment is conducted, i.e the number of all experiments is |IDB|. An experiment is denoted “bad” if no exact match occurs between the input and the most similar image retrieved. (4) Experimental recognition rate: an “ambitious” evaluation by the bad experiments’ percentage relatively to the all experiments’ number (|IDB|).

Dimo Dimov, IIT-BAS 17/09/2004

My expose at the Conf.on PDE in AM & IP,Sep. 7-10, 2004, Sunny Beach, Bulgaria 16

31

Experiments Experiments (results for two (results for two test test IDBIDB--ss))EIRS

method used

IDB access (1)

time [s /img] ( pure | full )

IDB load (2)

time [s]

Number (3)

of bad experiments

Experimental (4)

recognition rate [%]

ToCFT 0.44 | 3÷5 1076 ~ 18 min 71 1.8

2D-FT 0.95 | 3÷4 2113 ~ 35 min 127 3,2

2D-WT 2.79 | 4÷7 6365 ~ 106 min 975 24.7

Table 1. Experimental results on the main test IDB, |IDB| = 3937 half-tone images of hallmarks

Table 2. Experimental results on the extra test IDB, |IDB| = 146 multicolor/tone images of POBG trademarks

EIRS method

used

IDB access (1)

time [s /img] ( pure | full )

IDB load (2)

time [s]

Number (3)

of bad experiments

Experimental (4)

recognition rate [%]

ToCFT 0.19 | <1.0 14 57 38

2D-FT 0.65 | <1.0 74 22 15

2D-WT 0.68 | <1.0 78 64 44

32

Near future workNear future work•• Possible application in the current practice of Bulgarian Bulgarian Patent Office (PORB).Patent Office (PORB).Trademark examples from the current practice of POBG:15 trademark images (from the extra test IDB) are shown, derived in 5 typical categories:

(1) Enough clean images

(2) Relatively clean images

Dimo Dimov, IIT-BAS 17/09/2004

My expose at the Conf.on PDE in AM & IP,Sep. 7-10, 2004, Sunny Beach, Bulgaria 17

33

(3) Multicolor and/or halftone images

(4) Images with a great level of artifacts

(5) Very dirty images

34

Other possible applications::Extensions/Modifications (of EIRS & IDB ) for:Extensions/Modifications (of EIRS & IDB ) for:- numismatics- banking, bond- hobbies- philately. - forensic expertise- machinery designIn all mentioned cases (excepting the forensic expertise) the images

can be considered “static” ones, because of the original are usually produced by precise machines (of printing and publishing industry)Additionally “dynamic” image recognition by examples can be

performed for short video clips: e.g. face speaking in a frame, palm gesticulating a sign language, object flying in the sky, etc..

2 general versions of IDB performance:- a local IDB: cf. test IDB of EIRS.- a distributed IDB, Internet applications, etc..

Dimo Dimov, IIT-BAS 17/09/2004

My expose at the Conf.on PDE in AM & IP,Sep. 7-10, 2004, Sunny Beach, Bulgaria 18

35

The EIRS techniques’ restrictions:The EIRS techniques’ restrictions:- The techniques are restricted to pictures containing of well- localizable essential graphics (i.e. gray scale or colored objects vs. a picture background). - The pictures should not contain a great level of noise, and especially an “artificial” noise (artifacts).

An idea to avoid the restrictions:An idea to avoid the restrictions: by a level of manual preprocessing of images: the user important fragment(s) could be preliminary and interactively (!) extracted, and that means a loss of automation (what is unacceptable).

36

Current Problems to Solve:- The 3 basic techniques of the EIRS has been developed in a concurrence to met different image types.

- EIRS/T1 is the best one for now. It is invariant to image accidental rotation, but is sensible to a specific noise − small and/or regular noise may cause tremendous changes in the ToC representation.

- EIRS/T2 & T3, and more precisely their combination described here as SPM-FTWT, is very promising to overcome EIRS/T1 in CBIR effectiveness (i.e. processing speed, noise tolerance and invariance to accidental images transform outside).

- A common problem to solve (concerning all the EIRS access techniques) − to isolate or to suppress the relatively great level of “artificial” noise in images of PORB practice.

Dimo Dimov, IIT-BAS 17/09/2004

My expose at the Conf.on PDE in AM & IP,Sep. 7-10, 2004, Sunny Beach, Bulgaria 19

37

ConclusionConclusionss• A fast access method for image retrieval by graphic content has been

proposed. The method can be associated to the new informatics area of CBIR.

• The method basic idea is to represent the essential image content as a well-structured DB key of fixed length, namely - the more important image data to take more front positions in the key.

• Three (3) basic techniques has been developed by the method - ToCFT, 2DWT, and 2DFT. The techniques ascend on 3 different types of image shape features.

• An new 2D transform (SPM-FT-WT) is proposed based on a preliminary polar mapping of the image, Fourier transform modula on the angle parameter, and wavelet transform on the distance parameter of polar mapping. The SPM-FT-WT is rotation invariant by definition and preserves the noise tolerance properties characteristic of the FT and WT.

• The experimental image retrieval system (EIRS) has been developed for proving the proposed methods on an IDB of about 4000 images of hallmarks.

38

ConclusionConclusionss (continuation)(continuation)

• The EIRS method can be also considered a recognition method by closeness to examples (centers of classes for recognition).

• The method accents on the most “important” image information based on shape instead of particular features as intensity, color, texture, etc..

• The method is well adapted to the conventional index access methods of conventional DB management systems.

• The method can be applied as image search engine in a conventional DB of images, for instance in information and image retrieval systems for marks, hallmarks, trademarks, postmarks, cultural heritage archiving, etc..

• Besides of “static” images considered herein, also “dynamic” recognition of image sequences, e.g. short video clips, can be performed by similar approaches. Besides 2D-signals (as the images herein), also, 1D- and/or 3D-signal recognition can be developed by analogy to the techniques proposed.

Dimo Dimov, IIT-BAS 17/09/2004

My expose at the Conf.on PDE in AM & IP,Sep. 7-10, 2004, Sunny Beach, Bulgaria 20

39

References:References:EakinsEakins, J.P.: Towards intelligent image retrieval, , J.P.: Towards intelligent image retrieval, Pattern Rec. J.Pattern Rec. J., 35 (2002), pp. 3, 35 (2002), pp. 3--14.14.Eakins, J.P., K. Shields, and J. Boardman: ARTISAN Eakins, J.P., K. Shields, and J. Boardman: ARTISAN –– a shape retrieval system based ona shape retrieval system based on

boundary family indexing, 12p. (via boundary family indexing, 12p. (via http://citeseer.nj.nec.com/cshttp://citeseer.nj.nec.com/cs, keyword ‘CBIR’), keyword ‘CBIR’)DudaDuda, R. and P. Hart: , R. and P. Hart: Pattern Classification and Scene AnalysisPattern Classification and Scene Analysis, A Wiley, A Wiley--Inter science Inter science Publication, 1973. Publication, 1973.

International Hallmarks on Silver, collected by TARDY, Paris, FrInternational Hallmarks on Silver, collected by TARDY, Paris, France, 1981, 429ance, 1981, 429--523.523.PavlidisPavlidis, T.: , T.: Algorithms of Computer Graphics and Image ProcessingAlgorithms of Computer Graphics and Image Processing, Comp. Sci. Press, Inc., , Comp. Sci. Press, Inc., 1982.1982.

Reiss, T. H.: Recognizing Planar Objects Using Invariant Image FReiss, T. H.: Recognizing Planar Objects Using Invariant Image Features. Lecture Notes in eatures. Lecture Notes in Computer Science, Vol. 676. SpringerComputer Science, Vol. 676. Springer--Verlag, Berlin Heidelberg New York (1993).Verlag, Berlin Heidelberg New York (1993).

Smeulders, A.W.M., M. Worring, S. Santini, A. Gupta, and R. JainSmeulders, A.W.M., M. Worring, S. Santini, A. Gupta, and R. Jain: Content: Content--Based Image Based Image Retrieval at the End of the Early Years, Retrieval at the End of the Early Years, IEEE Trans. on PAMIIEEE Trans. on PAMI, 22, 12, (2000), pp. 1349, 22, 12, (2000), pp. 1349--1380.1380.

Sonka, M., V. Hlavac, and R. Boyle: Sonka, M., V. Hlavac, and R. Boyle: Image Processing, Analysis, and Machine VisionImage Processing, Analysis, and Machine Vision, 2d , 2d edition, PWS Publ. at Brooksedition, PWS Publ. at Brooks--Cole Publ. Co, ITP, Pacific Grove, CA, 1998.Cole Publ. Co, ITP, Pacific Grove, CA, 1998.

Stanchev, P.L.: Content Based Image Retrieval Systems, in: Stanchev, P.L.: Content Based Image Retrieval Systems, in: Proceedings ofProceedings ofCompSysTech’2001CompSysTech’2001, June 21, June 21--22, 2001, Sofia, P.1.122, 2001, Sofia, P.1.1--6.6.

Vasconselos N., and M. Kunt: ContentVasconselos N., and M. Kunt: Content--based retrieval from databases: current solutions andbased retrieval from databases: current solutions andfuture directions, in future directions, in Proceedings of ICIP’2001Proceedings of ICIP’2001, Oct. 7, Oct. 7--10, 2001, Thessaloniki, Greece: 10, 2001, Thessaloniki, Greece: IEEE, Vol. 3, pp. 6IEEE, Vol. 3, pp. 6--9.9.

Zaniolo C., S. Ceri, C. Faloutsos, R.T. Snodgrass, V.S. SubrahmaZaniolo C., S. Ceri, C. Faloutsos, R.T. Snodgrass, V.S. Subrahmanian, and R. Zicari: nian, and R. Zicari: Advanced Database SystemsAdvanced Database Systems, Morgan Kaufmann, Inc., San Francisco, CA, 1997., Morgan Kaufmann, Inc., San Francisco, CA, 1997.

Zhang D., and G. Lu: Enhanced Generic Fourier Descriptors for ObZhang D., and G. Lu: Enhanced Generic Fourier Descriptors for Objectject--Based ImageBased ImageRetrieval, Int. Conf. On Acoustic, Speech, and Signal ProcessiRetrieval, Int. Conf. On Acoustic, Speech, and Signal Processing (ICASSP’2002), Mayng (ICASSP’2002), May1313--17, 2002 17, 2002 OrlandoOrlando Florida, USA.Florida, USA.

40

Author publications in the topic:DimovDimov, D.: Fast Retrieval of Images by Graphics Content, , D.: Fast Retrieval of Images by Graphics Content, Cybernetics and Information Cybernetics and Information Technologies, IITTechnologies, IIT--BASBAS, Vol.4, 2, 2004, (to appear)., Vol.4, 2, 2004, (to appear).

DimovDimov, D.: Fast access by content to multimedia databases, in Abstrac, D.: Fast access by content to multimedia databases, in Abstracts of Int. Congress ts of Int. Congress MASSEE’2003, Sept. 15MASSEE’2003, Sept. 15--21, 21, BorovetsBorovets, Bulgaria, 2003, pp.94, Bulgaria, 2003, pp.94--94.94.

DimovDimov, D.: Fast, Shape Based Image Retrieval, Proc. of CompSysTech’20, D.: Fast, Shape Based Image Retrieval, Proc. of CompSysTech’2003, June 1903, June 19--20, 20, Sofia, 2003, pp.3.8.1Sofia, 2003, pp.3.8.1--7, (http://ecet.ecs.ru.acad.bg/cst/Docs/proceedings/S3/III7, (http://ecet.ecs.ru.acad.bg/cst/Docs/proceedings/S3/III--8.pdf)8.pdf)

DimovDimov, D.: Experimental Image Retrieval System, , D.: Experimental Image Retrieval System, Cybernetics and Information Cybernetics and Information Technologies, IITTechnologies, IIT--BASBAS, Vol.2, 2, 2002, pp.120, Vol.2, 2, 2002, pp.120--122.122.

Dimov, D.: Wavelet Transform Application to Fast Search by ConteDimov, D.: Wavelet Transform Application to Fast Search by Content in Database of Images, nt in Database of Images, in Proceedings of Intelligent Systems (IS’2002), Sept. 10in Proceedings of Intelligent Systems (IS’2002), Sept. 10--12, 2002, Varna, Vol.I, pp. 23812, 2002, Varna, Vol.I, pp. 238--243.243.

Acknowledgements:This research is jointly supported by the following grants/funds:(i) the Grant #RC6/2004 (IIT #0210143) of the ICT Development Agency at Bulgarian

Ministry of Transport & Communication,(ii) the Grant #I-1306/2003 (IIT #020057) of the National Science Fund at Bulgarian

Ministry of Education & Science, and (iii) the BAS research funds of IIT disposal (IIT #010056)..

Special thanks to Ognyan Kounchev and Svetozar Margenov (from BAS) for the perfect organization of the Conference as well as to all participants for the fruitful discussion of this work.

Dimo Dimov, IIT-BAS 17/09/2004

My expose at the Conf.on PDE in AM & IP,Sep. 7-10, 2004, Sunny Beach, Bulgaria 21

41

Pic.frame

The way the EIRS (T1) obtains the image contours:The way the EIRS (T1) obtains the image contours:(a way for possible syntactic/semantic extensions)(a way for possible syntactic/semantic extensions)

The following approachesThe following approachesare also possible:are also possible:

via 1via 1stst gradient filter (Sobel gradient filter (Sobel versvers. here). here) via 2d gradient filter (Laplace oper. here)via 2d gradient filter (Laplace oper. here)

Extra page 1:Extra page 1:

42

WT

A nD

n H

A (n-1)WT

D n V D

n D

1+nA

nE

1+nDD

1+nHD

•Lo •1 2

•Hi •1 2

•Lo •1 2

•Hi •1 2

•Lo •2 1

•2 1•Hi

1+nVD

rowscolumns

A

A

A H V D

H V D

A

A

A H V D

H

A H V D

V

A H V D

D

A H V D

0 1

2 3

0

2

1

3

5

10

8

12

4

7

6

14

9

11

13

15

0WT1 WT2

0

0

0 1 2 3

1

4 9 7 11

2

5 8 10 12

3

6 13 14 15

2/D WT decomposition scheme2/D WT decomposition schemeat given level at given level nn

2/D wavelet transform2/D wavelet transforma visual interpretationa visual interpretation

Wavelet chainWavelet chain--tree of 3 levelstree of 3 levels

Wavelet image package of 3 levelsWavelet image package of 3 levels

Possible sorting scheme of the leafs

The way the EIRS (T2) processes via 2DWT:The way the EIRS (T2) processes via 2DWT:Extra page 2:Extra page 2:

Dimo Dimov, IIT-BAS 17/09/2004

My expose at the Conf.on PDE in AM & IP,Sep. 7-10, 2004, Sunny Beach, Bulgaria 22

43

Extra page 3.1:Extra page 3.1:

KK((OO)) -- a value associated to a value associated to OO (an object)(an object)A simple key example: a feature valueA simple key example: a feature valueDB index <DB index <--> ordering the objects by their keys> ordering the objects by their keys, , i.e. a full ordering:i.e. a full ordering:

•O1 •O2 •Oi •Oj •ON•(key values)

AA specific similarity function (specific similarity function (SS))or distance function (or distance function (DD) (usually not a metrics)) (usually not a metrics)to compare given input image (input object)to compare given input image (input object)with all the DB images (objects).with all the DB images (objects).

The above techniques usually lead toThe above techniques usually lead to(1) a partial ordering, i.e. to a sequential access into DB(1) a partial ordering, i.e. to a sequential access into DB,,

of an access time ~ of an access time ~ NN = |IDB|= |IDB|while while the EIRS approach leads tothe EIRS approach leads to

(2) a full ordering, i.e. to a DB primary key index, (2) a full ordering, i.e. to a DB primary key index, of an access time ~ logof an access time ~ log22NN, , NN =|IDB|,=|IDB|,

what is the main advantage of the EIRS approach (!)what is the main advantage of the EIRS approach (!)

The usual CBIR practice:The usual CBIR practice:

The convention for the DB object’s keys:The convention for the DB object’s keys:

44

How the EIRSHow the EIRS approachapproach defines full order (the IDB keys):defines full order (the IDB keys):

where where xi xi ,, i = 1i = 1÷÷ nn are the are the OO--object features ordered by “importance” object features ordered by “importance” (for the current application of EIRS)(for the current application of EIRS)

-- zz--filling curves (a way of scanning the feature space)filling curves (a way of scanning the feature space)-- kk--trees, Rtrees, R--trees, etc. heritage from GIS (Geographictrees, etc. heritage from GIS (Geographic

Information Systems)Information Systems)All they do not match the necessities of an effective (fastAll they do not match the necessities of an effective (fastand noise tolerant) CBIR (!)and noise tolerant) CBIR (!)

Conventional CBIR approaches for accessing an IDB:Conventional CBIR approaches for accessing an IDB:

),...,,(, 211

n

n

i

ini xxxOMx K(O) ≡=∑

=

CBIR DBMS

IP, PR, CV etc. methods

EIRS approach

Similarities among objectsSimilarities among objects::Similarity function Similarity function SS. Distance . Distance DD..

SS((A,BA,B) ~ () ~ (DD((A,BA,B))))--1, where 1, where AA and and BB are given objects.are given objects.MehalanobisMehalanobis distance (Gaussian model of errors over given object), it’s notdistance (Gaussian model of errors over given object), it’s not a metrics.a metrics.Euclidean distance (also often used, because it defines a metricEuclidean distance (also often used, because it defines a metrics)s)

Extra page 3.2:Extra page 3.2:

Dimo Dimov, IIT-BAS 17/09/2004

My expose at the Conf.on PDE in AM & IP,Sep. 7-10, 2004, Sunny Beach, Bulgaria 23

45

Squeezing the feature space dimensionSqueezing the feature space dimension::Allows an easier performance (of a PR system)Allows an easier performance (of a PR system)Simple approaches:Simple approaches:Reordering of the feature space coordinates (a permutation from Reordering of the feature space coordinates (a permutation from nn! possible ones): ! possible ones): Put the most important coord’s at front position and cut the resPut the most important coord’s at front position and cut the rest of “not sot of “not soimportant” coord’s.important” coord’s.More sophisticated approaches:More sophisticated approaches: e.g. Karunene.g. Karunen--Loeve (KL principal components Loeve (KL principal components analysis (?)analysis (?)

-- Searching an object into a DB => Method (or system) forSearching an object into a DB => Method (or system) forPR (Pattern Recognition),PR (Pattern Recognition),

or a DB access methodor a DB access method, , i.e.i.e.-- An object of given DB => A standard object (example)An object of given DB => A standard object (example)

or a center of a classor a center of a class(of given type of objects)(of given type of objects)

The place The place of of EIRSEIRS approachapproach in the CBIR areain the CBIR area ::O ε

Oerr

x1

x2

ji,CjC

niC C O

i

j

n

1jii

÷=⊆∈=

I

U 1,

Extra page 3.3:Extra page 3.3:

46

-- (Approximated) representation of objects by their features.(Approximated) representation of objects by their features.

-- Vector representation (extracted features, e.g. colors, geometrVector representation (extracted features, e.g. colors, geometric measures, ic measures, histograms, frequencies, etc.),histograms, frequencies, etc.),

,, nn –– number of coordinates of number of coordinates of EEnnnnxxxO E∈≡ ),...,,( 21

nnerr eee O O E∈≡+= ),...,,(, 21εε

2},|),(min{) min

nBABADdO D(O, E∈=≤+ ε

-- Noised objects:Noised objects:

-- A sufficient condition for a recoverable noiseA sufficient condition for a recoverable noise

Extra page 3.4:Extra page 3.4: