hierarchical constraint satisfaction in spatial database dimitris papadias, panos kalnis and nikos...

26
Hierarchical Constraint Satisfaction in Spatial Database Dimitris Papadias, Panos Kalnis And Nikos Mamoulis

Post on 20-Dec-2015

227 views

Category:

Documents


5 download

TRANSCRIPT

Page 1: Hierarchical Constraint Satisfaction in Spatial Database Dimitris Papadias, Panos Kalnis And Nikos Mamoulis

Hierarchical Constraint Satisfaction in Spatial Database

Dimitris Papadias, Panos Kalnis And Nikos Mamoulis

Page 2: Hierarchical Constraint Satisfaction in Spatial Database Dimitris Papadias, Panos Kalnis And Nikos Mamoulis

Purpose of the Paper

• Show how systematic and local search make use of hierarchical decomposition of space.

• To efficiently guide search.

• Show conditions when hierarchical constraint satisfaction outperforms traditional methods

Page 3: Hierarchical Constraint Satisfaction in Spatial Database Dimitris Papadias, Panos Kalnis And Nikos Mamoulis

• Helps solves queries in spatial database and geographical information systems.

• For e.x. a user is searching for a residential area that covers a commercial center and the commercial center meets a park.

Hierarchical Constraint Satisfaction

Page 4: Hierarchical Constraint Satisfaction in Spatial Database Dimitris Papadias, Panos Kalnis And Nikos Mamoulis

Introduction

• Content based queries can be modeled as CSP.

• All objects in query variables.

• relation between variables constraints.

• The domain of the variables consists of objects in the database.

• For e.x. find residential areas(v1) that cover commercial centers(v2) that meets a park(v3).

Page 5: Hierarchical Constraint Satisfaction in Spatial Database Dimitris Papadias, Panos Kalnis And Nikos Mamoulis

x1 x2

Disjoint(x1, x2)

x1 x2

Meet(x1, x2)

x1 x2

Overlap(x1, x2)

x2x1

Cover( x1, x2)

Topological Relations

x1 x2

Equal(x1, x2)

x2x1

Contain(x1, x2)

x1x2

Covered-by(x1, x2)

x1x2

Inside(x1, x2)

Page 6: Hierarchical Constraint Satisfaction in Spatial Database Dimitris Papadias, Panos Kalnis And Nikos Mamoulis

Minimum Bounding Rectangles(MBR) are actual area objects on the map the R-tree is built by grouping rectangles at the lower level.

R-trees are used by CSP algorithms to accelerate search.

Page 7: Hierarchical Constraint Satisfaction in Spatial Database Dimitris Papadias, Panos Kalnis And Nikos Mamoulis

R-tree• Are an extension of B+-trees to many dimensions.• B+-trees is a balanced search tree which maintains

an ordered set of data and in which the keys are stored in a the leaves

• Is a height Balanced Tree that consists of intermediate and leaf nodes corresponding to disk in secondary memory.

• If h is the height of the tree the root is at level h-1 and the leaf is at the level 0.

• The intermediate levels are built by grouping rectangles at lower level.

• There is a R-tree for each type of object.

Page 8: Hierarchical Constraint Satisfaction in Spatial Database Dimitris Papadias, Panos Kalnis And Nikos Mamoulis

R-tree

Page 9: Hierarchical Constraint Satisfaction in Spatial Database Dimitris Papadias, Panos Kalnis And Nikos Mamoulis

R-tree

• Used for window queries.• Two Steps are involved.• Filter Step – retrieve a set of candidates that

includes all the results and some false hits.• Refinement Step – each candidate is examined

and false hits are eliminated.• The method can be extended for topological

relations.

Page 10: Hierarchical Constraint Satisfaction in Spatial Database Dimitris Papadias, Panos Kalnis And Nikos Mamoulis

R-tree

Page 11: Hierarchical Constraint Satisfaction in Spatial Database Dimitris Papadias, Panos Kalnis And Nikos Mamoulis

R-trees Join (RTJ)• The most influential algorithm for processing

intersection joins using R-trees.

• Based on enclosure property.

• Like window queries, in order to process arbitrary topological relations using RTJ we need to define conditions for intermediate nodes.

• The problem is viewed as a multi-way spatial join and processed by computing the result of one pair-wise join and joining the result with v3.

Page 12: Hierarchical Constraint Satisfaction in Spatial Database Dimitris Papadias, Panos Kalnis And Nikos Mamoulis

Hierarchical CSPs using R-trees• A set of variables v1,v2,v3…vn.• Domain di for variable vi is

for level 0 : {xi,1,…… xi,ci}

for level 1 to h-1:{Xi,1,…… Xi,ci}• For each pair of variables the binary constraint is for level 0 : Cij is a disjunction of topological

relations as specified by the query. for level 1 to h-1: Cij is derived by replacing each

relation in Cij by the corresponding condition for intermediate nodes in Table 2.

Page 13: Hierarchical Constraint Satisfaction in Spatial Database Dimitris Papadias, Panos Kalnis And Nikos Mamoulis

Table 2

Page 14: Hierarchical Constraint Satisfaction in Spatial Database Dimitris Papadias, Panos Kalnis And Nikos Mamoulis

• Two preprocessing heuristic

– Space Restriction.

– Path Consistency

• Space Restriction – scans the domains of all variables, removing the entries that cannot satisfy the query constraints given their positions w.r.t. to other nodes.

• Path Consistency – is a form of semantic query optimization to discard inconsistent queries.

Hierarchical CSPs using R-trees

Page 15: Hierarchical Constraint Satisfaction in Spatial Database Dimitris Papadias, Panos Kalnis And Nikos Mamoulis

• Using systematic and local search algorithms there are three cases :– Hierarchical systematic search.

– Hierarchical local search.

– Hierarchical local/systematic search.

Hierarchical CSPs using R-trees

Page 16: Hierarchical Constraint Satisfaction in Spatial Database Dimitris Papadias, Panos Kalnis And Nikos Mamoulis

Experiments

• Problems were randomly generated by modifying the paramters n,m,p1, p2.

• n = number of variables.

• m = size of datasets

• p1 = is the probability that a random pair of variables is constrained (network density).

• p2 = is the probability that assignment for a constrained pair is inconsistent (tightness).

Page 17: Hierarchical Constraint Satisfaction in Spatial Database Dimitris Papadias, Panos Kalnis And Nikos Mamoulis

• Typical values takes for the problem

• m = 104

• |x| = .0045 d .2( typical value for real datasets)

• h=3

• C=50-200

Using these values problems were randomly

generated.

Page 18: Hierarchical Constraint Satisfaction in Spatial Database Dimitris Papadias, Panos Kalnis And Nikos Mamoulis

Hierarchical Systematic Search

• Using Forward Checking with fail first dynamic variable ordering heuristic.

• 50 randomly generated problems.

• P1 = 1.

• n=5.

• m=104

D 0.2

Page 19: Hierarchical Constraint Satisfaction in Spatial Database Dimitris Papadias, Panos Kalnis And Nikos Mamoulis

Hierarchical Systematic Search

• Two searches were used – Forward Checking (FC)– Hierarchical FC (H-FC)

• Three types of problems were tested – Varying P2 without disjoint– Varying P2 with disjoint– Varying n with one solution

Page 20: Hierarchical Constraint Satisfaction in Spatial Database Dimitris Papadias, Panos Kalnis And Nikos Mamoulis

Results

• Varying P2 without disjoint

– H-FC outperforms FC by two orders of magnitude.

• Varying P2 with disjoint– For dense graphs the H-FC outperforms FC by two

orders of magnitude.– As tightness decreases the performance converges.

Hierarchical Systematic Search

Page 21: Hierarchical Constraint Satisfaction in Spatial Database Dimitris Papadias, Panos Kalnis And Nikos Mamoulis

Results• Varying n with one solution

– The performances converges as the number of variables increases.

– For n>25 FC outperforms H-FC.

Hierarchical Systematic Search

Page 22: Hierarchical Constraint Satisfaction in Spatial Database Dimitris Papadias, Panos Kalnis And Nikos Mamoulis

• Local search used is Hill Climbing with min-conflicts(MC) heuristic.

• Following Variations used – Flat MC

– Hierarchical uninformed MC (HU-MC)

– Hierarchical informed MC (HI-MC)

– Hierarchical root MC (HR-MC)

– Hierarchical root MC/FC (HR-MC/FC)

Hierarchical Local Search

Page 23: Hierarchical Constraint Satisfaction in Spatial Database Dimitris Papadias, Panos Kalnis And Nikos Mamoulis

• Problems created with the parameters– n=5

– m=103, 104, 105.

• All algorithms were executed 10 times for every setting.

• Their execution was terminated is solution could not be obtained after 109 checks.

Hierarchical Local Search

Page 24: Hierarchical Constraint Satisfaction in Spatial Database Dimitris Papadias, Panos Kalnis And Nikos Mamoulis

Results

• HR-MC outperforms HU-MC by at least one order of magnitude.

• HI-MC’s performance is between HR-MC and HU-MC.

• When m= 103 MC is better than hierarchical local search

• When m= 105 HR-MC outperforms MC by one order of magnitude.

Hierarchical Local Search

Page 25: Hierarchical Constraint Satisfaction in Spatial Database Dimitris Papadias, Panos Kalnis And Nikos Mamoulis

Results

• Due to large number of sultions at the upper level hierarchical local search succeeds fast but spends more time trying to find a soultion at the leaf level this motivated the replacement of MC at leaf level with FC

• For larger domains HR-MC/FC outperforms HR-MC by almost an order of magnitude.

Hierarchical Local Search

Page 26: Hierarchical Constraint Satisfaction in Spatial Database Dimitris Papadias, Panos Kalnis And Nikos Mamoulis

Conclusion• Provides a methodology for hierarchical

constraint satisfaction in spatial database using R-trees.

• Systematic search is significantly faster in the case of hierarchical CSPs for m 104 and n10

• Hierarchical local search is better for very large domains.

• Provides hints to improve performance.