an approach for service oriented discovery and retrieval...

43
IW-SOSE 2006 ICSE 2006, Shanghai An Approach for Service Oriented Discovery and Retrieval of Spatial Data Authors: Manoj Paul, S.K. Ghosh Dr. S. K. Ghosh School of Information Technology Indian Institute of Technology, Kharagpur, India [email protected]

Upload: others

Post on 14-Jul-2020

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: An Approach for Service Oriented Discovery and Retrieval ...conferenze.dei.polimi.it/SOSE06/presentations/Ghosh.pdf · Geographic information domain is currently characterized by

IW-SOSE 2006ICSE 2006, Shanghai

An Approach for Service Oriented Discovery and Retrieval of Spatial Data

Authors: Manoj Paul, S.K. Ghosh

Dr. S. K. GhoshSchool of Information Technology

Indian Institute of Technology, Kharagpur, [email protected]

Page 2: An Approach for Service Oriented Discovery and Retrieval ...conferenze.dei.polimi.it/SOSE06/presentations/Ghosh.pdf · Geographic information domain is currently characterized by

May 28, 2006 S.K. Ghosh, India 2

OutlineSpatial Data – Importance

Spatial Data Infrastructure

General WS vs. SWSSOA

UDDI and Ontology

System Architecture: Service Discovery and Retrieval

Page 3: An Approach for Service Oriented Discovery and Retrieval ...conferenze.dei.polimi.it/SOSE06/presentations/Ghosh.pdf · Geographic information domain is currently characterized by

May 28, 2006 S.K. Ghosh, India 3

Spatial Data

Refers to information that identifies the geographic location of natural or constructed features and boundaries on the earth

Also known as geospatial data or geographic information

Key information for effective planning and decision-making in a variety of Decision Support Systems (DSS)

Page 4: An Approach for Service Oriented Discovery and Retrieval ...conferenze.dei.polimi.it/SOSE06/presentations/Ghosh.pdf · Geographic information domain is currently characterized by

May 28, 2006 S.K. Ghosh, India 4

Spatial Data

Types of Spatial Data:Types of Spatial Data:

•Geospatial data– Polygons– Points– Lines– Images/Grid

•Socioeconomic data– County/Province statistics– Census data– Social surveys

Spatial Data Sources:Spatial Data Sources:

•Geographic data (polygons, points and lines)•Arc/Info data•Shape files (*.shp, *.shx, and *.dbf)•Grid•Image data (ERDAS Image, JPEG, TIFF, BMP and Arc/Info Image)•Tabular data (dBASE, INFO and TEXT)•SQL•SDE (Spatial Data Engine)

Page 5: An Approach for Service Oriented Discovery and Retrieval ...conferenze.dei.polimi.it/SOSE06/presentations/Ghosh.pdf · Geographic information domain is currently characterized by

May 28, 2006 S.K. Ghosh, India 5

Spatial Data: Characteristics

Spatial Data

ProviderGeographic Information

System

OrthogonalData of

Different Category

Heterogeneous An obstacle for

Data Sharing

Page 6: An Approach for Service Oriented Discovery and Retrieval ...conferenze.dei.polimi.it/SOSE06/presentations/Ghosh.pdf · Geographic information domain is currently characterized by

May 28, 2006 S.K. Ghosh, India 6

Spatial Data: IntegrationThere is demand for GIS in large computational environments, with a large number of users, spread over numerous locations

Sharing large volumes of spatial data is an important task

GIS are no longer simple departmental toolsCorporate toolsGovernmental toolsSociety-oriented applications

Page 7: An Approach for Service Oriented Discovery and Retrieval ...conferenze.dei.polimi.it/SOSE06/presentations/Ghosh.pdf · Geographic information domain is currently characterized by

May 28, 2006 S.K. Ghosh, India 7

Spatial Data: IntegrationFactors influencing sharing:

The fast development of networking technology and the Internet

The success of Web-based GIS

The wide applicability of mobile and ubiquitous computing

ChallengeHeterogeneity (system, modeling and schematic) as well as need to support autonomy posed main challenges; major issues were data access and connectivity

Page 8: An Approach for Service Oriented Discovery and Retrieval ...conferenze.dei.polimi.it/SOSE06/presentations/Ghosh.pdf · Geographic information domain is currently characterized by

May 28, 2006 S.K. Ghosh, India 8

Enterprise-GIS (E-GIS)Growing demands for sharing of spatial data at the enterprise level

Enterprise GIS (E-GIS)

An enterprise GIS is an integrated, multi-departmental system composed of interoperable components

It provides broad access to geospatial data, a common infrastructure upon which to build and deploy GIS applications, and significant economies of scale

Page 9: An Approach for Service Oriented Discovery and Retrieval ...conferenze.dei.polimi.it/SOSE06/presentations/Ghosh.pdf · Geographic information domain is currently characterized by

May 28, 2006 S.K. Ghosh, India 9

E-GIS

Browsers Clients Web Services

ApplicationServer

GISServer

SpatialDBMS

Tabular DBMS File Data

Page 10: An Approach for Service Oriented Discovery and Retrieval ...conferenze.dei.polimi.it/SOSE06/presentations/Ghosh.pdf · Geographic information domain is currently characterized by

May 28, 2006 S.K. Ghosh, India 10

E-GIS

Political Data – Boundaries – Settlements

Local attributes– Climate– Culture– Education– Languages– Agriculture– Business

Geographical data– River– Roads– Elevation

Remotely sensed data– Images– Grid

Statistical data–Socioeconomic data–Survey data–Census data

Page 11: An Approach for Service Oriented Discovery and Retrieval ...conferenze.dei.polimi.it/SOSE06/presentations/Ghosh.pdf · Geographic information domain is currently characterized by

May 28, 2006 S.K. Ghosh, India 11

Integration: IssuesGISs uses proprietary mechanisms for data storage and access

Interoperability problems arise while integrating these diverse spatial data sources

Geographic resources are designed for a variety of different purposes

Orthogonal directions in the design of geographic resources may affect the semantics of the data

Page 12: An Approach for Service Oriented Discovery and Retrieval ...conferenze.dei.polimi.it/SOSE06/presentations/Ghosh.pdf · Geographic information domain is currently characterized by

May 28, 2006 S.K. Ghosh, India 12

Integration: Issues

Geographic information domain is currently characterized by a paradigm shift

from monolithic systems to open and distributed GIS and their use processes.

A move from standardized data formats to specifications of geographic information (GI) service interfaces

The number of GI services available on the web is rapidly and continually increasing

Page 13: An Approach for Service Oriented Discovery and Retrieval ...conferenze.dei.polimi.it/SOSE06/presentations/Ghosh.pdf · Geographic information domain is currently characterized by

May 28, 2006 S.K. Ghosh, India 13

Integration: Issues

Till date geospatial data sharing and exchange among several organizations is hard to achieve

It involves atleastA lot of political negotiationAgreements on standardsAgreements on costs and cost sharing Agreements on maintenance....

Page 14: An Approach for Service Oriented Discovery and Retrieval ...conferenze.dei.polimi.it/SOSE06/presentations/Ghosh.pdf · Geographic information domain is currently characterized by

May 28, 2006 S.K. Ghosh, India 14

Spatial Data Infrastructures

An approach for integration of Spatial data sources at the national level

Infrastructure implies that there should be some sort of coordination for policy formulation and implementation

Definition (NSDI): “Technologies, policies, and people necessary to promote sharing of geospatial data through all levels of government, the private and non-profit sectors, and the academic community”

Page 15: An Approach for Service Oriented Discovery and Retrieval ...conferenze.dei.polimi.it/SOSE06/presentations/Ghosh.pdf · Geographic information domain is currently characterized by

May 28, 2006 S.K. Ghosh, India 15

Spatial Data InfrastructuresSDIs should

Provide standardized access to data

Have multiple participants, in the role of information services providers and/or users

Have a broad thematic scope

Facilitate data sharing

Page 16: An Approach for Service Oriented Discovery and Retrieval ...conferenze.dei.polimi.it/SOSE06/presentations/Ghosh.pdf · Geographic information domain is currently characterized by

May 28, 2006 S.K. Ghosh, India 16

General Web Services

WSDL Document

WSDL DocumentUDDI

Registry

WebService

Client

2 1. Client queries registry to locate service.

2. Registry refers client to WSDL document.

3. Client accesses WSDL document.

4. WSDL provides data to interact with Web service.

5. Client sends SOAP-message request.

6. Web service returns SOAP-message response.

3

4

5

1

6

Page 17: An Approach for Service Oriented Discovery and Retrieval ...conferenze.dei.polimi.it/SOSE06/presentations/Ghosh.pdf · Geographic information domain is currently characterized by

May 28, 2006 S.K. Ghosh, India 17

SOA

Web Service User

Web Service Provider

Web Services Directory

Discover / Select

Bind

Publish

Page 18: An Approach for Service Oriented Discovery and Retrieval ...conferenze.dei.polimi.it/SOSE06/presentations/Ghosh.pdf · Geographic information domain is currently characterized by

May 28, 2006 S.K. Ghosh, India 18

Geospatial ServicesGeospatial data can be interfaced in the web as Services for better accessibility among users

Geospatial data can be accessed in real-time over Internet as if it were local dataTimeliness of the data is preserved

Multiple organizations can benefit from the data being staged and maintained once and used many times

SDI can be built over the these Spatial Web Services (SWS)

Page 19: An Approach for Service Oriented Discovery and Retrieval ...conferenze.dei.polimi.it/SOSE06/presentations/Ghosh.pdf · Geographic information domain is currently characterized by

May 28, 2006 S.K. Ghosh, India 19

HumanUser

Geoportal

Search Discovery

GI Service Provider

Dat

are

spon

se

Ser

vice

s re

ques

t

Ser

vice

s re

ques

t

Dat

a re

spon

se

SDI

Pub

lish

met

adat

aServices and DataCatalog

Catalog Web

Service

Software Client

Dat

are

spon

seS

ervi

ces

requ

est

Key-word based Search

Courtesy: PUC Minas

Page 20: An Approach for Service Oriented Discovery and Retrieval ...conferenze.dei.polimi.it/SOSE06/presentations/Ghosh.pdf · Geographic information domain is currently characterized by

May 28, 2006 S.K. Ghosh, India 20

Standards

Dimensions of interoperability for earth science data

use

acce

ss

discovery

ISO / OGC

Page 21: An Approach for Service Oriented Discovery and Retrieval ...conferenze.dei.polimi.it/SOSE06/presentations/Ghosh.pdf · Geographic information domain is currently characterized by

May 28, 2006 S.K. Ghosh, India 21

Spatial Web Services - OGC

Open Geospatial Consortium (OGC):International consortium of nearly 300 companies, government agencies and universities participating in a consensus process to develop publicly available geo-processing specifications

OGC web servicesWeb Map ServiceWeb Feature ServiceWeb Coverage Service

Page 22: An Approach for Service Oriented Discovery and Retrieval ...conferenze.dei.polimi.it/SOSE06/presentations/Ghosh.pdf · Geographic information domain is currently characterized by

May 28, 2006 S.K. Ghosh, India 22

OGC: WMS

Web Map Service (WMS)Three operations:

GetCapabilitiesGetMapGetFeatureInfo

GetCapabilities request<Service> metadata for service as a whole (ISO 19115 compliant)<Capability> metadata describes request bindings, exceptions, and:<Layer>s: title, name, SRS, bounding box, keywords, style; nested layers inherit from parent

Page 23: An Approach for Service Oriented Discovery and Retrieval ...conferenze.dei.polimi.it/SOSE06/presentations/Ghosh.pdf · Geographic information domain is currently characterized by

May 28, 2006 S.K. Ghosh, India 23

OGC: WFSWeb Feature Service (WFS)

Provides access to geographic feature instances

Features are defined by application schemacompliant with the Geography Markup Language (GML)

Operations:DescribeFeatureType: Returns XML schema for the featureGetFeature: Allows retrieval of features in XML compliant to the feature schema. Selected features may be constrained through Xpath expressionsGetCapabilities: describes capabilities of WFS (as for WMS)

Page 24: An Approach for Service Oriented Discovery and Retrieval ...conferenze.dei.polimi.it/SOSE06/presentations/Ghosh.pdf · Geographic information domain is currently characterized by

May 28, 2006 S.K. Ghosh, India 24

OGC Web ServicesWeb Map Service (WMS, ISO 19128)

GetCapabilitiesGetMap

Web Feature Service (WFS)GetCapabilitiesDescribeFeatureTypeGetFeatureTransaction, LockFeature

Web Coverage Service (WCS)GetCapabilitiesDescribeCoverageGetCoverage

Catalog Service for the Web (CSW) – new!

Page 25: An Approach for Service Oriented Discovery and Retrieval ...conferenze.dei.polimi.it/SOSE06/presentations/Ghosh.pdf · Geographic information domain is currently characterized by

May 28, 2006 S.K. Ghosh, India 25

OGC Web Services

Page 26: An Approach for Service Oriented Discovery and Retrieval ...conferenze.dei.polimi.it/SOSE06/presentations/Ghosh.pdf · Geographic information domain is currently characterized by

May 28, 2006 S.K. Ghosh, India 26

SWS vs WS

SWS do not necessarily use the usual W3C standards, including SOAP and WSDL

Instead of UDDI, OWS propose the use of catalog services

SWS have a particular interface for binding

SWS use GML, and not plain XML

Focus on Service Discovery

Page 27: An Approach for Service Oriented Discovery and Retrieval ...conferenze.dei.polimi.it/SOSE06/presentations/Ghosh.pdf · Geographic information domain is currently characterized by

May 28, 2006 S.K. Ghosh, India 27

UDDIRegistry for services where a service requester can browse through available services

Service advertisement in the form of XML-Schema

InputsOutputsOperation

Page 28: An Approach for Service Oriented Discovery and Retrieval ...conferenze.dei.polimi.it/SOSE06/presentations/Ghosh.pdf · Geographic information domain is currently characterized by

May 28, 2006 S.K. Ghosh, India 28

UDDI Issues in GI Services

UDDI introduces keyword-based retrieval mechanism. It does not allow advanced metadata-oriented query capabilities on the registry.

UDDI does not take into account the volatile behavior of services and supports only quasi-static service registries.

Since UDDI is domain-independent, it does not provide domain-specific query capabilities in particular for GIS domain such as spatial queries.

Page 29: An Approach for Service Oriented Discovery and Retrieval ...conferenze.dei.polimi.it/SOSE06/presentations/Ghosh.pdf · Geographic information domain is currently characterized by

May 28, 2006 S.K. Ghosh, India 29

Enhanced SOAOntologyService

Registry

ServiceProvider

Service Requester

Semantic Service

Description

Service Discovery Request

Requested Service WSDL

Service Interface Description

Service Request

Service Response

Page 30: An Approach for Service Oriented Discovery and Retrieval ...conferenze.dei.polimi.it/SOSE06/presentations/Ghosh.pdf · Geographic information domain is currently characterized by

May 28, 2006 S.K. Ghosh, India 30

OGC CS: Semantic IssuesPresently, capabilities of GI web services are made available in OGC catalogues using standardized metadata templates

Employs Text-based search method for the discovery of service

Problems can occur if requesters and providers of web services use different terms for referring to the same or similar concepts or if they use the same term for different conceptsSemantic heterogeneity problem

Keyword-based service discovery approaches cannot completely capture the semantics of the user’s query

Page 31: An Approach for Service Oriented Discovery and Retrieval ...conferenze.dei.polimi.it/SOSE06/presentations/Ghosh.pdf · Geographic information domain is currently characterized by

May 28, 2006 S.K. Ghosh, India 31

Semantic Issues: GI Ontology

An envisioned approach to overcome these limitations is to use ontology-based service discovery

Ontologies are used for classification of the services based on their properties

This enables retrieval based on service types rather than keywords

Page 32: An Approach for Service Oriented Discovery and Retrieval ...conferenze.dei.polimi.it/SOSE06/presentations/Ghosh.pdf · Geographic information domain is currently characterized by

May 28, 2006 S.K. Ghosh, India 32

GI Ontology: Structure

Hybrid ontology description

Application Ontology: The semantics of each source is described by its own ontology

Domain Ontology: Global shared vocabulary for the domain of concepts

Page 33: An Approach for Service Oriented Discovery and Retrieval ...conferenze.dei.polimi.it/SOSE06/presentations/Ghosh.pdf · Geographic information domain is currently characterized by

May 28, 2006 S.K. Ghosh, India 33

Ontology: Structure

Hybrid Ontology Structure

Page 34: An Approach for Service Oriented Discovery and Retrieval ...conferenze.dei.polimi.it/SOSE06/presentations/Ghosh.pdf · Geographic information domain is currently characterized by

May 28, 2006 S.K. Ghosh, India 34

System Architecture

Page 35: An Approach for Service Oriented Discovery and Retrieval ...conferenze.dei.polimi.it/SOSE06/presentations/Ghosh.pdf · Geographic information domain is currently characterized by

May 28, 2006 S.K. Ghosh, India 35

GI Service Discovery

Service requests are matched with the service advertisements in Service Registry

Instead of Direct Key-word search, semantic search is employed

The matching process goes through two stepsDomain Dependent MatchingService Specific Matching

Page 36: An Approach for Service Oriented Discovery and Retrieval ...conferenze.dei.polimi.it/SOSE06/presentations/Ghosh.pdf · Geographic information domain is currently characterized by

May 28, 2006 S.K. Ghosh, India 36

Domain Dependent Matching

Uses Domain Ontology for Matchmaking

Categorizes service request based on the request parameters

Doesn’t involve any particular service advertisements

Page 37: An Approach for Service Oriented Discovery and Retrieval ...conferenze.dei.polimi.it/SOSE06/presentations/Ghosh.pdf · Geographic information domain is currently characterized by

May 28, 2006 S.K. Ghosh, India 37

Domain Dependent Matching: Example

User Requested Service:

GetAirportLocation

Catalog based searching employs searching all the service advertisement

Service Discovery gets slowed due to large number of available servicesLow Recall/Precision

Ontology based searching categorizes it to be of category

DistanceMeasurement

Page 38: An Approach for Service Oriented Discovery and Retrieval ...conferenze.dei.polimi.it/SOSE06/presentations/Ghosh.pdf · Geographic information domain is currently characterized by

May 28, 2006 S.K. Ghosh, India 38

Domain Dependent Matching: Example

Matchmaker

Domain Ontology

Service Registry

Service

Requester

GetAirportLocation DistanceMeasurement

Page 39: An Approach for Service Oriented Discovery and Retrieval ...conferenze.dei.polimi.it/SOSE06/presentations/Ghosh.pdf · Geographic information domain is currently characterized by

May 28, 2006 S.K. Ghosh, India 39

GI Service Specific Matching

Uses service specific Application ontology for further mapping

Service advertisements are matched forInputs: AirportName, CityLocationOutputs: Distance

The matching utilizes Application Ontology for semantic matching

Increased Recall/Precision

Page 40: An Approach for Service Oriented Discovery and Retrieval ...conferenze.dei.polimi.it/SOSE06/presentations/Ghosh.pdf · Geographic information domain is currently characterized by

May 28, 2006 S.K. Ghosh, India 40

Service Specific Matching

Matchmaker

Application Ontology

MeasurementService

Advertisements

Service Request

Matched ServicesInputs/Outputs

Page 41: An Approach for Service Oriented Discovery and Retrieval ...conferenze.dei.polimi.it/SOSE06/presentations/Ghosh.pdf · Geographic information domain is currently characterized by

May 28, 2006 S.K. Ghosh, India 41

Query Processor

Page 42: An Approach for Service Oriented Discovery and Retrieval ...conferenze.dei.polimi.it/SOSE06/presentations/Ghosh.pdf · Geographic information domain is currently characterized by

May 28, 2006 S.K. Ghosh, India 42

Conclusion

An Integration Platform for Spatial data through

Providing spatial information as servicesService Oriented ArchitectureAdhering to OGC standards

Heterogeneity issues in spatial information has been addressed through

Ontological description for the domainSemantic based service discovery

Standard Query processing methodology

Page 43: An Approach for Service Oriented Discovery and Retrieval ...conferenze.dei.polimi.it/SOSE06/presentations/Ghosh.pdf · Geographic information domain is currently characterized by

May 28, 2006 S.K. Ghosh, India 43

Thank You !!!

[email protected]@iitkgp.ac.in