Semantic Integration of Relational Data Sources with Topic Maps
DESCRIPTION
Data integration of heterogeneous data sources plays a major role in the development of modern knowledge management systems. Additional enrichment of this data with the use of ontologies opens up completely new possibilities in leveraging the use of semantic technologies, and combining information from existing information systems. This paper presents the architecture and prototype implementation of a semantic integration layer for transparent access to relational data sources through the use of Topic Maps.
TRANSCRIPT
TMRA’09, Leipzig
Semantic Integration of Relational Data Sources with Topic Maps
Use case providers:
Rani Pinchuk, Thomas Neidhart, Bernard Valentin
The research was done within the SATOPI Project, a co-funded activity with the European Space Agency (ESA Contract N°: 21520/08/I/OL)
The Problem
• We have more and more data.
• The data is spread across heterogeneous data sources.
• The data is not self-explanatory.
• Copying the data to a better-organized data store is not an option.
The Vision
Create a Semantic Integration Layer that provides:
• semantic annotation of existing data
• a “live” view of data stored in heterogeneous data sources
• a merging of multiple data sources in a clean and understandable way
• a common interface to different types of data
The Use Case – GLOFs in the Himalayas
• Glacial Lake Outburst Floods (GLOFs) happen when glaciers melt and glacial lakes form next to them.
• The glacial lakes are usually dammed by a natural dam made partly of ice. These dams are unstable.
• When the water level rises, the ice in the dam floats, causing the dam to break.
• The resulting floods can be very intense. For example, in a GLOF event in 1985, the water from Dig Tsho Lake went downhill in a flood that lasted 4-6 hours, with a flow of 1600 to 2350 cubic meters per second.
The Use Case – GLOFs in the Himalayas
• GLOF events depend on many factors, such as weather, topography, and the characteristics of the glaciers, the glacial lake, and its dam.
• To identify precursors of a GLOF event, the researcher has to collect data from different sources:
– Different remote Earth Observation sensors, each of which might have its own interface
– Data already collected about the glacial lake and similar glacial lakes
– Weather data, etc.
The process of accessing and collecting the data slows down the research.
Why Topic Maps?
Merging Data
Reusing Data
Processing Data
Navigating Data
Accessing Data
Topic Maps versus The Semantic Web
• Topic Maps technology can represent knowledge in an informal, natural way: the way we humans grasp knowledge.
• RDF/OWL is designed to be processed by machines and is therefore much more formal, more machine-friendly, and less human-friendly.
The Architecture (1)
• The client accesses the data through a Topic Maps interface, using TMQL or TMAPI.
• The Topic Maps Engine uses the Ontology Definition and the Mapping Definition files to determine where the data should be retrieved from.
• If the data is located in external data stores, the Topic Maps Engine retrieves it from there.
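The lookup described on this slide can be illustrated with a minimal Python sketch. It is not the SATOPI implementation: the `MAPPING` dictionary, the `LOCAL_TOPICS` dictionary, the helper names, and the sample rows are all hypothetical stand-ins, and an in-memory SQLite database plays the role of the external relational data store.

```python
import sqlite3

# Hypothetical mapping definition: topic type -> (table, column holding the name).
MAPPING = {"glacier": ("gka", "NAME")}

# Hypothetical topics held directly in the topic map (no external data store).
LOCAL_TOPICS = {"country": ["Country X"]}


def make_datastore() -> sqlite3.Connection:
    """Create a stand-in external relational data store with placeholder rows."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE gka (NAME TEXT)")
    conn.executemany("INSERT INTO gka VALUES (?)", [("Glacier A",), ("Glacier B",)])
    return conn


def topic_names(topic_type: str, conn: sqlite3.Connection) -> list[str]:
    """Return the names of all topics of a type, wherever they are stored."""
    if topic_type in MAPPING:
        # The mapping says this type lives in the external data store.
        table, column = MAPPING[topic_type]  # trusted configuration, not user input
        rows = conn.execute(f"SELECT {column} FROM {table} ORDER BY {column}")
        return [row[0] for row in rows]
    # Otherwise the topics are served from the in-memory topic map.
    return list(LOCAL_TOPICS.get(topic_type, []))


conn = make_datastore()
glaciers = topic_names("glacier", conn)
countries = topic_names("country", conn)
```

The caller asks for topics by type and never sees whether they came from the database or from the topic map itself, which is the transparency the slide describes.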
The Architecture (2)
• The Ontology Definition is an ordinary topic map containing well-defined topics and types; it reflects the application logic.
• For each attached data source, a separate connector is defined (with a mapping definition file).
• Additionally, an arbitrary number of topic maps can be loaded.
• Associations can be defined between any combination of attached connectors and pre-loaded topic maps.
• The final topic map visible to the user through the available interfaces is the sum of all attached connectors, loaded topic maps, and their defined associations.
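As a rough illustration of the "sum" described above, the following Python sketch combines connector topics, pre-loaded topic maps, and associations into one view. The class `MergedView`, its fields, and the sample data are assumptions made for this example; the slides do not show how the engine actually merges its sources.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Tuple

Topic = Dict[str, str]


@dataclass
class MergedView:
    """Hypothetical union of connectors, loaded topic maps and associations."""
    connectors: List[Callable[[], List[Topic]]] = field(default_factory=list)
    loaded_maps: List[Dict[str, Topic]] = field(default_factory=list)
    associations: List[Tuple[str, str, str]] = field(default_factory=list)

    def topics(self) -> Dict[str, Topic]:
        """Merge all topics by id; topics sharing an id are combined."""
        merged: Dict[str, Topic] = {}
        for topic_map in self.loaded_maps:
            for topic_id, topic in topic_map.items():
                merged.setdefault(topic_id, {}).update(topic)
        for connector in self.connectors:
            for topic in connector():
                merged.setdefault(topic["id"], {}).update(topic)
        return merged

    def related(self, topic_id: str) -> List[Tuple[str, str]]:
        """Return (association type, other topic id) pairs for a topic."""
        result = []
        for assoc_type, left, right in self.associations:
            if left == topic_id:
                result.append((assoc_type, right))
            elif right == topic_id:
                result.append((assoc_type, left))
        return result


def glacier_connector() -> List[Topic]:
    """Stand-in for a data store connector; returns placeholder topics."""
    return [{"id": "glacier-1", "name": "Glacier A"}]


view = MergedView(
    connectors=[glacier_connector],
    loaded_maps=[{"country-x": {"id": "country-x", "name": "Country X"}}],
    associations=[("is-located-in", "glacier-1", "country-x")],
)
```

Here the association links a topic delivered by a connector to a topic from a pre-loaded topic map, which is one of the combinations the slide allows.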
The Ontology (1)
The Ontology (2)
The Datastore Connectors
Ontology (XTM)
  <topicMap xmlns="http://www.topicmaps.org/xtm/" version="2.0">
    <itemIdentity href="http://spaceapps.com/satopi/tm" />
    <topic id="glacier">
      <name><value>Glacier</value></name>
      <occurrence>
        <type> <topicRef href="#local-id" /> </type>
        <resourceData datatype="string" />
      </occurrence>
Mapping Definition (CTM Template)
  %ctm 1.0
  %prefix database <glaciers.db>
  %prefix tablename <gka>
  topic - "${NAME}";
    isa glacier;
    ^ ${TMID};
    local-id: "${LOCAL_ID}";
    area: "${AREA_KM2}" .
  is-located-in(location: topic, host: "${COUNTRY_TMID}")
[Diagram components: Query, TMQL, TMAPI, TopiEngine, TMAPI, DatastoreConnector, Datastore]
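One way to read the CTM template on this slide is that each `${...}` placeholder names a column of the mapped table, so that every row yields one topic. The Python sketch below works under that assumption; it is not the TopiEngine connector itself. The table `gka` and the column names come from the template, while the function `expand_rows` and the sample row are invented for illustration.

```python
import sqlite3
from string import Template

# Body of the mapping template from the slide; the %ctm/%prefix header is left out.
CTM_TEMPLATE = Template(
    'topic - "${NAME}";\n'
    "  isa glacier;\n"
    "  ^ ${TMID};\n"
    '  local-id: "${LOCAL_ID}";\n'
    '  area: "${AREA_KM2}" .\n'
    'is-located-in(location: topic, host: "${COUNTRY_TMID}")\n'
)


def expand_rows(conn: sqlite3.Connection, table: str) -> list[str]:
    """Fill the template once per table row (assumes placeholders are column names)."""
    conn.row_factory = sqlite3.Row  # rows become addressable by column name
    rows = conn.execute(f"SELECT * FROM {table}")  # table name is trusted config
    return [CTM_TEMPLATE.substitute(dict(row)) for row in rows]


# Stand-in for glaciers.db with a single placeholder row (not real glacier data).
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE gka (NAME TEXT, TMID TEXT, LOCAL_ID TEXT, "
    "AREA_KM2 REAL, COUNTRY_TMID TEXT)"
)
conn.execute(
    "INSERT INTO gka VALUES ('Glacier A', 'glacier-a', 'G001', 1.5, 'country-x')"
)

fragments = expand_rows(conn, "gka")
print(fragments[0])
```

`string.Template` is used only because its `${NAME}` syntax happens to match the placeholders in the template; a real connector would more likely produce topics lazily on request rather than generate CTM text.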
The User Interface: Search Pages
The User Interface: Search Result Lists
The User Interface: Description Pages
Conclusion
• Topic Maps is a useful data integration technology.
• Modeling application structure/logic using Topic Maps is
– efficient and straightforward
– easily understood by domain experts.
• A common interface for heterogeneous data greatly reduced the complexity and the time needed for application development.
Focus on application logic rather than technical integration details.
Open Issues
• Improve existing Topic Maps interfaces (like TMAPI) to better fit specific use cases:
– Should we have TMAPI with sessions?
Thank you
Questions?