interoperation of information sources via articulation of ontologies
DESCRIPTION
Interoperation of Information Sources via Articulation of Ontologies. Prasenjit Mitra, Gio Wiederhold Stanford University. Supported by AFOSR- New World Vistas Program. Outline. Introduction Preliminaries Articulation Generation Toolkit Ontology Algebra Conclusion. - PowerPoint PPT PresentationTRANSCRIPT
![Page 1: Interoperation of Information Sources via Articulation of Ontologies](https://reader036.vdocument.in/reader036/viewer/2022062500/5681574d550346895dc4efa0/html5/thumbnails/1.jpg)
Interoperation of Information Sources via Articulation of Ontologies
Prasenjit Mitra, Gio Wiederhold
Stanford University
Supported by AFOSR- New World Vistas Program
![Page 2: Interoperation of Information Sources via Articulation of Ontologies](https://reader036.vdocument.in/reader036/viewer/2022062500/5681574d550346895dc4efa0/html5/thumbnails/2.jpg)
Prasenjit Mitra 2
Outline
IntroductionPreliminariesArticulation Generation ToolkitOntology AlgebraConclusion
![Page 3: Interoperation of Information Sources via Articulation of Ontologies](https://reader036.vdocument.in/reader036/viewer/2022062500/5681574d550346895dc4efa0/html5/thumbnails/3.jpg)
Prasenjit Mitra 3
Interoperation of Information Sources
Compose information Multiple independent, heterogeneous
sourcesReliability, scalabilitySemantic heterogeneity - same term different semantics - different term same semantics
![Page 4: Interoperation of Information Sources via Articulation of Ontologies](https://reader036.vdocument.in/reader036/viewer/2022062500/5681574d550346895dc4efa0/html5/thumbnails/4.jpg)
Prasenjit Mitra 4
Preliminaries: Ontology
Ontology - hierarchy of terms and specification of their properties.
Modeled as a directed labeled graph + set of rules.
Ont = ( V, E, R) V - set of nodes(concepts) E - set of edges(properties) R - set of rules involving V,E
![Page 5: Interoperation of Information Sources via Articulation of Ontologies](https://reader036.vdocument.in/reader036/viewer/2022062500/5681574d550346895dc4efa0/html5/thumbnails/5.jpg)
Prasenjit Mitra 5
Example
Car
H123
Instance
List_PriceVID
PMitra
NameUnit
$ 40k CA
AddressAmount
Vehicle
LuxuryCar
PMitra
BuyerInstance
SubClass
Instance
VIN
H123
Owner Retail_Price
CA
Name Addr
$35k
SubClass
SubClass
BuyerOwner
Attribute
Attribute
Equ
![Page 6: Interoperation of Information Sources via Articulation of Ontologies](https://reader036.vdocument.in/reader036/viewer/2022062500/5681574d550346895dc4efa0/html5/thumbnails/6.jpg)
Prasenjit Mitra 6
Articulation Rules
Articulation rules - logic-based rules that relate concepts in two ontologies:
- Binary Relationships (O1.Car SubClassOf O2.Vehicle) (O1.Buyer Equ O2.Owner) (O2.LuxuryCar SubClassOf O1.Car)
![Page 7: Interoperation of Information Sources via Articulation of Ontologies](https://reader036.vdocument.in/reader036/viewer/2022062500/5681574d550346895dc4efa0/html5/thumbnails/7.jpg)
Prasenjit Mitra 7
Articulation Rules (contd.)
Horn Clauses - (O1.Car O1.Instance X), (X O1.Price Y), (Y > $30000) => (O2.LuxuryCar Instance X) - (V O2.Retail_Price P), (C O1.List_Price
L), (L Unit U), (L Amount A), (V Equ C) => (P Equ concat(U,A))
![Page 8: Interoperation of Information Sources via Articulation of Ontologies](https://reader036.vdocument.in/reader036/viewer/2022062500/5681574d550346895dc4efa0/html5/thumbnails/8.jpg)
Prasenjit Mitra 8
Contributions
Articulation Generation Toolkit - produce translation rules semi-
automatically - a library of reusable heuristic methods - a GUI to display ontologies and interact with the expertOntology Algebra - query rewriting and planning
![Page 9: Interoperation of Information Sources via Articulation of Ontologies](https://reader036.vdocument.in/reader036/viewer/2022062500/5681574d550346895dc4efa0/html5/thumbnails/9.jpg)
Prasenjit Mitra 9
Articulation Generator
Driver
Ont1 Ont2
Phrase Relator
Thesaurus
StructuralMatcher
SemanticNetwork
Human Expert
Context-basedWord Relator
OntA
![Page 10: Interoperation of Information Sources via Articulation of Ontologies](https://reader036.vdocument.in/reader036/viewer/2022062500/5681574d550346895dc4efa0/html5/thumbnails/10.jpg)
Prasenjit Mitra 10
Articulation Generation Methods
Non-iterative Methods - Lexical Matcher - Thesaurus-based Matcher - Corpus-based Matcher - Instance-based MatcherIterative Methods - Structural Matcher - Inference-based Matcher
![Page 11: Interoperation of Information Sources via Articulation of Ontologies](https://reader036.vdocument.in/reader036/viewer/2022062500/5681574d550346895dc4efa0/html5/thumbnails/11.jpg)
Prasenjit Mitra 11
Lexical Methods
Preprocessing rules. -Expert-generated seed rules. e.g., (O1.List_Price Equ O2.Retail_Price) -Context-based preprocessing directives. e.g., (O1.UK_Govt Equ O2.US_Govt) -Stop-word Removal & Stemming -Word match (full or partial) -Phrase match e.g., (O1.Ministry_Of_Defence Equ O2.Defense_Ministry) 0.6
![Page 12: Interoperation of Information Sources via Articulation of Ontologies](https://reader036.vdocument.in/reader036/viewer/2022062500/5681574d550346895dc4efa0/html5/thumbnails/12.jpg)
Prasenjit Mitra 12
Thesaurus-based methods
Consult a dictionary/thesaurus to find synonyms, related words
Generate a similarity measure or relatedness measure
- words that have similar words in their definitions are similar
Get more semantically meaningful relationships from WordNet (syn, hyper)
![Page 13: Interoperation of Information Sources via Articulation of Ontologies](https://reader036.vdocument.in/reader036/viewer/2022062500/5681574d550346895dc4efa0/html5/thumbnails/13.jpg)
Prasenjit Mitra 13
Candidate Match Repository
Term linkages automatically extracted from 1912 Webster’s dictionary *
* free, other sources . being processed.
Based on processing headwords definitions Notice presence
of 2 domains: chemistry, transport
![Page 14: Interoperation of Information Sources via Articulation of Ontologies](https://reader036.vdocument.in/reader036/viewer/2022062500/5681574d550346895dc4efa0/html5/thumbnails/14.jpg)
Prasenjit Mitra 14
Corpus-based
Collect a set of text documents preferably from same domain
- search using keywords in googleBuild a context vector (1000-character
neighbourhood) for each wordCompute word-pair similarity based on the
cosine of the vectorsUse word-pair similarity to find similarity
among labels of nodes/edges
![Page 15: Interoperation of Information Sources via Articulation of Ontologies](https://reader036.vdocument.in/reader036/viewer/2022062500/5681574d550346895dc4efa0/html5/thumbnails/15.jpg)
Prasenjit Mitra 15
Structural Methods
Uses results of lexical match If x% of parent nodes match & y% of
children nodes matchSpecial relations (AttributeOf) match
![Page 16: Interoperation of Information Sources via Articulation of Ontologies](https://reader036.vdocument.in/reader036/viewer/2022062500/5681574d550346895dc4efa0/html5/thumbnails/16.jpg)
Prasenjit Mitra 16
Tools to create articulations
Graph matcherforArticulation- creatingExpert
Vehicle ontology
Transport ontology
Suggestionsfor articulations
![Page 17: Interoperation of Information Sources via Articulation of Ontologies](https://reader036.vdocument.in/reader036/viewer/2022062500/5681574d550346895dc4efa0/html5/thumbnails/17.jpg)
Prasenjit Mitra 17
continue from initial point
Also suggest similar terms for further articulation:
• by spelling similarity,• by graph position• by term match repository
Expert response:1. Okay2. False3. Irrelevant to this articulation
All results are recorded
Okay ’s are converted into articulation rules
![Page 18: Interoperation of Information Sources via Articulation of Ontologies](https://reader036.vdocument.in/reader036/viewer/2022062500/5681574d550346895dc4efa0/html5/thumbnails/18.jpg)
Prasenjit Mitra 18
An Ontology Algebra
Operations can be composed
Operations can be rearranged
Alternate arrangements can be evaluated
Optimization is enabled
The record of past operations can be
kept and reused when sources change
![Page 19: Interoperation of Information Sources via Articulation of Ontologies](https://reader036.vdocument.in/reader036/viewer/2022062500/5681574d550346895dc4efa0/html5/thumbnails/19.jpg)
Prasenjit Mitra 19
Binary Operators
A knowledge-based algebra for ontologies
The Articulation Function (ArtGen), given two ontologies, supplies articulation rules between them.
Intersection create a subset ontology keep sharable entries
Union create a joint ontology merge entries
Difference create a distinct ontology remove shared entries
![Page 20: Interoperation of Information Sources via Articulation of Ontologies](https://reader036.vdocument.in/reader036/viewer/2022062500/5681574d550346895dc4efa0/html5/thumbnails/20.jpg)
Prasenjit Mitra 20
Intersection: Definition
O1 = (V1, E1, R1) O2 = (V2, E2, R2)OI = ( O1 IntArtGen O2) = (VI, EI, RI)
Arules = ArtGen( O1, O2 )VI = Nodes( Arules )EI = Edges( Arules ) + Edges(E1, VI.V1) +
Edges( E2, VI.V2)RI = Arules + Rules( R1, VI.V1) + Rules( R2,
VI.V2)
![Page 21: Interoperation of Information Sources via Articulation of Ontologies](https://reader036.vdocument.in/reader036/viewer/2022062500/5681574d550346895dc4efa0/html5/thumbnails/21.jpg)
Prasenjit Mitra 21
Intersection: Example
ARules = { (O1.Car SubClass O2.LuxuryCar), (O1.MSRP Equ O2.Price)} NI = ( O1.Car, O1.MSRP, O2.LuxuryCar, O2.Price ) EI = Edges(ARules) + {(O1.Car Attribute O1.MSRP), (O2.LuxuryCar Attribute Price)}
InexpCar
Car LuxuryCar
MSRP Price LuxuryTax
SubClass
Equ
O1 O2OI
![Page 22: Interoperation of Information Sources via Articulation of Ontologies](https://reader036.vdocument.in/reader036/viewer/2022062500/5681574d550346895dc4efa0/html5/thumbnails/22.jpg)
Prasenjit Mitra 22
Intersection: Properties
Commutative? OI12=(O1 IntArtGen O2) = (O2 IntArtGen O1)=OI21
VI12 = VI21, EI12 = EI21, RI12=RI21
ARules12 = ARules21
ArtGen(O1, O2) = ArtGen(O2, O1)
IntArtGen is commutative iff ArtGen is commutative
![Page 23: Interoperation of Information Sources via Articulation of Ontologies](https://reader036.vdocument.in/reader036/viewer/2022062500/5681574d550346895dc4efa0/html5/thumbnails/23.jpg)
Prasenjit Mitra 23
Semantically Commutative
Example: ArtGen(O1,O2) : ( O1.Car SubClassOf O2.Vehicle) ArtGen(O2,O1) : ( O2.Vehicle SuperClassOf O1.Car)
Defn: ArtGen is Semantically Commutative iff ArtGen(O1,O2) <=> ArtGen(O2,O1)
Operands to intersection can be rearranged if ArtGen is semantically commutative
![Page 24: Interoperation of Information Sources via Articulation of Ontologies](https://reader036.vdocument.in/reader036/viewer/2022062500/5681574d550346895dc4efa0/html5/thumbnails/24.jpg)
Prasenjit Mitra 24
Associativity Example
Example:ArtRules(O1, O3) : (O1.Car
SubClassOf O3.Vehicle)ArtRules(O2, O3) : (O2.Truck
SubClassOf O3.Vehicle)ArtRules(O1, O2) : nullIntersection is not associative!
![Page 25: Interoperation of Information Sources via Articulation of Ontologies](https://reader036.vdocument.in/reader036/viewer/2022062500/5681574d550346895dc4efa0/html5/thumbnails/25.jpg)
Prasenjit Mitra 25
Associativity
(O1.O2).O3 = O1.(O2.O3) iff ArtGen is consistent and
transitively connective
consistent - given two nodes it generates the same relationship irrespective of relationships between other nodes
![Page 26: Interoperation of Information Sources via Articulation of Ontologies](https://reader036.vdocument.in/reader036/viewer/2022062500/5681574d550346895dc4efa0/html5/thumbnails/26.jpg)
Prasenjit Mitra 26
Associativity (contd.)
ArtGen relates A and B: (A Rel B) = (A R1 B) or (B R2 A)
Transitively connective: If ArtGen generates (A Rel B), (B Rel C) then it also generates (A Rel C’) where A O1, B O2, C,C’ O3
![Page 27: Interoperation of Information Sources via Articulation of Ontologies](https://reader036.vdocument.in/reader036/viewer/2022062500/5681574d550346895dc4efa0/html5/thumbnails/27.jpg)
Prasenjit Mitra 27
Transitive Connectivity
A B
CE
D
O1 O2
O3
![Page 28: Interoperation of Information Sources via Articulation of Ontologies](https://reader036.vdocument.in/reader036/viewer/2022062500/5681574d550346895dc4efa0/html5/thumbnails/28.jpg)
Prasenjit Mitra 28
Exploiting the result .
Processing & query evaluation is best performed withinSource Domains & by their engines
Result has linksto source
Avoid n2 problem of interpretermapping [Swartout HPKB year 1]
![Page 29: Interoperation of Information Sources via Articulation of Ontologies](https://reader036.vdocument.in/reader036/viewer/2022062500/5681574d550346895dc4efa0/html5/thumbnails/29.jpg)
Prasenjit Mitra 29
Using the Articulation Rules
Q(X) :- (LuxuryCar Instance X), (X Owner Y), (Y Addr `CA’)
Translated using Articulation Rules Q(X) :- (Car Instance X),(X List_Price Y) (Y Unit ‘$’),(Y Amount A), (A > 30000), (X Buyer Z), (Z Address ‘CA’)
![Page 30: Interoperation of Information Sources via Articulation of Ontologies](https://reader036.vdocument.in/reader036/viewer/2022062500/5681574d550346895dc4efa0/html5/thumbnails/30.jpg)
Prasenjit Mitra 30
Related Work
TsimmisGarlicInfomasterInformation ManifoldClioLSD
![Page 31: Interoperation of Information Sources via Articulation of Ontologies](https://reader036.vdocument.in/reader036/viewer/2022062500/5681574d550346895dc4efa0/html5/thumbnails/31.jpg)
Prasenjit Mitra 31
Conclusion
Introduced an Ontology Interoperation System.
Interoperation based on articulations that bridge the semantic gap.
Graph-oriented data model, logical rulesFounded on Ontology Algebra
Thanks to Stefan Decker, Alex Carobus, Jan Jannink, Sergey Melnik, Shrish Agarwal