collaborative nlp-aided ontology modelling - fbk · collaborative nlp-aided ontology modelling 1...
TRANSCRIPT
![Page 1: Collaborative NLP-aided ontology modelling - FBK · Collaborative NLP-aided ontology modelling 1 Chiara Ghidini Marco Rospocher ghidini@fbk.eu rospocher@fbk.eu](https://reader030.vdocument.in/reader030/viewer/2022021802/5b7840e87f8b9a47518ecad1/html5/thumbnails/1.jpg)
Collaborative NLP-aided ontology modelling
1
Chiara Ghidini Marco Rospocher
[email protected] [email protected]
International Winter School on Language and Data/Knowledge Technologies
TrentoRISE – Trento, 24th February 2012
![Page 2: Collaborative NLP-aided ontology modelling - FBK · Collaborative NLP-aided ontology modelling 1 Chiara Ghidini Marco Rospocher ghidini@fbk.eu rospocher@fbk.eu](https://reader030.vdocument.in/reader030/viewer/2022021802/5b7840e87f8b9a47518ecad1/html5/thumbnails/2.jpg)
ONTOLOGIES & ONTOLOGY MODELLING
Part I
2
![Page 3: Collaborative NLP-aided ontology modelling - FBK · Collaborative NLP-aided ontology modelling 1 Chiara Ghidini Marco Rospocher ghidini@fbk.eu rospocher@fbk.eu](https://reader030.vdocument.in/reader030/viewer/2022021802/5b7840e87f8b9a47518ecad1/html5/thumbnails/3.jpg)
What is an ontology?
� Many definitions of an ontology in literature;
� Here we refer to an ontology as a “formal specifications of the terms in the domain and relations among them” (*)
� Ontologies contain a formal explicit description of:
� Concepts (aka classes)
� Relations (aka roles)
� Individuals (aka instances)
� Classes (and relations) can be ordered in taxonomies using the subclass relation
(*) [Gruber, T.R. (1993). A Translation Approach to Portable Ontology Specification. Knowledge Acquisition 5: 199-220.]
3
![Page 4: Collaborative NLP-aided ontology modelling - FBK · Collaborative NLP-aided ontology modelling 1 Chiara Ghidini Marco Rospocher ghidini@fbk.eu rospocher@fbk.eu](https://reader030.vdocument.in/reader030/viewer/2022021802/5b7840e87f8b9a47518ecad1/html5/thumbnails/4.jpg)
Andrew
Charles
Patty
Rome
Milan
London
Paris
PeopleTown
hasWife
hasBrother
livesIn
livesIn
Andrew
Charles
Patty
Rome
Milan
London
Paris
PeopleTown
hasWife
hasBrother
livesIn
livesIn
Andrew
Charles
Patty
Rome
Milan
London
Paris
PeopleTown
hasWife
hasBrother
livesIn
livesIn
Andrew
Charles
Patty
Rome
Milan
London
Paris
PeopleTown
hasWife
hasBrother
livesIn
livesIn
In a picture
4
![Page 5: Collaborative NLP-aided ontology modelling - FBK · Collaborative NLP-aided ontology modelling 1 Chiara Ghidini Marco Rospocher ghidini@fbk.eu rospocher@fbk.eu](https://reader030.vdocument.in/reader030/viewer/2022021802/5b7840e87f8b9a47518ecad1/html5/thumbnails/5.jpg)
Taxonomies
� Classes (and relations) can be ordered in taxonomies using the subclass relation
� Example: biological classification of species
� Same for roles
5
![Page 6: Collaborative NLP-aided ontology modelling - FBK · Collaborative NLP-aided ontology modelling 1 Chiara Ghidini Marco Rospocher ghidini@fbk.eu rospocher@fbk.eu](https://reader030.vdocument.in/reader030/viewer/2022021802/5b7840e87f8b9a47518ecad1/html5/thumbnails/6.jpg)
Axioms
� Concepts can be formally described through axioms
� A Pizza Margherita is a pizza which has both tomato topping and mozzarella topping
6
PizzaMargherita v PizzaP izzaMargherita v 9hasTopping.TomatoToppingP izzaMargherita v 9hasTopping.MozzarellaTopping
![Page 7: Collaborative NLP-aided ontology modelling - FBK · Collaborative NLP-aided ontology modelling 1 Chiara Ghidini Marco Rospocher ghidini@fbk.eu rospocher@fbk.eu](https://reader030.vdocument.in/reader030/viewer/2022021802/5b7840e87f8b9a47518ecad1/html5/thumbnails/7.jpg)
Different types of Ontologies
7 Slide taken from “Ontology-Driven Conceptual Modelling” A tutorial by Nicola Guarino.
![Page 8: Collaborative NLP-aided ontology modelling - FBK · Collaborative NLP-aided ontology modelling 1 Chiara Ghidini Marco Rospocher ghidini@fbk.eu rospocher@fbk.eu](https://reader030.vdocument.in/reader030/viewer/2022021802/5b7840e87f8b9a47518ecad1/html5/thumbnails/8.jpg)
Why to develop an ontology?
� To share common understanding of the structure of information among people or software agents
� To enable reuse of domain knowledge
� To make domain assumptions explicit
� To separate domain knowledge from the operational knowledge
� To analyze domain knowledge
8
![Page 9: Collaborative NLP-aided ontology modelling - FBK · Collaborative NLP-aided ontology modelling 1 Chiara Ghidini Marco Rospocher ghidini@fbk.eu rospocher@fbk.eu](https://reader030.vdocument.in/reader030/viewer/2022021802/5b7840e87f8b9a47518ecad1/html5/thumbnails/9.jpg)
Examples of ontologies
� Large taxonomies categorizing Web sites (such as on Yahoo!)
� Medical Ontologies (such as SNOMED) to annotate documents and share information
� Categorizations of products for sale and their features (such as on Amazon.com, but also smaller enterprises).
� Therefore……
The development of ontologies is moving from the realm of research labs to the “desktop of domain experts”
9
![Page 10: Collaborative NLP-aided ontology modelling - FBK · Collaborative NLP-aided ontology modelling 1 Chiara Ghidini Marco Rospocher ghidini@fbk.eu rospocher@fbk.eu](https://reader030.vdocument.in/reader030/viewer/2022021802/5b7840e87f8b9a47518ecad1/html5/thumbnails/10.jpg)
Problems in ontology modeling
1. Modelling is a collaborative activity
10
How to write an ontology?
Domain expert
How to change this axiom? Is this information
relevant?
What is the meaning of this description?
Knowledge engineer
![Page 11: Collaborative NLP-aided ontology modelling - FBK · Collaborative NLP-aided ontology modelling 1 Chiara Ghidini Marco Rospocher ghidini@fbk.eu rospocher@fbk.eu](https://reader030.vdocument.in/reader030/viewer/2022021802/5b7840e87f8b9a47518ecad1/html5/thumbnails/11.jpg)
Problems in ontology modeling
2. Modelling is a time-consuming and error-prone activity, and often needs parsing of a large quantity of material.
11
Do I really need to read all this?
![Page 12: Collaborative NLP-aided ontology modelling - FBK · Collaborative NLP-aided ontology modelling 1 Chiara Ghidini Marco Rospocher ghidini@fbk.eu rospocher@fbk.eu](https://reader030.vdocument.in/reader030/viewer/2022021802/5b7840e87f8b9a47518ecad1/html5/thumbnails/12.jpg)
Our contribution
Our Contribution to solve those problems
1. Framework for the collaborative modeling of ontologies using wikis
2. Automatic extraction of key-phrases for ontology modelling
12
![Page 13: Collaborative NLP-aided ontology modelling - FBK · Collaborative NLP-aided ontology modelling 1 Chiara Ghidini Marco Rospocher ghidini@fbk.eu rospocher@fbk.eu](https://reader030.vdocument.in/reader030/viewer/2022021802/5b7840e87f8b9a47518ecad1/html5/thumbnails/13.jpg)
COLLABORATIVE FRAMEWORK FOR ONTOLOGY MODELING
Part II
13
![Page 14: Collaborative NLP-aided ontology modelling - FBK · Collaborative NLP-aided ontology modelling 1 Chiara Ghidini Marco Rospocher ghidini@fbk.eu rospocher@fbk.eu](https://reader030.vdocument.in/reader030/viewer/2022021802/5b7840e87f8b9a47518ecad1/html5/thumbnails/14.jpg)
Why a wiki-based conceptual modeling tool?
� Wikis support collaborative editing; � Users are quite familiar with viewing/editing wiki
content (e.g. Wikipedia); � Only a web-browser is required on the client side; � Wikis provide a shared knowledge repository
accessible by users spread all over the world; � Wikis can provide a uniform tool/interface for the
specification of different model types (e.g. ontologies, processes, …);
14 14
![Page 15: Collaborative NLP-aided ontology modelling - FBK · Collaborative NLP-aided ontology modelling 1 Chiara Ghidini Marco Rospocher ghidini@fbk.eu rospocher@fbk.eu](https://reader030.vdocument.in/reader030/viewer/2022021802/5b7840e87f8b9a47518ecad1/html5/thumbnails/15.jpg)
An architecture for collaborative conceptual modeling in wikis
1. One element One page
� each element of the model is represented by a page in the wiki;
15 15
that stretches above the surrounding land in a limited area usually in the form of a peak. A mountain is generally steeper than a hill.
Mountain
A mountain is a large landform
The highest mountain on earth is the Mount Everest
Concept “Mountain”
![Page 16: Collaborative NLP-aided ontology modelling - FBK · Collaborative NLP-aided ontology modelling 1 Chiara Ghidini Marco Rospocher ghidini@fbk.eu rospocher@fbk.eu](https://reader030.vdocument.in/reader030/viewer/2022021802/5b7840e87f8b9a47518ecad1/html5/thumbnails/16.jpg)
An architecture for collaborative conceptual modeling in wikis
2. Unstructured and structured descriptions
� each page contains both structured and unstructured content;
16 16
that stretches above the surrounding land in a limited area usually in the form of a peak. A mountain is generally steeper than a hill.
Mountain
A mountain is a large landform
The highest mountain on earth is the Mount Everest
v Landform
v 8madeOf(Earth tRock)
v 9height. �2500
Mountain(Mt.Everest)
v ¬Hill u ¬Plain
Mountain(Mt.Kilimanjaro)
(unstructured content) (structured content)
![Page 17: Collaborative NLP-aided ontology modelling - FBK · Collaborative NLP-aided ontology modelling 1 Chiara Ghidini Marco Rospocher ghidini@fbk.eu rospocher@fbk.eu](https://reader030.vdocument.in/reader030/viewer/2022021802/5b7840e87f8b9a47518ecad1/html5/thumbnails/17.jpg)
An architecture for collaborative conceptual modeling in wikis
3. Different views to access the model:
� different views to support different modeling actors;
17 17
that stretches above the surrounding land in a limited area usually in the form of a peak. A mountain is generally steeper than a hill.
A mountain is a large landform
The highest mountain on earth is the Mount Everest
(unstructured view)
Mountain
Mountain
(semi - structured view)
earthmade of
is a landform
height at least 2,500m
samples Mt. Everest
made of rock
different from hill, plain
Mt. Kilimanjaro
v Landform
v 8madeOf(Earth tRock)
v 9height. �2500
Mountain(Mt.Everest)
v ¬Hill u ¬Plain
Mountain(Mt.Kilimanjaro)
(fully - structured view)
Mountain
![Page 18: Collaborative NLP-aided ontology modelling - FBK · Collaborative NLP-aided ontology modelling 1 Chiara Ghidini Marco Rospocher ghidini@fbk.eu rospocher@fbk.eu](https://reader030.vdocument.in/reader030/viewer/2022021802/5b7840e87f8b9a47518ecad1/html5/thumbnails/18.jpg)
An architecture for collaborative conceptual modeling
� Alignment between the different views
that stretches above the surrounding land in a limited area usually in the form of a peak. A mountain is generally steeper than a hill.
Mountain
(unstructured view)
A mountain is a large landform
The highest mountain on earth is the Mount Everest
earthmade of
is a landform
height at least 2,500m
samples Mt. Everest
made of rock
different from hill, plain
Mt. Kilimanjaro
(semi-structured view)
v Landform
v 8madeOf(Earth tRock)
v 9height. �2500
Mountain(Mt. Everest)
v ¬Hill u ¬Plain
Mountain(Mt. Kilimanjaro)
(fully structured view)
18 18
![Page 19: Collaborative NLP-aided ontology modelling - FBK · Collaborative NLP-aided ontology modelling 1 Chiara Ghidini Marco Rospocher ghidini@fbk.eu rospocher@fbk.eu](https://reader030.vdocument.in/reader030/viewer/2022021802/5b7840e87f8b9a47518ecad1/html5/thumbnails/19.jpg)
19
MoKi: The modeling wiki
Collaborative editing between knowledge experts and knowledge engineers
Web 2.0 tool
Term extraction features
Automatic translation from and to OWL and BPMN
Support for validation and feedback Integrated ontology and process modeling
Graphical and textual editing
Available as open source tool. Demo at moki.fbk.eu
![Page 20: Collaborative NLP-aided ontology modelling - FBK · Collaborative NLP-aided ontology modelling 1 Chiara Ghidini Marco Rospocher ghidini@fbk.eu rospocher@fbk.eu](https://reader030.vdocument.in/reader030/viewer/2022021802/5b7840e87f8b9a47518ecad1/html5/thumbnails/20.jpg)
MOKI DEMO Part III
20
![Page 21: Collaborative NLP-aided ontology modelling - FBK · Collaborative NLP-aided ontology modelling 1 Chiara Ghidini Marco Rospocher ghidini@fbk.eu rospocher@fbk.eu](https://reader030.vdocument.in/reader030/viewer/2022021802/5b7840e87f8b9a47518ecad1/html5/thumbnails/21.jpg)
Definition of the collaborative framework
Hints on the applicability of the tool also for other conceptual modelling languages (BPMN)
Showcase of results and usages
21
![Page 22: Collaborative NLP-aided ontology modelling - FBK · Collaborative NLP-aided ontology modelling 1 Chiara Ghidini Marco Rospocher ghidini@fbk.eu rospocher@fbk.eu](https://reader030.vdocument.in/reader030/viewer/2022021802/5b7840e87f8b9a47518ecad1/html5/thumbnails/22.jpg)
AUTOMATIC EXTRACTION OF KEY-PHRASES FOR ONTOLOGY MODELLING
Part IV
22
![Page 23: Collaborative NLP-aided ontology modelling - FBK · Collaborative NLP-aided ontology modelling 1 Chiara Ghidini Marco Rospocher ghidini@fbk.eu rospocher@fbk.eu](https://reader030.vdocument.in/reader030/viewer/2022021802/5b7840e87f8b9a47518ecad1/html5/thumbnails/23.jpg)
NLP-aided ontology engineering
� Support ontology modeling by extracting concepts characterizing a domain from a reference text corpus…
� … actually, by automatically extracting key-phrases
� Key-phrases are the terms characterizing a document or a corpus of documents => candidate relevant concepts of the domain described by the corpus
� Automatic concepts extraction plays an important role in ontology modeling:
� To boost the ontology construction/extension phase
� To “validate” an ontology against a domain corpus
![Page 24: Collaborative NLP-aided ontology modelling - FBK · Collaborative NLP-aided ontology modelling 1 Chiara Ghidini Marco Rospocher ghidini@fbk.eu rospocher@fbk.eu](https://reader030.vdocument.in/reader030/viewer/2022021802/5b7840e87f8b9a47518ecad1/html5/thumbnails/24.jpg)
An NLP-aided ontology engineering framework
� A framework for supporting ontology building/evaluation by automatic concept extraction from a reference text corpus
� A fully-working and publicly available implementation of the proposed framework in MoKi
![Page 25: Collaborative NLP-aided ontology modelling - FBK · Collaborative NLP-aided ontology modelling 1 Chiara Ghidini Marco Rospocher ghidini@fbk.eu rospocher@fbk.eu](https://reader030.vdocument.in/reader030/viewer/2022021802/5b7840e87f8b9a47518ecad1/html5/thumbnails/25.jpg)
NLP-aided ontology engineering
25
Key-concepts extraction
Alignment with
additional resources
Corpus collection
External resources (e.g Wordnet)
Candidate key-concepts list
Enriched key-concepts list
Extended ontology
Domain corpus
Current ontology
Validation / Evaluation
Ontology metrics
![Page 26: Collaborative NLP-aided ontology modelling - FBK · Collaborative NLP-aided ontology modelling 1 Chiara Ghidini Marco Rospocher ghidini@fbk.eu rospocher@fbk.eu](https://reader030.vdocument.in/reader030/viewer/2022021802/5b7840e87f8b9a47518ecad1/html5/thumbnails/26.jpg)
Corpus Selection
� The corpus can be manually or automatically selected (e.g. crawling web pages).
� Corpus could consist of:
� (large) collection of documents
• e.g. pollen bulletins crawled on-line
� A single big document
• e.g. the BPMN specification.
Key-concepts extraction!
Alignment with external
resources!
Corpus collection!
Manual validation!
![Page 27: Collaborative NLP-aided ontology modelling - FBK · Collaborative NLP-aided ontology modelling 1 Chiara Ghidini Marco Rospocher ghidini@fbk.eu rospocher@fbk.eu](https://reader030.vdocument.in/reader030/viewer/2022021802/5b7840e87f8b9a47518ecad1/html5/thumbnails/27.jpg)
Key-concept extraction
� Performed by KX (Keyphrase eXtraction) tool.
� exploits linguistic information and statistical measures to select a list of weighted keywords from documents;
� handles multi-words;
� flexible parameters configuration;
� easily adaptable to new languages;
� ranked 2nd (out of 20) at SemEval2010, task on “Automatic Keyphrase Extraction from Scientific Articles”.
Key-concepts extraction!
Alignment with external
resources!
Corpus collection!
Manual validation!
![Page 28: Collaborative NLP-aided ontology modelling - FBK · Collaborative NLP-aided ontology modelling 1 Chiara Ghidini Marco Rospocher ghidini@fbk.eu rospocher@fbk.eu](https://reader030.vdocument.in/reader030/viewer/2022021802/5b7840e87f8b9a47518ecad1/html5/thumbnails/28.jpg)
Alignment with additional resources
� Extracted key-concepts aligned and enriched with additional resources:
� WordNet (& WN domains): synonyms, definitions, SUMO labels;
� Wikipedia: link to the Wikipedia page corresponding to the term (exploiting BabelNet);
� Other external resources (e.g. dictionary).
� Enriched key-concepts list matched against the ontology, to detect already defined key-concepts.
Key-concepts extraction!
Alignment with external
resources!
Corpus collection!
Manual validation!
![Page 29: Collaborative NLP-aided ontology modelling - FBK · Collaborative NLP-aided ontology modelling 1 Chiara Ghidini Marco Rospocher ghidini@fbk.eu rospocher@fbk.eu](https://reader030.vdocument.in/reader030/viewer/2022021802/5b7840e87f8b9a47518ecad1/html5/thumbnails/29.jpg)
Ontology Extension / Evaluation
� Ontology Extension:
� The user decides which of the extracted key-concepts to add to the ontology;
� The additional details provided in the enriched list may guide the formalization;
• e.g. is-a related synsets, definitions, …
� Ontology Terminological Evaluation:
� Automatically computed metrics (variants of IR precision and recall) support users in determining the terminological coverage of the ontology wrt to the corpus used;
Key-concepts extraction!
Alignment with external
resources!
Corpus collection!
Manual validation!
![Page 30: Collaborative NLP-aided ontology modelling - FBK · Collaborative NLP-aided ontology modelling 1 Chiara Ghidini Marco Rospocher ghidini@fbk.eu rospocher@fbk.eu](https://reader030.vdocument.in/reader030/viewer/2022021802/5b7840e87f8b9a47518ecad1/html5/thumbnails/30.jpg)
Application Scenarios
� The proposed approach can support several different ontology engineering tasks:
� Ontology construction boosting: building an ontology from scratch;
� Ontology extension: adding new concepts to an existing ontology;
� Ontology evaluation: evaluating terminologically an ontology against a domain corpus;
� Ontology ranking: ranking candidate ontologies wrt a given domain corpus;
� Ranking of ontology concepts: determining which are the domain-wise most relevant concepts defined in an ontology.
![Page 31: Collaborative NLP-aided ontology modelling - FBK · Collaborative NLP-aided ontology modelling 1 Chiara Ghidini Marco Rospocher ghidini@fbk.eu rospocher@fbk.eu](https://reader030.vdocument.in/reader030/viewer/2022021802/5b7840e87f8b9a47518ecad1/html5/thumbnails/31.jpg)
� Framework fully-implemented in MoKi
� Publicly available @ moki.fbk.eu
� Accepts a collection of digital documents in any popular formats
� Let’s see it in action!
![Page 32: Collaborative NLP-aided ontology modelling - FBK · Collaborative NLP-aided ontology modelling 1 Chiara Ghidini Marco Rospocher ghidini@fbk.eu rospocher@fbk.eu](https://reader030.vdocument.in/reader030/viewer/2022021802/5b7840e87f8b9a47518ecad1/html5/thumbnails/32.jpg)
MOKI DEMO (CONTINUED)
Part V
32
![Page 33: Collaborative NLP-aided ontology modelling - FBK · Collaborative NLP-aided ontology modelling 1 Chiara Ghidini Marco Rospocher ghidini@fbk.eu rospocher@fbk.eu](https://reader030.vdocument.in/reader030/viewer/2022021802/5b7840e87f8b9a47518ecad1/html5/thumbnails/33.jpg)
PHD CALL ON INFORMATION EXTRACTION FOR ONTOLOGY ENGINEERING
Part VI
33
![Page 34: Collaborative NLP-aided ontology modelling - FBK · Collaborative NLP-aided ontology modelling 1 Chiara Ghidini Marco Rospocher ghidini@fbk.eu rospocher@fbk.eu](https://reader030.vdocument.in/reader030/viewer/2022021802/5b7840e87f8b9a47518ecad1/html5/thumbnails/34.jpg)
Building Quality Ontologies
� Starting Point: a collaborative ontology modeling framework supported by NLP techniques
� Goal: to support building rich and high quality ontologies
� Issue: current state of the art NLP techniques for information extraction have some limitations wrt ontology modeling:
� mainly focused on the extraction of terms;
� more suitable to support the construction of light-weight medium-quality ontologies;
� Challenge: how to appropriately exploit NLP techniques to support the construction of rich and high quality ontologies?
![Page 35: Collaborative NLP-aided ontology modelling - FBK · Collaborative NLP-aided ontology modelling 1 Chiara Ghidini Marco Rospocher ghidini@fbk.eu rospocher@fbk.eu](https://reader030.vdocument.in/reader030/viewer/2022021802/5b7840e87f8b9a47518ecad1/html5/thumbnails/35.jpg)
PhD call on Information Extraction for Ontology Engineering
� Objective:
Investigate how to combine work in automatic ontology learning and work in methodologies and tools for manual knowledge engineering to produce (semi)-automatic services for ontology learning better supporting the construction of rich and good quality ontologies.
� Address key research challenges in NLP and ontology engineering.
� Strong algorithmic and methodological aspects, together with implementation-oriented tasks.
![Page 36: Collaborative NLP-aided ontology modelling - FBK · Collaborative NLP-aided ontology modelling 1 Chiara Ghidini Marco Rospocher ghidini@fbk.eu rospocher@fbk.eu](https://reader030.vdocument.in/reader030/viewer/2022021802/5b7840e87f8b9a47518ecad1/html5/thumbnails/36.jpg)
team at FBK
Multi-linguality and eGovernment application Guided domain expert modeling via template
Collaborative modeling of ontologies and processes
Ontological description of processes
![Page 37: Collaborative NLP-aided ontology modelling - FBK · Collaborative NLP-aided ontology modelling 1 Chiara Ghidini Marco Rospocher ghidini@fbk.eu rospocher@fbk.eu](https://reader030.vdocument.in/reader030/viewer/2022021802/5b7840e87f8b9a47518ecad1/html5/thumbnails/37.jpg)
Thank You! Questions?
Marco Rospocher http://dkm.fbk.eu/rospocher [email protected]
Chiara Ghidini http://dkm.fbk.eu/ghidini