Linked Data and SKOS
DESCRIPTION
Connecting the dots:
* Publishing SKOS thesauri as linked data
* Generate SKOS from LOD sources
* Usage of SKOS thesauri for entity extraction & content enrichment from LOD sources
* Use linked data mechanisms for collaborative thesaurus management
* Usage of SKOS for linked data alignment & disambiguation
TRANSCRIPT
Linked Data and SKOS: Connecting the dots.
Mag. Andreas Blumauer, Semantic Web Company
Agenda
• Brief introduction to Linked Data
• Very brief introduction to SKOS-based Thesaurus Management
• Linked Data & SKOS
  1. Publishing SKOS thesauri as linked data
  2. Generate SKOS from LOD sources
  3. Usage of SKOS thesauri for entity extraction & content enrichment from LOD sources
  4. Use linked data mechanisms for collaborative thesaurus management
  5. Usage of SKOS for linked data alignment & disambiguation
• Conclusion
© Semantic Web Company – http://www.semantic-web.at/ 2
Brief introduction to Linked Data
Towards a „Web of Data“
Web of Data (Hyperdata): Data on the Web (Open Data)
Web of Documents (Hypertext): Documents on the Web
From ‚Gopher‘ to ‚Super-Mashups‘
reegle.info
Live demo: http://reegle.info/countries
Answering questions
Find drugs
- related to asthma
- that are linked to a curated molecular interaction in the literature
- where the protein is known to cause an inflammatory response
[Figure: graph connecting Drug:Auranofin, Molecule_Interaction_Z24, pubed:Paper4018, dbpedia:Protein_XY, bio:Inflammation and dbpedia:Asthma through the relations "caused by", "contains", "treats" and "related topic"]
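The question on this slide can be read as a chain of triple patterns over the linked graph. The following is a minimal sketch in plain Python, not a definitive implementation. The node identifiers are taken from the slide, but the edge directions, the underscore predicate names and the helper functions `subjects`, `objects` and `drugs_for` are assumptions made for illustration, since the original figure did not survive.

```python
# Node identifiers come from the slide. Every edge direction below is an
# assumption made for illustration, not a fact read from the original figure.
TRIPLES = {
    ("Drug:Auranofin", "treats", "dbpedia:Asthma"),
    ("Molecule_Interaction_Z24", "contains", "Drug:Auranofin"),
    ("Molecule_Interaction_Z24", "contains", "dbpedia:Protein_XY"),
    ("pubed:Paper4018", "contains", "Molecule_Interaction_Z24"),
    ("bio:Inflammation", "caused_by", "dbpedia:Protein_XY"),
    ("bio:Inflammation", "related_topic", "dbpedia:Asthma"),
}


def subjects(predicate, obj):
    """All subjects s such that (s, predicate, obj) is in the graph."""
    return {s for s, p, o in TRIPLES if p == predicate and o == obj}


def objects(subject, predicate):
    """All objects o such that (subject, predicate, o) is in the graph."""
    return {o for s, p, o in TRIPLES if s == subject and p == predicate}


def drugs_for(disease):
    """Drugs that treat `disease`, take part in an interaction that some
    paper contains, and whose interaction partner causes inflammation."""
    hits = set()
    for drug in subjects("treats", disease):
        for interaction in subjects("contains", drug):
            published = bool(subjects("contains", interaction))
            partners = objects(interaction, "contains") - {drug}
            inflammatory = any(
                "bio:Inflammation" in subjects("caused_by", protein)
                for protein in partners
            )
            if published and inflammatory:
                hits.add(drug)
    return hits


print(drugs_for("dbpedia:Asthma"))
```

In a real deployment the same chain would typically be written as one SPARQL query against the merged datasets; the nested loops here only make the join steps visible.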
How it works
The grapefruit is a subtropical citrus tree first bred in Barbados.

http://sws.geonames.org/3374084/
  child of: http://sws.geonames.org/6255149/
  parent of: http://sws.geonames.org/3373557/
  long: 59.53
  lat: 13.16

http://dbpedia.org/resource/Grapefruit
  subject: category:Flora_of_Barbados
  thumbnail: http://upload.wikimedia.org/...

http://rdf.freebase.com/rdf/en.grapefruit
  vitamin_c: 0.0344g
  dietary_restrictions: gluten-free_diet
  diets_that_like_this: grapefruit_diet
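Because every property hangs off a URI, descriptions from different sources can be merged into one view. The sketch below illustrates this in plain Python. The property values are copied from the slide, while the `same_as` link between the DBpedia and Freebase URIs and the helper `describe` are assumptions introduced here for illustration.

```python
from collections import defaultdict

DBPEDIA = "http://dbpedia.org/resource/Grapefruit"
FREEBASE = "http://rdf.freebase.com/rdf/en.grapefruit"

# (subject, predicate, object) statements as listed on the slide.
triples = [
    (DBPEDIA, "subject", "category:Flora_of_Barbados"),
    (FREEBASE, "vitamin_c", "0.0344g"),
    (FREEBASE, "dietary_restrictions", "gluten-free_diet"),
    (FREEBASE, "diets_that_like_this", "grapefruit_diet"),
    # Assumption: an identity link between the two sources (not on the slide).
    (DBPEDIA, "same_as", FREEBASE),
]


def describe(uri, triples):
    """Collect the properties of `uri` and of every resource reachable
    from it through same_as links, in either direction."""
    aliases = {uri}
    changed = True
    while changed:
        changed = False
        for s, p, o in triples:
            # Exactly one end known so far: pull the other end in.
            if p == "same_as" and (s in aliases) != (o in aliases):
                aliases |= {s, o}
                changed = True
    merged = defaultdict(set)
    for s, p, o in triples:
        if s in aliases and p != "same_as":
            merged[p].add(o)
    return dict(merged)


view = describe(DBPEDIA, triples)
print(view["vitamin_c"])
```

Asking for the DBpedia URI now also returns the nutritional data that only Freebase holds, without any application-specific mapping code.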
Linked Open Data (LOD) Cloud
Source: http://www4.wiwiss.fu-berlin.de/lodcloud/state/
State of the Linked Data Cloud
Source: http://www4.wiwiss.fu-berlin.de/lodcloud/state/
Domain                  Number of datasets  Triples         %        (Out-)Links  %
Media                   25                  1,841,852,061   5.82 %   50,440,705   10.01 %
Geographic              31                  6,145,532,484   19.43 %  35,812,328   7.11 %
Government              49                  13,315,009,400  42.09 %  19,343,519   3.84 %
Publications            87                  2,950,720,693   9.33 %   139,925,218  27.76 %
Cross-domain            41                  4,184,635,715   13.23 %  63,183,065   12.54 %
Life sciences           41                  3,036,336,004   9.60 %   191,844,090  38.06 %
User-generated content  20                  134,127,413     0.42 %   3,449,143    0.68 %
Total                   295                 31,634,213,770           503,998,829
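The percentage columns give each domain's share of the cloud-wide totals. As a quick check, the short Python sketch below recomputes a few of them using only figures from the table; the helper name `share` is introduced here for illustration.

```python
TOTAL_TRIPLES = 31_634_213_770
TOTAL_LINKS = 503_998_829

# domain: (triples, out-links), copied from the table
DOMAINS = {
    "Media": (1_841_852_061, 50_440_705),
    "Government": (13_315_009_400, 19_343_519),
    "Life sciences": (3_036_336_004, 191_844_090),
}


def share(part, total):
    """Percentage of `total` contributed by `part`, to two decimals."""
    return round(100 * part / total, 2)


for name, (n_triples, n_links) in DOMAINS.items():
    print(
        f"{name}: {share(n_triples, TOTAL_TRIPLES)} % of triples, "
        f"{share(n_links, TOTAL_LINKS)} % of links"
    )
```

The comparison shows the imbalance in the table: government data holds the largest share of triples but a small share of outgoing links, while life sciences is the most densely interlinked domain.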
Core disciplines: entity extraction, disambiguation & data alignment
The grapefruit is a subtropical citrus tree bred in Barbados.
Context Elicitation
Grapefruit (Citrus paradisi) is a subtropical citrus tree first bred in Barbados.
Two main applications of linked data
• Complex search queries
  • Which other fruits provide a comparable amount of vitamin C to grapefruits?
  • Which country produces the most grapefruits per capita?
  • What is the relation between grapefruits and Georgian wine?
• Super-Mashups
  Dietary restrictions
  • Gluten-free diet
  • Vegetarian
  • Veganism
  Other plants from Barbados
  • Coconut
  • Matico
  • Grandleaf Seagrape
  Top Producers
  • USA
  • China
  • South Africa
The technical perspective: data integration the classical way
Data integration realised at application layer
Implicit conceptual model
The technical perspective: data integration the linked data way
Integration on data level
Application on top of explicit conceptual model
The business perspective: costs of data integration
Source: Price Waterhouse Coopers – Technology Forecast, Spring 2009
Linked Data: Use Cases
1. Linked Data for internal data integration
2. Linked Data Consuming
3. Linked Data Publishing
4. Combine 1, 2, 3
Linked data – the next evolution of the database?
Source: Fujitsu – White Paper Linked Data: March 2012
Thesaurus Management based on SKOS
SKOS stands for ‚Simple Knowledge Organization System‘
• W3C Standard since 2009
• Based on Semantic Web standards
• Open for linking with additional linked data
http://www.w3.org/2004/02/skos/
Example: Schools Online Thesaurus
Live demo: http://scot.curriculum.edu.au/
SKOS stands for ‚Simple Knowledge Organization System‘
1. Each concept is in one or many concept schemes
2. Each concept has one URI
3. Each concept has one or more labels
4. (Poly-)hierarchical and non-hierarchical relations
5. Matching between concepts from various sources
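The five points of the SKOS model can be mirrored in a small data structure. The sketch below uses plain Python dataclasses. The field names follow the SKOS properties `skos:prefLabel`, `skos:altLabel`, `skos:inScheme`, `skos:broader`, `skos:related` and `skos:exactMatch`; the `example.org` URIs, the concept data and the helper `narrower_of` are hypothetical and only serve as illustration.

```python
from dataclasses import dataclass, field


@dataclass
class Concept:
    uri: str                                        # 2. exactly one URI
    pref_label: dict                                # 3. preferred label per language
    alt_labels: dict = field(default_factory=dict)  # 3. further labels (synonyms)
    in_schemes: set = field(default_factory=set)    # 1. one or many concept schemes
    broader: set = field(default_factory=set)       # 4. hierarchical, several parents allowed
    related: set = field(default_factory=set)       # 4. non-hierarchical
    exact_match: set = field(default_factory=set)   # 5. mapping to other sources


# Hypothetical URIs and data, for illustration only.
BASE = "http://example.org/thesaurus/"
grapefruit = Concept(
    uri=BASE + "grapefruit",
    pref_label={"en": "Grapefruit"},
    alt_labels={"en": ["Citrus paradisi"]},
    in_schemes={BASE + "plants"},
    broader={BASE + "citrus", BASE + "fruit"},      # polyhierarchy: two parents
    exact_match={"http://dbpedia.org/resource/Grapefruit"},
)


def narrower_of(parent_uri, concepts):
    """skos:narrower is the inverse of skos:broader, so it can be derived."""
    return {c.uri for c in concepts if parent_uri in c.broader}


print(narrower_of(BASE + "citrus", [grapefruit]))
```

Storing only `broader` and deriving `narrower` keeps the two directions of the hierarchy from drifting apart.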
Linked Data & SKOS
Usage of SKOS in the linked open data cloud
Source: http://www4.wiwiss.fu-berlin.de/lodcloud/state/
1. Thesauri like
   • TheSoz (GESIS)
   • STW (Leibniz)
   • GBA (Geological Survey, A)
   • …
2. SKOS ‚fragments‘ like in
   • DBPedia (categories)
   • New York Times Dataset
   • …
22%
27%
31%
Publishing SKOS as Linked Data
http://vocabulary.semantic-web.at/PoolParty/wiki/semweb
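Publishing a thesaurus as linked data means that each concept URI can be dereferenced to an RDF description. The sketch below renders one concept as Turtle by hand in Python. The SKOS namespace and property names are standard; the concept URI, the labels and the function `to_turtle` are hypothetical, and nothing here is meant to describe how PoolParty itself serialises data.

```python
SKOS_NS = "http://www.w3.org/2004/02/skos/core#"


def to_turtle(uri, pref_labels, broader=()):
    """Render a single SKOS concept as a Turtle snippet.

    Simplification: expects at least one label and does not escape
    quotes or backslashes inside label strings.
    """
    lines = [f"@prefix skos: <{SKOS_NS}> .", "", f"<{uri}> a skos:Concept ;"]
    statements = [
        f'    skos:prefLabel "{label}"@{lang}'
        for lang, label in sorted(pref_labels.items())
    ]
    statements += [f"    skos:broader <{parent}>" for parent in broader]
    lines.append(" ;\n".join(statements) + " .")
    return "\n".join(lines)


print(to_turtle(
    "http://example.org/thesaurus/grapefruit",
    {"en": "Grapefruit", "de": "Grapefruit"},
    broader=["http://example.org/thesaurus/citrus"],
))
```

A linked data server would return such a document when a client requests the concept URI with an RDF media type, and an HTML page when a browser asks for it.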
Generate SKOS from Linked Data sources
Live demo: http://pilot4.poolparty.biz/PoolParty/wiki/Sustainability
http://vocabulary.semantic-web.at/PoolParty/wiki/cocktails
Using SKOS for entity extraction & content enrichment
Live demo: http://poolparty.biz/demozone
http://pilot4.poolparty.biz/extractor/testextractor
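Thesaurus-based entity extraction can be sketched as a lookup of concept labels in running text. The Python example below is a deliberately simple, dictionary-based sketch and not the method used by the extractor demo. The label table is a hypothetical thesaurus excerpt: the two URIs appear earlier in the slides, and the GeoNames URI is assumed here to denote Barbados.

```python
import re

# label -> concept URI; a hypothetical thesaurus excerpt with preferred
# and alternative labels pointing to the same concept.
LABELS = {
    "grapefruit": "http://dbpedia.org/resource/Grapefruit",
    "citrus paradisi": "http://dbpedia.org/resource/Grapefruit",
    "barbados": "http://sws.geonames.org/3374084/",  # assumed to be Barbados
}


def extract(text):
    """Return (matched text, concept URI) pairs in order of appearance."""
    # Longest labels first, so multi-word labels win over shorter ones.
    alternatives = sorted(LABELS, key=len, reverse=True)
    pattern = re.compile(
        r"\b(" + "|".join(map(re.escape, alternatives)) + r")\b",
        re.IGNORECASE,
    )
    return [
        (m.group(0), LABELS[m.group(0).lower()])
        for m in pattern.finditer(text)
    ]


sentence = "The grapefruit is a subtropical citrus tree first bred in Barbados."
for surface, uri in extract(sentence):
    print(surface, "->", uri)
```

Once a text span is tied to a URI, content enrichment amounts to fetching further properties of that URI from linked data sources.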
Collaborative thesaurus management
Live demo: http://bit.ly/J16GWD
SKOS & Linked data alignment (1)
Live demo: http://open.poolparty.punkt.at/PoolParty/
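One simple way to align a thesaurus with linked data is to propose `skos:exactMatch` candidates wherever two concepts share a normalised label. The sketch below shows that idea in Python. Both vocabularies, the `example.org` URI and the function names are hypothetical, and pure label matching is only a first step: ambiguous labels still need disambiguation, for example by comparing broader concepts, before a mapping is accepted.

```python
def normalise(label):
    """Lower-case and collapse whitespace so trivial variants compare equal."""
    return " ".join(label.lower().split())


def propose_matches(scheme_a, scheme_b):
    """Suggest (uri_a, 'skos:exactMatch', uri_b) for concepts sharing a label."""
    index = {}
    for uri, labels in scheme_b.items():
        for label in labels:
            index.setdefault(normalise(label), set()).add(uri)
    matches = set()
    for uri_a, labels in scheme_a.items():
        for label in labels:
            for uri_b in index.get(normalise(label), ()):
                matches.add((uri_a, "skos:exactMatch", uri_b))
    return matches


# Hypothetical local thesaurus and hypothetical labels of LOD resources.
local = {
    "http://example.org/thesaurus/grapefruit": ["Grapefruit", "Citrus paradisi"],
}
lod = {
    "http://dbpedia.org/resource/Grapefruit": ["grapefruit"],
    "http://dbpedia.org/resource/Barbados": ["Barbados"],
}
print(propose_matches(local, lod))
```

The output is a set of candidate mappings for a thesaurus editor to review rather than a final alignment.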
SKOS & Linked data alignment (2)
Live demo: http://bit.ly/semantic_search
Conclusion
Download slides: http://www.slideshare.net/semwebcompany
Try PoolParty: http://poolparty.biz/try-it
Contact
Semantic Web Company GmbH
Mariahilfer Strasse 70/8
1070 Vienna
Austria
http://www.semantic-web.at/ http://poolparty.biz
Andreas Blumauer, [email protected]
http://lod2.eu/
http://lassoproject.org