Europeana and semantic alignment of vocabularies. Antoine Isaac, Jacco van Ossenbruggen, Victor de Boer, Jan Wielemaker, Guus Schreiber. Europeana & Vrije Universiteit Amsterdam. NKOS workshop, Berlin, Sept. 28th 2011

Page 1:

Europeana and semantic alignment of vocabularies

Antoine Isaac

Jacco van Ossenbruggen, Victor de Boer, Jan Wielemaker, Guus Schreiber

Europeana & Vrije Universiteit Amsterdam

NKOS workshop, Berlin, Sept. 28th 2011

Page 2:

?

Page 3:
Page 4:

Towards semantics-enabled search

Building a "semantic layer" to help access content

Page 5:

Prototype: Europeana Thought Lab

http://europeana.eu/portal/thought-lab.html

Page 6:

Semantic search

Page 7:

Towards semantics-enabled search

• Enhance access to Europeana content by semantics
  Query expansion, clustering of results

• Exploiting various types of relations
  "located in", "lived in", "is more specific concept"…

• Semantics are already there
  Thesauri, classifications…

• No need to convert KOSs to formal ontologies for now
  Too much overhead, too difficult to exploit
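
The query expansion mentioned on this slide can be illustrated with a small sketch. It is not Europeana's implementation: the mini-thesaurus, the record identifiers and the function names below are all made up for illustration, and only the "is more specific concept" relation is followed.

```python
from collections import deque

# Hypothetical mini-thesaurus: each concept maps to its more specific concepts.
NARROWER = {
    "animal": ["bird", "mammal"],
    "bird": ["owl", "sparrow"],
    "mammal": [],
    "owl": [],
    "sparrow": [],
}

# Hypothetical records, each indexed with a single subject concept.
RECORDS = {
    "rec1": "owl",
    "rec2": "mammal",
    "rec3": "landscape",
    "rec4": "bird",
}


def expand(concept, narrower):
    """Return the concept plus every concept reachable through 'narrower' links."""
    seen = {concept}
    queue = deque([concept])
    while queue:
        current = queue.popleft()
        for child in narrower.get(current, []):
            if child not in seen:
                seen.add(child)
                queue.append(child)
    return seen


def search(concept, records, narrower):
    """Find records whose subject is the query concept or a more specific one."""
    wanted = expand(concept, narrower)
    return sorted(rec for rec, subject in records.items() if subject in wanted)


def cluster(hits, records):
    """Group the hits by the subject concept that made them match."""
    groups = {}
    for rec in hits:
        groups.setdefault(records[rec], []).append(rec)
    return groups


if __name__ == "__main__":
    hits = search("animal", RECORDS, NARROWER)
    print(hits)
    print(cluster(hits, RECORDS))
```

A plain keyword search for "bird" would only return rec4; following the thesaurus hierarchy also returns the record indexed with "owl", and the same hierarchy gives a natural key for clustering results.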

Page 8:

Towards semantics-enabled search

Building a "semantic layer" to help access content

Page 9:

Interoperability challenges?

• Starting from heterogeneous data sources

• Conversion of data in RDF

• Semantic alignments
  – Between metadata element schemes
    vra:depicts rdfs:subPropertyOf dc:subject
  – Between concepts in value vocabularies
    iconclass:bird skos:closeMatch ddc:bird
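
To see how the two kinds of alignment combine at query time, here is a minimal sketch in plain Python. The two alignment statements are the ones on the slide; the object records (`ex:painting1`, `ex:book1`) and the function names are hypothetical, and a real system would use an RDF store rather than tuples.

```python
# Triples are (subject, predicate, object) tuples using prefixed names.
# These two alignment statements are taken from the slide.
ALIGNMENTS = {
    ("vra:depicts", "rdfs:subPropertyOf", "dc:subject"),
    ("iconclass:bird", "skos:closeMatch", "ddc:bird"),
}

# Hypothetical object metadata from two providers using different schemes.
DATA = {
    ("ex:painting1", "vra:depicts", "iconclass:bird"),
    ("ex:book1", "dc:subject", "ddc:bird"),
}


def sub_properties(prop, triples):
    """Return prop plus the properties directly declared as its sub-properties."""
    return {prop} | {
        s for s, p, o in triples if p == "rdfs:subPropertyOf" and o == prop
    }


def close_matches(concept, triples):
    """Return concept plus its direct skos:closeMatch partners.

    skos:closeMatch is symmetric, so links are read in both directions.
    It is not transitive, so only direct links are followed.
    """
    matches = {concept}
    for s, p, o in triples:
        if p == "skos:closeMatch":
            if s == concept:
                matches.add(o)
            elif o == concept:
                matches.add(s)
    return matches


def find_by_subject(concept, data, alignments):
    """Find objects about a concept, using both kinds of alignment."""
    props = sub_properties("dc:subject", alignments)
    concepts = close_matches(concept, alignments)
    return sorted(s for s, p, o in data if p in props and o in concepts)


if __name__ == "__main__":
    print(find_by_subject("ddc:bird", DATA, set()))       # no alignments
    print(find_by_subject("ddc:bird", DATA, ALIGNMENTS))  # with alignments
```

Without the alignments, a query on `dc:subject ddc:bird` only finds the book; with them, the painting described with `vra:depicts iconclass:bird` is found as well.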

Page 10:

Europeana Data Model

From metadata element scheme

From value vocabulary

Page 11:

Mapping metadata element schemes (to EDM)

(Figure legend: EDM property; “original” property)

Page 12:

Mapping metadata element schemes

• Providers will do it themselves!
  Yes, manually

• Conversion is automatic, but the specification of the conversion is manual
  Mapping file using OWL, SPARQL CONSTRUCT query, whatever XSLT/XMLRDF script…
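
The split between a manual specification and an automatic conversion can be sketched as follows. This is only an illustration in plain Python, not one of the mechanisms listed on the slide: the mapping table stands in for the hand-written specification, `ex:maker` and `ex:shelf` are hypothetical provider properties, and only the `vra:depicts` to `dc:subject` pairing comes from the slides.

```python
# Hand-written mapping specification: original property -> target property.
# Only vra:depicts -> dc:subject appears in the slides; ex:maker is hypothetical.
MAPPING = {
    "vra:depicts": "dc:subject",
    "ex:maker": "dc:creator",
}


def convert(record, mapping):
    """Apply the mapping to one record (a dict of property -> list of values).

    Properties without a mapping are reported rather than silently dropped,
    so the person maintaining the specification can see what is missing.
    """
    converted = {}
    unmapped = []
    for prop, values in record.items():
        target = mapping.get(prop)
        if target is None:
            unmapped.append(prop)
            continue
        converted.setdefault(target, []).extend(values)
    return converted, sorted(unmapped)


if __name__ == "__main__":
    record = {
        "vra:depicts": ["iconclass:bird"],
        "ex:maker": ["Rembrandt"],
        "ex:shelf": ["A1"],
    }
    print(convert(record, MAPPING))
```

Once the table exists, every record of a provider can be converted without further human input; the manual effort is confined to writing and revising the table.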

Page 13:

Building a network of semantic references

Page 14:

A brief story of EuropeanaConnect’s efforts on the “semantic layer”…

Page 15:

Where to start from?

• What is already available as RDF data (Linked or not)

Page 16:

http://www.w3.org/2001/sw/wiki/SKOS/Datasets

Page 17:

http://thedatahub.org/dataset?q=format-skos

Page 18:

Where to start from?

• What is already available as RDF data (Linked or not)

• What can be made available for Europeana by its partners
  Survey in 2009 by HUB

Page 19:

Criteria for selection

• Languages covered

• Institutional/collection adequacy

• Usage adequacy (type of resources)
  Europeana users are interested in places, topics and persons

• “Quality”
  – Conceptual grain and coverage
  – Lexical coverage
  – Completeness and correctness of the semantic relationships

• Licenses

Page 20:

Criteria for selection

• Languages covered

• What is there or what can be reasonably obtained and converted

But it’s getting better!
And our project is also contributing…

Page 21:

Contents (still being updated)

http://semanticweb.cs.vu.nl/europeana/

Page 22:

Building a network of semantic references

Page 23:

Not everything can be aligned

• We can get links from external projects
  MACS, AGROVOC…

• Sometimes in the vocabularies themselves

• But that still does not cover everything

Page 24:

Back to NKOS 2010…

Page 25:

Pivot vocabularies?

• Looks like BS8723 backbone approach

• Hypothesis: an NKOS cloud would look the same
  Not with strict pivots, but still high centrality

[…]

• EuropeanaConnect project will follow a (loose) pivot approach for Europeana’s “semantic layer”

Page 26:

A “soft pivot” strategy?

• Matching smaller and specialized vocabularies to larger and more general pivot ones
  Multi-lingual, wide-coverage and widely adopted vocabularies

• More specialized vocabularies may be used as anchoring points, depending on domains and existing alignments.
  WordNet was retained as a general pivot for topics, but LCSH is widely used in libraries and already (partially) mapped to other vocabularies of that sector.

• A pivot could be several vocabularies with comparable importance and complementary coverage.
  If these vocabularies are well aligned together, it is possible to map one more specialized vocabulary to one or the other.
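
The last point can be made concrete with a small sketch. It assumes two pivots (standing in for WordNet and LCSH) that are aligned with each other, and specialized vocabularies that are each mapped to only one of them. All concept identifiers (`museumvoc:owl`, `libvoc:owls`, `artvoc:chair`, and the pivot-side names) are invented for illustration, and the derived pairs are only candidates: chaining alignments does not guarantee a valid match.

```python
# Hypothetical mappings from specialized concepts to a pivot concept.
SPECIALIZED_TO_PIVOT = {
    "museumvoc:owl": "wordnet:owl",
    "libvoc:owls": "lcsh:Owls",
    "artvoc:chair": "wordnet:chair",
}

# Hypothetical alignment between the two pivot vocabularies.
PIVOT_LINKS = {("wordnet:owl", "lcsh:Owls")}


def pivot_group(concept, pivot_links):
    """Return a pivot concept plus the pivot concepts directly aligned to it."""
    group = {concept}
    for a, b in pivot_links:
        if a == concept:
            group.add(b)
        elif b == concept:
            group.add(a)
    return group


def candidate_links(specialized_to_pivot, pivot_links):
    """Derive candidate links between specialized concepts via the pivots.

    Two specialized concepts are paired when they map to the same pivot
    concept, or to two pivot concepts that are aligned with each other.
    """
    items = sorted(specialized_to_pivot.items())
    candidates = []
    for i, (left, left_pivot) in enumerate(items):
        reachable = pivot_group(left_pivot, pivot_links)
        for right, right_pivot in items[i + 1:]:
            if right_pivot in reachable:
                candidates.append((left, right))
    return candidates


if __name__ == "__main__":
    print(candidate_links(SPECIALIZED_TO_PIVOT, PIVOT_LINKS))
```

Neither specialized vocabulary had to be matched against the other directly: each was aligned once, to whichever pivot suited its domain, and the pivot-to-pivot alignment does the rest.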

Page 27:

Datacloud in the making

Page 28:

Community guidance?

Library Linked Data incubator group on CKAN, graph at http://semantic.ckan.net/group/?group=http://ckan.net/group/lld

Page 29:

Community strategy?

Page 30:

• Hmmm, not there yet

• And maybe it’s alright to have no strategy now?

• Anyway, we had to find something that would bring hope
  E.g., an alignment tool to support a loose strategy, with trial and error
  Different from the less-agile state-of-the-art ontology alignment tools

Page 31:

AMALGAME

Presented at TPDL

http://semanticweb.cs.vu.nl/amalgame

Page 32:

AMALGAME

• Do not try to make the tool smart
  – use simple atomic matching components
  – fast, scalable, with predictable and understandable results

• Empower domain experts by interactive iteration loop
  – try most promising component(s)
  – analyze the results
  – decide on next step:
    • discard results from last attempt
    • refine current results to improve precision
    • look for more results to improve recall
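
One round of such a loop might look like the sketch below. It does not use or reproduce AMALGAME's actual components or API: the function names, the `src:`/`tgt:` concepts and their labels are all hypothetical. It only illustrates the idea of a simple, predictable matcher whose output an expert then inspects, refines or extends.

```python
def normalize(label):
    """Lower-case a label and collapse whitespace."""
    return " ".join(label.lower().split())


def exact_label_match(source, target):
    """Atomic matcher: pair concepts whose normalized labels are identical."""
    index = {}
    for concept, labels in target.items():
        for label in labels:
            index.setdefault(normalize(label), set()).add(concept)
    pairs = set()
    for concept, labels in source.items():
        for label in labels:
            for hit in index.get(normalize(label), ()):
                pairs.add((concept, hit))
    return pairs


def split_by_ambiguity(pairs):
    """Refinement for precision: separate source concepts with a single
    target from those matched to several targets, which need expert review."""
    by_source = {}
    for s, t in pairs:
        by_source.setdefault(s, set()).add(t)
    unambiguous = {(s, t) for s, t in pairs if len(by_source[s]) == 1}
    return unambiguous, pairs - unambiguous


def unmatched(source, pairs):
    """Input for a recall step: source concepts no matcher has covered yet."""
    matched = {s for s, _ in pairs}
    return sorted(set(source) - matched)


# Hypothetical vocabularies: concept identifier -> list of labels.
SOURCE = {"src:1": ["Bank"], "src:2": ["Painting "], "src:3": ["Tile panel"]}
TARGET = {
    "tgt:bank-building": ["bank"],
    "tgt:bank-furniture": ["bank"],
    "tgt:painting": ["painting", "paintings"],
}

if __name__ == "__main__":
    pairs = exact_label_match(SOURCE, TARGET)
    accepted, to_review = split_by_ambiguity(pairs)
    print(sorted(accepted))
    print(sorted(to_review))
    print(unmatched(SOURCE, pairs))
```

Each step is cheap and its result is easy to explain: the expert can accept the unambiguous pairs, look at the ambiguous "bank" matches by hand, and try a looser matcher on the concepts that remain unmatched.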

Page 33:

Amsterdam Museum case, http://semanticweb.cs.vu.nl/lod/am/

Page 34:

Thanks!

[email protected]

Europeana.eu team

EuropeanaConnect WP1 @ HUB
Steffen Hennicke, Marlies Olenski, Stefan Gradmann

EuropeanaConnect project

http://www.europeanaconnect.eu/