notes on thoughtlab / athena wp4 november 13, 2009 antoine isaac [email protected]

30
Notes on ThoughtLab / Athena WP4 November 13, 2009 Antoine Isaac [email protected]

Post on 15-Jan-2016

214 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Notes on ThoughtLab / Athena WP4 November 13, 2009 Antoine Isaac aisaac@few.vu.nl

Notes on ThoughtLab / Athena WP4

November 13, 2009

Antoine Isaac

[email protected]

Page 2: Notes on ThoughtLab / Athena WP4 November 13, 2009 Antoine Isaac aisaac@few.vu.nl

Towards semantics-enabled search

• Enhance access to Europeana content by semantics

• Exploiting different types of relations– locatedIn, isBornIn, created…

• Making use of inference– Finding work showing London for a query on UK

• Rich descriptions are already there, in metadata!• Requires to make it properly machine-accessible

Page 3: Notes on ThoughtLab / Athena WP4 November 13, 2009 Antoine Isaac aisaac@few.vu.nl

Goal: semantics in Europeana v1.0

Building a semantic layer to help accessing content

Stefan Gradmann, EDL D2.5

Page 4: Notes on ThoughtLab / Athena WP4 November 13, 2009 Antoine Isaac aisaac@few.vu.nl

Europeana Thought Lab

http://europeana.eu/portal/thought-lab.html

Page 5: Notes on ThoughtLab / Athena WP4 November 13, 2009 Antoine Isaac aisaac@few.vu.nl

Semantic autocompletion

Page 6: Notes on ThoughtLab / Athena WP4 November 13, 2009 Antoine Isaac aisaac@few.vu.nl

Clustering of results

Page 7: Notes on ThoughtLab / Athena WP4 November 13, 2009 Antoine Isaac aisaac@few.vu.nl

Baseline: matching concepts' label

Page 8: Notes on ThoughtLab / Athena WP4 November 13, 2009 Antoine Isaac aisaac@few.vu.nl

A "more specific Egypte"??

Page 9: Notes on ThoughtLab / Athena WP4 November 13, 2009 Antoine Isaac aisaac@few.vu.nl

A "more specific Egypt"?

Page 10: Notes on ThoughtLab / Athena WP4 November 13, 2009 Antoine Isaac aisaac@few.vu.nl

A concept more specific than the Egypt one

Page 11: Notes on ThoughtLab / Athena WP4 November 13, 2009 Antoine Isaac aisaac@few.vu.nl

Following other relations

Page 12: Notes on ThoughtLab / Athena WP4 November 13, 2009 Antoine Isaac aisaac@few.vu.nl

Following other relations - creator

Page 13: Notes on ThoughtLab / Athena WP4 November 13, 2009 Antoine Isaac aisaac@few.vu.nl

Following other relations - match

Page 14: Notes on ThoughtLab / Athena WP4 November 13, 2009 Antoine Isaac aisaac@few.vu.nl

Following other relations – death place

Page 15: Notes on ThoughtLab / Athena WP4 November 13, 2009 Antoine Isaac aisaac@few.vu.nl

Following other relations – death place

Page 16: Notes on ThoughtLab / Athena WP4 November 13, 2009 Antoine Isaac aisaac@few.vu.nl

Enabling Technologies

• RDF– Uniform format for data– Amenable to sharing and linking

• OWL – Representation of metadata structures– Amenable to inference

• SKOS– Representation of controlled vocabulary– Allows exploitation of legacy knowledge organization

• Simple but precious!• E.g., hierarchical relationships for cluster creation

Page 17: Notes on ThoughtLab / Athena WP4 November 13, 2009 Antoine Isaac aisaac@few.vu.nl

Where are the challenges?

• Semantic conversion of data– Using appropriate data models– Enriching legacy metadata

• Semantic alignments– Between description ontologies

vra:depicts rdfs:subPropertyOf dc:subject

– Between concepts in controlled vocabulariesiconclass:bird skos:closeMatch ddc:bird

Page 18: Notes on ThoughtLab / Athena WP4 November 13, 2009 Antoine Isaac aisaac@few.vu.nl

Alignment of semantic references

Page 19: Notes on ThoughtLab / Athena WP4 November 13, 2009 Antoine Isaac aisaac@few.vu.nl

Where are the challenges?

• Semantic alignment (c'ed)– Find correspondences between large vocabularies– In a multilingual context

Page 20: Notes on ThoughtLab / Athena WP4 November 13, 2009 Antoine Isaac aisaac@few.vu.nl

Athena WP4

Seems to fit very nicely into that challenge

• SKOS & SKOSification• Semantic alignment:

From Marie-Véronique & Johann, Lund

"The Athena Thesaurus = network of Athena-compliant micro-thesauri with bridges in-between"

• Focus on multilingual resources

Page 21: Notes on ThoughtLab / Athena WP4 November 13, 2009 Antoine Isaac aisaac@few.vu.nl

What kind of semantic alignment?

• Fundamental goal:– enhancing semantic interoperability of collections– via the KOSs used for describing them

• Several options…

Page 22: Notes on ThoughtLab / Athena WP4 November 13, 2009 Antoine Isaac aisaac@few.vu.nl

Structural models for interoperability(British Standard BS8723)

1. Unified structure: one KOS

2. Pairwise relations

3. Backbone structure

VocA

VocB

VocC

VocD

VocB

VocC

VocD

VocA

Page 23: Notes on ThoughtLab / Athena WP4 November 13, 2009 Antoine Isaac aisaac@few.vu.nl

Structural models for interoperability

ThoughtLab "data cloud"• Not really corresponding to best practice

– More like a "web of data" cloud

• But still, a couple of backbone/central nodes– Again, like a "web of data" cloud

Page 24: Notes on ThoughtLab / Athena WP4 November 13, 2009 Antoine Isaac aisaac@few.vu.nl
Page 25: Notes on ThoughtLab / Athena WP4 November 13, 2009 Antoine Isaac aisaac@few.vu.nl

• At some point, we have to deal with what is there• Especially if it's much better than nothing!

Page 26: Notes on ThoughtLab / Athena WP4 November 13, 2009 Antoine Isaac aisaac@few.vu.nl

Goals of Athena WG4?

• Athena Integrated Thesaurus • or Athena Thesaurus Network?

VocA

VocB

VocC

VocD

AthenaThesaurus

VocC

VocD

VocA

Page 27: Notes on ThoughtLab / Athena WP4 November 13, 2009 Antoine Isaac aisaac@few.vu.nl

Throwing away integrated thesaurus?

• Individual manual mappings can already be exploited– Dumping them in the semantic layer will bring interesting stuff

• Keeping original vocabularies as access points can be an asset

• But a backbone for museum KOSs is likely to bring more– Especially as an umbrella for all those small controlled lists!

• An unified multilingual thesaurus is always extremely precious to have

Page 28: Notes on ThoughtLab / Athena WP4 November 13, 2009 Antoine Isaac aisaac@few.vu.nl

Throwing away integrated thesaurus?

Thesaurus integration can be used as a driving scenario

• Issue: mapping without application in mind is tricky– What's the "meaning" of a concept?– archeology; netherlands can perfectly be mapped to excavations for translation of book annotations at KB

• Thesaurus integration can provide with mapping criteria– Two concepts are equivalent if we can fit them in the same place

of a semantic network

Page 29: Notes on ThoughtLab / Athena WP4 November 13, 2009 Antoine Isaac aisaac@few.vu.nl

Wishlist?

• Again, do not forget that intermediate results (individual mappings) can be very precious

• If you produce them as part of the process anyway, there should be a way to export them– As SKOS?

• Problem: ideally, this would require SKOS versions of the individual "micro-thesauri"– Is that planned?

Page 30: Notes on ThoughtLab / Athena WP4 November 13, 2009 Antoine Isaac aisaac@few.vu.nl

Thanks!