europeana and open data robina clayphan interoperability manager, europeana ldbc tuc meeting, 19...

32
Europeana and Open Data Robina Clayphan Interoperability Manager, Europeana LDBC TUC meeting, 19 November, 2013

Upload: sabina-mccormick

Post on 27-Dec-2015

219 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: Europeana and Open Data Robina Clayphan Interoperability Manager, Europeana LDBC TUC meeting, 19 November, 2013

Europeana and Open DataRobina Clayphan

Interoperability Manager, Europeana

LDBC TUC meeting, 19 November, 2013

Page 2: Europeana and Open Data Robina Clayphan Interoperability Manager, Europeana LDBC TUC meeting, 19 November, 2013

What is Europeana?

• Europeana is a service that brings together digital content from across the cultural heritage domain in Europe

• It makes the metadata freely available

• It is a catalyst for change in the world of cultural heritage.

• Our vision: We believe in making cultural heritage openly accessible in a digital way, to promote the exchange of ideas and information.

Page 3: Europeana and Open Data Robina Clayphan Interoperability Manager, Europeana LDBC TUC meeting, 19 November, 2013

Europeana.eu, Europe’s cultural heritage portal

Museums

National Aggregators

Regional Aggregators

Archives

Thematic collections

Libraries

- A network of participants in development and innovation- Nearly 30 million objects from 2,400 European galleries, museums, archives

and libraries

Page 4: Europeana and Open Data Robina Clayphan Interoperability Manager, Europeana LDBC TUC meeting, 19 November, 2013

What types of objects does Europeana give access to?

Text Image Video Sound 3D

Page 5: Europeana and Open Data Robina Clayphan Interoperability Manager, Europeana LDBC TUC meeting, 19 November, 2013

Europeana and open data

Page 6: Europeana and Open Data Robina Clayphan Interoperability Manager, Europeana LDBC TUC meeting, 19 November, 2013

What Europeana makes available

Metadata

Link to digital objects online

Page 7: Europeana and Open Data Robina Clayphan Interoperability Manager, Europeana LDBC TUC meeting, 19 November, 2013

Metadata (descriptive object information)

Different options:Open – not fully open (but clear) – Not open

Two categories of rights

CC

Page 8: Europeana and Open Data Robina Clayphan Interoperability Manager, Europeana LDBC TUC meeting, 19 November, 2013

The Europeana Data Model

Page 9: Europeana and Open Data Robina Clayphan Interoperability Manager, Europeana LDBC TUC meeting, 19 November, 2013

EDM requirements & principles

1. Distinction between “provided objects” (painting, book, movie, etc.) and their digital representations

2. Distinction between objects and metadata records describing an object

3. Allow for multiple records for a same object, containing potentially contradictory statements about it

4. Support for objects that are composed of other objects

5. Support for contextual resources, including concepts from controlled vocabularies

Richer metadata with finer granularity

Page 10: Europeana and Open Data Robina Clayphan Interoperability Manager, Europeana LDBC TUC meeting, 19 November, 2013

Provide more semantics to the data

Build a semantic layer on top of Cultural Heritage objects

Page 11: Europeana and Open Data Robina Clayphan Interoperability Manager, Europeana LDBC TUC meeting, 19 November, 2013

EDM Classes

Page 12: Europeana and Open Data Robina Clayphan Interoperability Manager, Europeana LDBC TUC meeting, 19 November, 2013

ore:Aggregation(Identifier of aggregation)

edm:WebResource(Identifier of web resource)

edm:ProvidedCHO(Identifier of real object)

An aggregation with a provided CHO and a web resource

The three core classes

edm:aggregatedCHO

edm:hasView

Page 13: Europeana and Open Data Robina Clayphan Interoperability Manager, Europeana LDBC TUC meeting, 19 November, 2013

The Aggregation with metadata

Page 14: Europeana and Open Data Robina Clayphan Interoperability Manager, Europeana LDBC TUC meeting, 19 November, 2013

Properties for the Aggregation

Mandatory:

edm:aggregatedCHO

edm:dataProvider

edm:isShownBy or

edm:isShownAt

edm:provider

edm:rights

Optional:

edm:hasView

edm:object

dc:rights

edm:ugc

The aggregation represents the set of related resources about one real object contributed by one provider. It carries the metadata that is about the whole set

Page 15: Europeana and Open Data Robina Clayphan Interoperability Manager, Europeana LDBC TUC meeting, 19 November, 2013

Properties for the ProvidedCHO

The ProvidedCHO is the cultural heritage object which is the subject of the package of data that has been submitted to Europeana.

Properties: dc:contributor, dc:coverage, dc:creator, dc:date, dc:description, dc:format, dc:identifier, dc:language, dc:publisher, dc:relation, dc:rights, dc:source,dc:subject, dc:title, dc:type, dcterms:alternative, dcterms:extent, dcterms:temporal, dcterms:medium, dcterms:created, dcterms:provenance, dcterms:issued, dcterms:conformsTo, dcterms:hasFormat, dcterms:isFormatOf, dcterms:hasVersion, dcterms:isVersionOf, dcterms:hasPart, dcterms:isPartOf, dcterms:isReferencedBy, dcterms:references, dcterms:isReplacedBy, dcterms:replaces dcterms:isRequiredBy, dcterms:requires dcterms:tableOfContents

edm:isNextInSequence

edm:isDerivativeOf

edm:currentLocation…

Page 16: Europeana and Open Data Robina Clayphan Interoperability Manager, Europeana LDBC TUC meeting, 19 November, 2013

Properties for the web resource

One or more digital representations of the provided cultural heritage object.

dc:description dc:format dc:rights dc:sourcedcterms:conformsTo dcterms:createddcterms:extent dcterms:hasPart dcterms:isFormatOf dcterms:isPartOf dcterms:issuededm:isNextInSequence edm:rights

Page 17: Europeana and Open Data Robina Clayphan Interoperability Manager, Europeana LDBC TUC meeting, 19 November, 2013

EDM Classes

Page 18: Europeana and Open Data Robina Clayphan Interoperability Manager, Europeana LDBC TUC meeting, 19 November, 2013

Contextual classes

Representing (real-world) entities related to a provided object

as fully fledged resources, not just strings

edm:Agent

foaf:name

skos:altLabel

rdaGr2:biographicalInformation

rdaGr2:dateOfBirth….

skos:Concept

skos:prefLabel

skos:altLabel

skos:broader

skos:definition….

edm:TimeSpan skos:prefLabel

dcterms:isPartOf

edm:begin

edm:end….

edm:Placewgs84_pos:lat

wgs84_pos:long

skos:prefLabel

dcterms:isPartOf….

Page 19: Europeana and Open Data Robina Clayphan Interoperability Manager, Europeana LDBC TUC meeting, 19 November, 2013

Example of a CHO with two contextual classes

dc:creator

dc:subject

Page 20: Europeana and Open Data Robina Clayphan Interoperability Manager, Europeana LDBC TUC meeting, 19 November, 2013

Accessing and re-using Europeana data

Page 21: Europeana and Open Data Robina Clayphan Interoperability Manager, Europeana LDBC TUC meeting, 19 November, 2013

How do users access Europeana content?

Europeana aims to provide content in the users’ workflow – where they want it, when they want it.

User focused channels: Europeana.eu portal, social media exports

For programmers: API, search widget, semantic mark up, LOD pilot

Page 22: Europeana and Open Data Robina Clayphan Interoperability Manager, Europeana LDBC TUC meeting, 19 November, 2013

Europeana’s infrastructure is open for re-use

Europeana data available via

API

Search widgets

Semantic mark-up (schema.org) on portal

Linked Open Data pilot

http://pro.europeana.eu/api

http://data.europeana.eu

Page 23: Europeana and Open Data Robina Clayphan Interoperability Manager, Europeana LDBC TUC meeting, 19 November, 2013

Some (approximate) numbers

Europeana database – 30 Million objects

LOD pilot – a subset of 20 Million objects

• contained nearly 1 Billion RDF explicit statements

• 4 Billion once you do all the RDF reasoning (sub-properties, sub-classes, etc) in OWLIM

• Ontotext has already loaded a chunk of data and is working on the update of it, in Europeana Creative.

Page 24: Europeana and Open Data Robina Clayphan Interoperability Manager, Europeana LDBC TUC meeting, 19 November, 2013

Possible benchmarking queries?

Queries for exploring the dataset

• e.g. to generate the complete ordered list of Europeana aggregators and the data providers they gather

Queries for exploring the objects

• e.g. a list of works with a matching location/creator/title

• Simple graph traversal

Expressing EDM constraints (that cannot be done in OWL)

• Can RDF validation help e.g where at least one of two properties must be present (title or description)?

Queries to assist in data quality improvement

• Broken links, duplicates (or near duplicates), missing mandatory properties, missing thumbnails etc etc

For Information: We are starting a data quality task force if you are interested!

Page 25: Europeana and Open Data Robina Clayphan Interoperability Manager, Europeana LDBC TUC meeting, 19 November, 2013

Useful links

Europeana portal europeana.eu

Europeana Professional pro.europeana.eu

• EDM documentation http://pro.europeana.eu/edm-documentation

• Europeana API http://www.europeana.eu/portal/api-introduction.html

• LOD pilot http://data.europeana.eu

Data Quality task force – [email protected]

Europeana Professional blog pro.europeana.eu/blog

Facebook facebook.com/Europeana

Twitter twitter.com/EuropeanaEU

Europeana Thought Lab pro.europeana.eu/thoughtlab/

Europeana end-user blog blog.europeana.eu/

Page 26: Europeana and Open Data Robina Clayphan Interoperability Manager, Europeana LDBC TUC meeting, 19 November, 2013

Thank you

Robina Clayphan

[email protected]

Page 27: Europeana and Open Data Robina Clayphan Interoperability Manager, Europeana LDBC TUC meeting, 19 November, 2013

Bonus slides!

Page 28: Europeana and Open Data Robina Clayphan Interoperability Manager, Europeana LDBC TUC meeting, 19 November, 2013

EDM design requirements

Compatibility with different levels of description

• Allow different levels of granularity

• A book, a page, a detail of an image

Standard metadata format that can be specialized

• Allow the specification of domain specific application profiles

• Enable the re-use of existing standards

• Allow the extension of the initial model

Page 29: Europeana and Open Data Robina Clayphan Interoperability Manager, Europeana LDBC TUC meeting, 19 November, 2013

EDM basis

OAI ORE (Open Archives Initiative Object Reuse & Exchange) for organizing an object’s metadata and digital representation(s)

Dublin Core for descriptive metadata

SKOS (Simple Knowledge Organization System) for conceptual vocabulary representation

CIDOC-CRM for the modeling of event and relationships between objects

Use the Semantic Web representation principles• RDF

• Re-use and mix different vocabularies together

• Preserve original data and still allow for interoperability

Page 30: Europeana and Open Data Robina Clayphan Interoperability Manager, Europeana LDBC TUC meeting, 19 November, 2013

EDM Properties (excluding ESE)

Page 31: Europeana and Open Data Robina Clayphan Interoperability Manager, Europeana LDBC TUC meeting, 19 November, 2013

Two providers and two aggregations(the same object)

31

aggregation of DMF

aggregation of Louvre

v

provenancemetadata

provenancemetadata

Cultural heritage object

Page 32: Europeana and Open Data Robina Clayphan Interoperability Manager, Europeana LDBC TUC meeting, 19 November, 2013

Europeanaaggregation

Enriched metadata

Landing page