Transcript
Page 1: On Libraries & Linked Data Antoine Isaac UB Utrecht, April 6, 2011

On Libraries & Linked Data

Antoine Isaac

UB Utrecht, April 6, 2011

Page 2:

Who am I?

• Europeana
• Web & Media Lab, Vrije Universiteit Amsterdam
• W3C Library Linked Data group
• (2006-2009) W3C Semantic Web Deployment group

SKOS

[email protected]

Page 3:

Demo

Following one’s nose to subject heading lists as linked data

• American LCSH: http://id.loc.gov/authorities/sh85145447#concept
• French RAMEAU: http://stitch.cs.vu.nl/vocabularies/rameau/ark:/12148/cb11931913j
• German SWD: http://d-nb.info/gnd/4064689-0
• Agrovoc: http://aims.fao.org/aos/agrovoc/c_8309
• STW: http://zbw.eu/stw/descriptor/14188-0
• Further on to DBPedia: http://dbpedia.org/resource/Water

Page 4:

Demo (fallback option)

Subject heading lists as SKOS linked data
• American LCSH: http://id.loc.gov
• French RAMEAU: http://stitch.cs.vu.nl/rameau
• German SWD: http://d-nb.info/gnd/
• Mapped using manual links from the MACS project: http://macs.cenl.org

Starting from http://id.loc.gov/authorities/sh85014310#concept

Page 10:

Linked Data?

1. Use URIs as names for things
2. Use HTTP URIs so that people can look up those names
3. When someone looks up a URI, provide useful information using standards (RDF, SPARQL)
4. Include links to other URIs, so that they can discover more things

Tim Berners-Lee, http://linkeddata.org/
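
As a minimal sketch of principles 2 and 3, the snippet below prepares an HTTP lookup for one of the demo URIs and asks for RDF instead of an HTML page. The helper name `build_lookup_request` is hypothetical, and the assumption that the server honours an `Accept: application/rdf+xml` header is not confirmed by the slides. The request is only built here, not sent.

```python
from urllib.parse import urldefrag
from urllib.request import Request


def build_lookup_request(uri: str, media_type: str = "application/rdf+xml") -> Request:
    """Prepare (but do not send) an HTTP GET asking for RDF about `uri`."""
    # The fragment (e.g. "#concept") is never sent to the server, so drop it.
    document_url, _fragment = urldefrag(uri)
    # The Accept header asks for RDF rather than a human-readable page
    # (assumption: the server supports this kind of content negotiation).
    return Request(document_url, headers={"Accept": media_type})


request = build_lookup_request("http://id.loc.gov/authorities/sh85145447#concept")
print(request.full_url)
print(request.get_header("Accept"))
```

Passing `request` to `urllib.request.urlopen` would perform the actual lookup; what comes back depends on the server.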

Page 11:

(Linked) Data Representation

• That subject heading data follows a link-intensive data model
  – Uniform Resource Identifiers (URI)
  – Resource Description Framework (RDF)

Page 12:

(Linked) Data Representation

• Use more-or-less the same standard vocabulary
  – Simple Knowledge Organization System (SKOS): http://www.w3.org/2004/02/skos/
  – For representing thesauri, classifications, etc. on the Semantic Web

Page 13:

A SKOS graph

animals

cats
  UF domestic cats
  RT wildcats
  BT animals
  SN used only for domestic cats

domestic cats
  USE cats

wildcats
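
The thesaurus entries on this slide can be sketched as a small set of SKOS triples. This is an illustration only: the `http://example.org/thesaurus/` namespace and the helper names are hypothetical, and the UF/USE pair is modelled as a `skos:altLabel` on the "cats" concept.

```python
SKOS = "http://www.w3.org/2004/02/skos/core#"
EX = "http://example.org/thesaurus/"  # hypothetical namespace, not from the slides

# (subject, predicate, object) triples for the entries on the slide.
triples = {
    (EX + "animals", SKOS + "prefLabel", "animals"),
    (EX + "cats", SKOS + "prefLabel", "cats"),
    (EX + "cats", SKOS + "altLabel", "domestic cats"),  # UF / USE
    (EX + "cats", SKOS + "related", EX + "wildcats"),   # RT
    (EX + "cats", SKOS + "broader", EX + "animals"),    # BT
    (EX + "cats", SKOS + "scopeNote", "used only for domestic cats"),  # SN
    (EX + "wildcats", SKOS + "prefLabel", "wildcats"),
}


def objects(subject, predicate):
    """All values of `predicate` for `subject`, sorted for stable output."""
    return sorted(o for s, p, o in triples if s == subject and p == predicate)


def find_concept(label):
    """Resolve a preferred or alternative label to its concept URI(s)."""
    label_properties = {SKOS + "prefLabel", SKOS + "altLabel"}
    return sorted(s for s, p, o in triples if p in label_properties and o == label)


print(objects(EX + "cats", SKOS + "broader"))
print(find_concept("domestic cats"))
```

Looking up "domestic cats" leads to the same concept as "cats", which is what the USE reference expresses in the printed thesaurus.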

Page 14:

SKOS mappings

SKOS provides conceptual links to bridge across different contexts

KOS 1: animals, cats, wildcats

KOS 2: animal, human, object
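
A rough sketch of how such mapping links could be stored and used follows. The slide does not say which concepts are linked, so the three links below, the `kos1:`/`kos2:` identifiers and the function name are assumptions made for illustration. `skos:exactMatch` and `skos:broadMatch` are real SKOS mapping properties.

```python
# Hypothetical mapping links between the two schemes named on the slide.
MAPPINGS = [
    ("kos1:animals", "skos:exactMatch", "kos2:animal"),
    ("kos1:cats", "skos:broadMatch", "kos2:animal"),      # "animal" is broader than "cats"
    ("kos1:wildcats", "skos:broadMatch", "kos2:animal"),
]


def mapped_from(target, relations=("skos:exactMatch", "skos:broadMatch")):
    """Return the KOS 1 concepts linked to `target` by one of `relations`."""
    return sorted(s for s, p, o in MAPPINGS if o == target and p in relations)


print(mapped_from("kos2:animal"))
print(mapped_from("kos2:animal", relations=("skos:exactMatch",)))
```

A user searching with the KOS 2 concept "animal" could then also reach material indexed with the KOS 1 concepts returned here.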

Page 15:

Links in the data

Page 16:

Links in the data

Page 17:

Growing interest for linked data in the library community

Page 18:

Linked Library Cloud beginning 2008

[Ross Singer, Code4Lib2010]

http://code4lib.org/conference/2010/singer

Page 19:

Linked Library “sector” in 2010

Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/

Page 20:

Libraries and LD, the perfect match?

• Libraries have been producing (meta)data for ages
• Libraries (often) produce high-quality metadata

Page 21:

Libraries and LD, the perfect match?

• Library metadata was locked in record silos
• But it maintains links to the outside world
  – Bibliographic and web references
  – Shared vocabularies
  – Same books!

Page 22:

Libraries and LD, the perfect match?

LD is about
• Citing objects
• Linking to them
• Re-using data

Think of web-native union catalogues

Page 23:

Johan Stapel, Koninklijke Bibliotheek (now bibliotheek.nl)

A vision for the Dutch National Library

Page 24:

A web of cultural heritage data?

?

Page 25:

?

Page 26:

The current portal

Page 28:

Towards semantic search: facets

Page 29:

Building a search engine on top of metadata is difficult
  – Intrinsic quality problems: correctness, coverage

Especially when data is so heterogeneous
  – 100s of formats
  – From flat 5-field records to 100-node XML trees
  – Language issue!

We currently use a simple, flat interoperability format
  – Quick win quickly showing its limits

Page 30:

We can better use institutions’ original metadata

Accommodate their different practices
  – Data structures and semantics

Access objects via a semantic layer of vocabularies for subjects, persons, places…

Semantic ThoughtLab: experimenting with solutions

Page 31:

Towards semantics-enabled search

Building a "semantic layer" to help access content

Page 32:

Towards semantics-enabled search

• Enhance access to Europeana content by semantics
  – Query expansion, clustering of results
• Exploiting various types of relations
  – "located in", "lived in", "is more specific concept"…
• Semantics are already there, in metadata and "controlled vocabularies" used in metadata
  – Thesauri, classifications…
• This requires making them properly machine-accessible
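
Query expansion over an "is more specific concept" relation can be sketched roughly as below. This is a toy illustration, not Europeana's implementation: the Egypte/Giza pair comes from the prototype slides, while the record identifiers, the "Delft" subject and the function names are hypothetical.

```python
from collections import deque

# concept -> directly narrower concepts ("Giza" narrower than "Egypte")
NARROWER = {"Egypte": ["Giza"], "Giza": []}

# Hypothetical records, each indexed with controlled subject terms.
RECORDS = {
    "photo-1": {"Giza"},
    "statue-7": {"Egypte"},
    "vase-3": {"Delft"},
}


def expand(concept):
    """Return `concept` plus everything reachable through narrower links."""
    seen = {concept}
    queue = deque([concept])
    while queue:
        current = queue.popleft()
        for child in NARROWER.get(current, []):
            if child not in seen:  # guards against cycles in the vocabulary
                seen.add(child)
                queue.append(child)
    return seen


def search(concept):
    """Records indexed with `concept` or any of its narrower concepts."""
    wanted = expand(concept)
    return sorted(rid for rid, subjects in RECORDS.items() if subjects & wanted)


print(search("Egypte"))
print(search("Giza"))
```

A query for "Egypte" also returns the record that is only indexed with "Giza", which a plain string match on "Egypte" would miss.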

Page 33:

Europeana Data Model

Trying to evolve towards RDF and Linked Data
• Representing objects, persons, places, etc. as resources
• Linking and re-using external sources
• (Re-using) richer data modeling features
  – SKOS, CIDOC-CRM, OAI-ORE
• Enabling domain-specific data profiles
• Separating original data from enrichments

http://version1.europeana.eu/web/europeana-project/technicaldocuments/

Page 34:

Prototype: Europeana Thought Lab

http://europeana.eu/portal/thought-lab.html

Page 35:

Clustering of results

Page 36:

Baseline: matching concepts' labels

Controlled place name from a vocabulary at the Rijksmuseum

Metadata for the object

Page 37:

A "more specific Egypte"?

Page 38:

A "more specific Egypte"?

Metadata for the object

Page 39:

A place more specific than the Egypt one

Semantic information on the Giza place in the Rijksmuseum Vocabulary

Page 40:

Following other relations

Page 41:

Following other relations - creator

Metadata for the object

Controlled person name from a vocabulary at the Rijksmuseum

Page 42:

Following other relations - match

Information on Gustave Le Gray from the Rijksmuseum Vocabulary

Matched to a "Gustave Le Gray" from another Vocabulary

Page 43:

Enabling bits & pieces

Exploiting semantic links in CH vocabularies
  – Concept “Giza” narrower than concept “Egypte”

Mapping/alignment between CH vocabularies
  – Louvre’s “Égypte” equivalent to Rijksmuseum’s “Egypte”

Enrichment of existing metadata
  – The string “Egypt” in a metadata record indicates the concept of Egypt defined in Rijksmuseum thesaurus
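
The enrichment step can be sketched as a label lookup that turns a plain string into a concept reference and then follows an equivalence link. Everything named here is hypothetical: the concept identifiers, the record fields, and the assumption that the thesaurus carries an English label "Egypt" next to "Egypte". The result keeps the original record apart from the added links.

```python
# Hypothetical excerpt of a thesaurus: concept id -> known labels.
LABELS = {
    "rijksmuseum:egypte": ["Egypte", "Egypt"],  # English label is an assumption
    "rijksmuseum:giza": ["Giza"],
}

# Hypothetical equivalence links to another institution's vocabulary.
EXACT_MATCH = {"rijksmuseum:egypte": ["louvre:egypte"]}

# Case-insensitive index from label to concept id.
LABEL_INDEX = {
    label.casefold(): concept
    for concept, labels in LABELS.items()
    for label in labels
}


def enrich(record):
    """Link string values to concepts, leaving the original record untouched."""
    enrichments = []
    for field, value in record.items():
        concept = LABEL_INDEX.get(value.casefold())
        if concept is not None:
            enrichments.append({
                "field": field,
                "concept": concept,
                "same_as": EXACT_MATCH.get(concept, []),
            })
    return {"original": record, "enrichments": enrichments}


result = enrich({"title": "View of the pyramids", "coverage": "Egypt"})
print(result["enrichments"])
```

Exact label matching is only a baseline; ambiguous or misspelled strings would need additional handling.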

Page 44:

Challenge #1: Linking

Page 45:

Challenge #1: Linking

Manual mapping of large vocabularies is labour-intensive
• LCSH, RAMEAU and SWD mapped in the MACS project: http://macs.cenl.org
• SWD and DDC mapped in the CRISS-CROSS project: http://linux2.fbi.fh-koeln.de/crisscross/

Automatic linking is not perfect but can help
• STW, AGROVOC…
• Some studies (and further pointers) for automatic library thesaurus alignment in the STITCH project: http://stitch.cs.vu.nl
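
One very simple form of automatic linking is to compare normalized labels across two vocabularies. The sketch below is a generic baseline, not the method used in STITCH or any of the projects above; the concept identifiers and the "Grèce" entry are hypothetical. Its output should be read as candidate links that still need review.

```python
import unicodedata


def normalize(label):
    """Lower-case a label and strip accents, so "Égypte" and "Egypte" compare equal."""
    decomposed = unicodedata.normalize("NFKD", label)
    without_accents = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    return without_accents.casefold().strip()


def align(vocab_a, vocab_b):
    """Propose (concept_a, concept_b) pairs whose normalized labels coincide."""
    index = {}
    for concept, labels in vocab_b.items():
        for label in labels:
            index.setdefault(normalize(label), set()).add(concept)
    candidates = set()
    for concept, labels in vocab_a.items():
        for label in labels:
            for match in index.get(normalize(label), ()):
                candidates.add((concept, match))
    return sorted(candidates)


# Hypothetical vocabulary excerpts: concept id -> labels.
louvre = {"louvre:egypte": ["Égypte"], "louvre:grece": ["Grèce"]}
rijksmuseum = {"rijksmuseum:egypte": ["Egypte"], "rijksmuseum:giza": ["Giza"]}

print(align(louvre, rijksmuseum))
```

Label matching of this kind misses translations and can be fooled by homonyms, which is one reason automatic linking "is not perfect".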

Page 46:

Challenge #1: Linking

• (Semi-)automatic techniques are necessary to
  – Connect objects to vocabularies (esp. for legacy data)
  – Connect objects themselves together
• Crowdsourcing?
• Making the way librarians create metadata evolve?

Page 47:

Linking strategy for libraries?

Page 48:

Linking strategy for libraries?

• Links to library-originated sources
  – VIAF, LCSH, DDC, UDC, Worldcat, PND…
• Links to resources from cultural environment
  – Museums, archives
  – Scientific communities: bibliographic data & research data
  – Publishers
  – Europeana and other aggregators

Page 49:

Semantic Annotation

Page 50:

Conclusion?

• Linked Data won’t solve everything right now

• Just a set of techniques and a vision for better sharing, cross-linking and re-use of data, fitting the web

• Which is not bad!

Page 51:

If we stop here, thanks for your attention!

Any (more) questions?

Page 52:

Some references

Page 53:

W3C Library LD Incubator

http://www.w3.org/2005/Incubator/lld
• 1-year group
• OCLC, LC, VU Amsterdam, DNB, etc.
• help increase global interoperability of library data on the Web
• bringing together people involved in Linked Data, in the library community and beyond
• building on existing initiatives and collaboration tracks for the future

Page 54:

Library LD Use Cases

• LLD use cases and case studies (work in progress): http://www.w3.org/2005/Incubator/lld/wiki/UseCases
• JISC cases for open bibliographic data: http://obd.jisc.ac.uk

Page 55:

Useful vocabularies to express data
• Dublin Core: dublincore.org/
• SKOS: www.w3.org/2004/02/skos/
• BIBO: bibliontology.com/
• OAI-ORE: www.openarchives.org/ore/
• FOAF: www.foaf-project.org/
• MADS: www.loc.gov/standards/mads/rdf/

In progress
• RDA vocabularies: metadataregistry.org/rdabrowse.htm
• FRBR@IFLA

Cf. Linked Open Vocabularies: labs.mondeca.com/dataset/lov/

Note: vocabularies can be combined and articulated together

Page 56:

Datasets

• Controlled vocabularies (thesauri, etc.)
  – LCSH, DDC, Agrovoc, VIAF, GND
• Bibliographic data
  – Nat. Libraries of Hungary, Sweden
• Trying to keep track of some on CKAN: http://ckan.net/group/lld

Page 57:

In the Netherlands

• DEN, Bibliotheek.nl, KB, Vrije Universiteit Amsterdam, Beeld en Geluid, UvA Library

• Amsterdam Museum as Linked Data: http://semanticweb.cs.vu.nl/lod/am/

• Dutch Culture Link: http://sites.google.com/site/dclod11/

• Dublin Core 2011: http://dcevents.dublincore.org/index.php/IntConf/dc-2011

Page 58:

Pictures
• http://www.europeana.eu/portal/record/03903/8C5C6AEFF6B50DCCEDF6A23A99DD3A2D66AEB2CC.html
• http://www.europeana.eu/portal/record/03912/E9666896A50FDDE5F7F15A17C11219A7FBCBBC50.html

(Europeana links give access to resources on original sites)

Page 59:

First Demo pointers

• American LCSH: http://id.loc.gov
• French RAMEAU: http://stitch.cs.vu.nl/rameau
• German SWD: http://d-nb.info/gnd/
• Agrovoc: http://aims.fao.org/
• STW: http://zbw.eu/stw/
• DBPedia: http://dbpedia.org/

