session 3: vocabulary enrichment, gerda koch
DESCRIPTION
LoCloud Content Provider workshops Aug-Sept 2013TRANSCRIPT
local content in a Europeana cloud
Session 3: Vocabulary enrichment
Gerda Koch, [email protected] AIT Angewandte Informationstechnik
Forschungsgesellschaft mbH
LoCloud is funded by the European Commission's ICT Policy Support Programme
WP3: Micro services for small and medium institutions• establishing a cloud-based collaborative testing
environment for tools and services
• developing cloud-based SaaS services (Software as a Service) and applications suitable for use by small and medium institutions
• providing the basis for a continuing process of participative testing and validation of each of the services and applications
Introduction
WP3: Service framework
• Geolocation enrichment
• Metadata enrichment
• Vocabularies and languages
• Historic place names
• Wikimedia applications
Introduction
This presentation will provide you:
A brief introduction to …
1. Web Services
2. Vocabulary Standards used by Task 3.4
3. Vocabulary Management Tool (Sample)
Contents
What is a Web Service?
A web service is a software function provided at a network address over the web or the cloud (24/7)
• are application components• communicate using open protocols• are self-contained and self-describing• can be used by other applications• XML is the basis for Web services
1 Web Services
Query a Web Service offered online
Results: Rivers in GermanyTGN – Getty Thesaurus of Geographic Names
Query
Result
1 Web Services
Xataface: editing a record
Integrate a Web Service in a local application
Which
Descriptors
should I use?
1 Web Services
Integrate a Web Service in a local application
The vocabulary
webservice is
directly addressed
within the entry field
of the application.
The user chooses
the vocabulary terms
that are taken over
into the application.
(auto-suggest)
Results: Music GenresDISMARC Genres Vocabulary
1 Web Services
Task 3.4: Vocabularies and languages
• Experimental application to enable local cultural institutions to collaborate in the development of multilingual vocabularies for local history and archaeology
2 Vocabularies
The application will be based on:
ISO 25964: standard for building thesauri• Part 1: Thesauri for information retrieval
- published in 2011
- developing a thesaurus (mono- andmultilingual)
- replaced previous standards ISO 2788/5964- includes data model and XML schema
• Part 2: Interoperability with other vocabularies- published in 2013
- recommendations for the establishment and maintenance ofmappings between multiple thesauri, or between thesauri and
other types of vocabularies
2 Vocabularies
The application will use SKOS as exchange format:SKOS Simple Knowledge Organisation System• is a W3C recommendation designed for
representation of controlled vocabularies• main objective is to enable easy publication and use
of such vocabularies as linked data
2 Vocabularies
How this two relate…
• The SKOS metamodel is broadly compatible with the data model of ISO 25964-1 - Thesauri for Information Retrieval.• ISO 25964-1 advises on the selection and fitting together
of concepts, terms and relationships to make a good thesaurus
• SKOS addresses the next step - porting the thesaurus to the Web.
2 Vocabularies
Using SKOS, concepts can be identified using URIs, labeled with lexical strings in one or more natural languages…
The SKOS Core Vocabulary is an application of the Resource Description Framework (RDF), that can be used to express a concept scheme as an RDF graph. Using RDF allows data to be linked to and/or merged with other data, enabling data sources to be distributed across the web, but still be meaningfully composed and integrated.
2 Vocabularies
Vocabulary examples (SKOS Format)
• DDC Dewey Decimal Classification• Library of Congress’ vocabularies• VIAF person authorities• UKAT UK Archival Thesaurus• UNESCO Thesaurus• …..
Example….
2 Vocabularies
C00213URI
C00206URI C00207
URI
skos:prefLabel
skos:prefLabel
skos:narrowerskos:related
skos:prefLabel
skos:prefLabel
Archaeology
Arqueología
Archaeological dating
Archaeological excavations
Object Predicate Subject
XML
Graph
Using a vocabulary for classificationSemantic Net presentation
Graph
presentation
(Topic Map)
Tree
presentation
Object
Predicate
Subject
2 Vocabularies
Usage of SKOS within theEuropeana Data Model (EDM)
Contextual Classes (EDM)ObjectPredicate
Subject
2 Vocabularies
Provided Cultural Heritage Object (EDM)
Object
Predicate
Subject
2 Vocabularies
Vocabulary management tool examples
Import and Export
thesauri
ThManager 2.0 developed by the University of Zaragoza
UNESCO
Thesaurus
3 Voc. Tool
Vocabulary management tool examples
View and browse
thesauri
List view
Tree view
Search terms
3 Voc. Tool
Vocabulary management tool examples
Edit thesauri
3 Voc. Tool
Vocabulary Management – Business Process/Workflow
The process of editing and
managing a vocabulary
involves the communication
between different persons
and the technical system:
Eg. Finding a term,
Requesting a candidate term,
Accepting the new term,
Updating the vocabulary etc.
User uses web
service for finding a
vocabulary term
Term exists in the vocabulary
SaaSCrowdsourcing
Term does not
exist in the
vocabulary
Use
crowdsourcing
for finding a new
candidate term
3 Voc. Tool
Outcomes of Copenhagen - WS1
Access to the vocabulary:
• Online access point (see TGN example)
• Integration into local application (see Genre example)
Import of existing vocabularies into the LoCloud experimental vocabulary
application (pre-requisite: skosified, multilingual, open access)
• Subject vocabularies
• Object Types
• Geographic Names
• Suggested vocabularies: UNESCO Thesaurus, UKAT
(http://www.heritagedata.org/blog/vocabularies-provided/ ? UK)
Outcomes of Copenhagen
General usage of the tool
• Everybody allowed to enter new terms vs. Suggest candidate terms
• Multilinguality: language translations assigned to partners
Other wishes?
• Crowdsourcing?
• Automated Vocabulary enrichment
Thank [email protected]
LoCloud is funded by the European Commission's ICT Policy Support Programme
The views and opinions expressed in thispresentation are the sole responsibility of the
authors and do not necessarily reflect the views of
the European Commission.
Funding