a gentle introduction to riese

Post on 11-May-2015

4.339 Views

Category:

Education

2 Downloads

Preview:

Click to see full reader

DESCRIPTION

Introduces riese, the 'RDFizing and Interlinking the EuroStat Data Set Effort' in a couple of minutes.

TRANSCRIPT

Institute of Information Systems & Information Management

riese – RDFizing & Interlinking the EuroStat Dataset Effort

Wolfgang Halb (JOANNEUM RESEARCH), Yves Raimond (Queen Mary University of London) and Michael Hausenblas (JOANNEUM RESEARCH)

2008-01-30

2

Agenda LinkingOpenData Eurostat (http://ec.europa.eu/eurostat) Architecture Schema & Data Demo Inside

3

LinkingOpenData: Principles Items should be identified using URI references [

URIrefs] (and: don’t use bNodes); URIrefs should be dereferenceable: using HTTP

URIs allows looking up the items identified through URIrefs, cf. [http-range-14 TAG finding];

Looking up an URIref it leads to more data [follow-your-nose principle];

Links to other URIrefs should be included in order to enable the discovery of more data [How to Publish Linked Data on the Web]

4

LinkingOpenData: Current State

5

LinkingOpenData: Current State

in less than a year an emerging community (cf. [LOD ESWiki] created approx. 4 billion triples and approx. 3 million interlinks in

25 separate data sets held diverse F2F meetings, presentations, etc. upcoming: LDOW08 workshop at WWW08

6

Eurostat Eurostat (http://ec.europa.eu/eurostat) publishes statistics in these themes:

General and regional statistics Economy and finance Population and social conditions Industry, trade and services Agriculture and fisheries External trade Transport Environment and energy Science and technology

about the European Union in detail and additional statistics for major non-European countries

7

Eurostat data dump provided as download (TSV-files) updated twice a day additionally needed:

dictionary files to translate the data codes used table of contents for structure

Size of Eurostat data 5 GB data dump in approx. 4,000 files 350 million data values 80,000 different data codes

8

riese: architecture

9

riese: schema & data

riese:Item

xsd:String / xsd:Decimal

rdf:valueevent:Event

rdfs:subClassOf

riese:Dimension riese:dimension

xsd:String

dc:title

dimension:Geo

dimension:xxx

geonames:Feature

rdfs:subClassOf

rdfs:subClassOf

rdfs:subClassOf

dimension:Flags

riese:flagrdfs:subClassOf

riese:Dataset

riese:dataset

xsd:String

dc:title

dimension:Time

rdfs:subClassOf

skos:Concept

rdf:type

skos:narrower /skos:broader

event:time

event:place

xsd:String

dc:title

geonames:parentFeature

Additional features for geo not detailed here

riese:datasetOf

10

riese: schema & data 3 billion triples generated

Example data:

<riese:Dataset rdf:about="http://riese.joanneum.at/data/eb040"

dc:title="Inflation rate"

riese:data_end="2006"

riese:data_start="1980"

riese:last_update="08/01/2008“/>

11

riese: schema & data<riese:Item dc:title=“Inflation rate Austria 2006"

rdf:value=“1.7"

<riese:dimension rdf:resource="http://riese.joanneum.at/dimension/geo/at"/>

<riese:dimension rdf:resource="http://riese.joanneum.at/dimension/time/2006"/>

<riese:dataset rdf:resource="http://riese.joanneum.at/dat/eb040"/>

</riese:Item>

12

riese: schema & data XHTML + RDFa example:

<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE html PUBLIC "-//W3C//DTD XHTML+RDFa 1.0//EN"

"http://www.w3.org/MarkUp/DTD/xhtml-rdfa-1.dtd"><html xmlns="http://www.w3.org/1999/xhtml" xml:lang="en" xmlns:riese="http://riese.joanneum.at/schema/core#" ... ><head>...</head><body about="http://riese.joanneum.at/data/economy/"

instanceof="riese:Dataset"><span class="toc-entry"><a

href="http://riese.joanneum.at/data/bop/" rel="skos:narrower" class="dim">Balance of payments - International transactions</a></span>

Last update: <span property="dc:date" datatype="xsd:date">2008-01-09</span>

</body></html>

13

riese: demo

14

riese: inside Server:

Apache 2.2 SWI-Prolog PHP 5 RDF/XML documents in the file system

Client XHTML+RDFa Javascript/Yahoo! Interface Library [YUI]

top related