lift your data_inspire2012

Lift your data: Introduction to Linked Data. Ghislain Atemezing (1), Boris Villazón-Terrazas (2). (1) EURECOM, France, [email protected] (2) Ontology Engineering Group, FI, UPM, [email protected] Slides available at: http://www.slideshare.net/atemezing/ INSPIRE Conference 2012, Istanbul. Acknowledgements: Oscar Corcho, Asunción Gómez-Pérez, Luis Vilches, Raphaël Troncy, Olaf Hartig, and many others that we may have omitted. Work distributed under the license Creative Commons Attribution-Noncommercial-Share Alike 3.0

Upload: eurecom

Post on 01-Jul-2015


DESCRIPTION

This presentation was given during the tutorial "A practical introduction to Linked Data: lift your data", organised by IGN (France), EURECOM, UPM, and Ordnance Survey within the INSPIRE conference in Istanbul, held from June 23rd to June 27th.

TRANSCRIPT

Page 1: Lift your data_inspire2012

Ghislain Atemezing (1), Boris Villazón-Terrazas (2)

(1) EURECOM, France

[email protected]

(2) Ontology Engineering Group, FI, UPM

[email protected]

Slides available at: http://www.slideshare.net/atemezing/

INSPIRE Conference 2012, Istanbul

Acknowledgements: Oscar Corcho, Asunción Gómez-Pérez, Luis Vilches, Raphaël Troncy, Olaf Hartig, and many others that we may have omitted.

Lift your data

Introduction to Linked Data

Work distributed under the license Creative Commons Attribution-Noncommercial-Share Alike 3.0

Page 2: Lift your data_inspire2012

Agenda

Linked Data

Geospatial LD Datasets

5-star deployment scheme for Linked Open Data

Tutorial “Lift your data”, Istanbul 2012 2

Page 4: Lift your data_inspire2012

Current Web

Geospatial Database (Spain)

Statistical Database (Spain)

Data exposed to the Web via HTML, PDF, etc.

© Slide adapted from “5min Introduction to Linked Data” - Olaf Hartig

Page 5: Lift your data_inspire2012

Information from single pages can be found via search engines

© Slide adapted from “5min Introduction to Linked Data” - Olaf Hartig

Page 6: Lift your data_inspire2012

© Slide adapted from “5min Introduction to Linked Data”- Olaf Hartig

Complex queries over multiple pages/data sources?


Page 7: Lift your data_inspire2012

What do we actually want?

• Use the Web like a single global database
• web of documents -> web of data

Statistical Database (Spain)

Geospatial Database (Spain)

© Slide adapted from “5min Introduction to Linked Data” - Olaf Hartig

Page 8: Lift your data_inspire2012

Linked Data enables such Web of Data

Statistical Database (Spain)

Geospatial Database (Spain)

Global Identifier: URI (Uniform Resource Identifier), a string of characters used to identify a name or a resource on the Internet.

http://../Madrid
http://…/industry_2009

Data Model: RDF (Resource Description Framework), a standard model for data interchange on the Web.

http://.../name -> “Madrid”
http://.../measure_quantity -> 180.23

Access Mechanism: HTTP

Connection: Typed Links

http://.../relatedToArea

© Slide adapted from “5min Introduction to Linked Data” - Olaf Hartig
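As a rough illustration of how these building blocks fit together, the sketch below stores the slide's example values as plain (subject, property, value) triples and follows the typed link from the statistical record to the geospatial resource. The `example.org` namespaces, the direction of the `relatedToArea` link, and the helper names are assumptions made for this example; the slide elides the real URIs.

```python
# Hypothetical namespaces: the slide elides the real hosts ("http://.../").
GEO = "http://example.org/geo/"
STAT = "http://example.org/stat/"
VOCAB = "http://example.org/vocab/"

# RDF data model: every fact is a (subject, property, value) triple.
TRIPLES = [
    (GEO + "Madrid", VOCAB + "name", "Madrid"),
    (STAT + "industry_2009", VOCAB + "measure_quantity", 180.23),
    # Typed link from the statistical record to the geospatial resource.
    # The direction of this link is an assumption.
    (STAT + "industry_2009", VOCAB + "relatedToArea", GEO + "Madrid"),
]


def value_of(subject, prop, triples):
    """Return the first value stated for (subject, prop), or None."""
    for s, p, o in triples:
        if s == subject and p == prop:
            return o
    return None


def measure_with_area_name(record, triples):
    """Follow the typed link from a statistical record to its area's name."""
    area = value_of(record, VOCAB + "relatedToArea", triples)
    return (
        value_of(area, VOCAB + "name", triples),
        value_of(record, VOCAB + "measure_quantity", triples),
    )


print(measure_with_area_name(STAT + "industry_2009", TRIPLES))
```

Because both databases use global identifiers, a single lookup can combine a value from the statistical source with a name from the geospatial source.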

Page 9: Lift your data_inspire2012

The four principles (Tim Berners-Lee, 2006)

1. Use URIs as names for things

2. Use HTTP URIs so that people can look up those names.

3. When someone looks up a URI, provide useful information, using the standards (RDF*, SPARQL)

4. Include links to other URIs, so that they can discover more things.

http://www.w3.org/DesignIssues/LinkedData.html

http://www.ted.com/talks/tim_berners_lee_on_the_next_web.html
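Principles 2 and 3 can be sketched as an HTTP lookup that asks for an RDF representation through the `Accept` header. This is a minimal sketch using only the Python standard library; the choice of `text/turtle` and the assumption that the server honours it are illustrative, and `look_up` needs network access, so it is defined but not called here.

```python
from urllib.request import Request, urlopen


def build_lookup_request(uri, media_type="text/turtle"):
    """Build an HTTP GET request asking for an RDF representation of `uri`."""
    return Request(uri, headers={"Accept": media_type})


def look_up(uri, media_type="text/turtle", timeout=10):
    """Dereference `uri` and return the response body as text (needs network)."""
    request = build_lookup_request(uri, media_type)
    with urlopen(request, timeout=timeout) as response:
        return response.read().decode("utf-8")


# Example URI following the DBpedia pattern http://dbpedia.org/resource/{name}.
request = build_lookup_request("http://dbpedia.org/resource/Madrid")
print(request.get_method(), request.full_url, request.get_header("Accept"))
```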


Page 10: Lift your data_inspire2012

Resource Description Framework (RDF)

• RDF is a basic KR language based on semantic networks
• Useful to represent metadata and describe any type of information in a machine-accessible way.

Statement: Subject - property -> Object

[Figure: example RDF graph with the resources ignes:Madrid, ignes:Spain, and ignes:Murcia; the properties geoes:isPartOf (twice), dbpedia:population, and rdfs:label; and the literals “Madrid” and 1.027.914]
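To make the statement structure concrete, the sketch below expands prefixed names into full URIs and writes each statement as one N-Triples line. The namespace URIs for `ignes` and `geoes` are hypothetical (the slide shows only the prefixes), and the two statements are one plausible reading of the figure rather than a confirmed transcription.

```python
# Namespace URIs for ignes and geoes are hypothetical placeholders.
PREFIXES = {
    "ignes": "http://example.org/ignes/",
    "geoes": "http://example.org/geoes/",
    "rdfs": "http://www.w3.org/2000/01/rdf-schema#",
}


def expand(prefixed_name):
    """Turn a prefixed name such as 'rdfs:label' into a full URI."""
    prefix, local = prefixed_name.split(":", 1)
    return PREFIXES[prefix] + local


def to_ntriples(subject, prop, obj, literal=False):
    """Serialize one subject-property-object statement as an N-Triples line."""
    if literal:
        escaped = obj.replace("\\", "\\\\").replace('"', '\\"')
        obj_term = '"' + escaped + '"'
    else:
        obj_term = "<" + expand(obj) + ">"
    return "<" + expand(subject) + "> <" + expand(prop) + "> " + obj_term + " ."


statements = [
    to_ntriples("ignes:Madrid", "geoes:isPartOf", "ignes:Spain"),
    to_ntriples("ignes:Madrid", "rdfs:label", "Madrid", literal=True),
]
print("\n".join(statements))
```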

Page 11: Lift your data_inspire2012

RDF - SPARQL

• Query: “Tell me who are the persons who have Ghislain as colleague”

? - person:hasColleague -> eurecom:Ghislain

• Result: upm:Boris and eurecom:Raphael

• SPARQL: query language for RDF, a W3C Recommendation

SELECT ?s
WHERE { ?s person:hasColleague eurecom:Ghislain . }

[Figure: RDF graph in which upm:Boris and eurecom:Raphael are each linked to eurecom:Ghislain by person:hasColleague; the graph also shows the properties person:hasName (“Boris Villazón Terrazas”) and person:hasSex (“Male”)]
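The idea behind the query can be sketched without a SPARQL engine: a triple pattern is compared against every triple in the graph, and terms starting with `?` act as variables. This is a simplified illustration of single-pattern matching, not how a real SPARQL processor is implemented; the `match` helper is a name invented for this example.

```python
# Statements taken from the slide's example graph.
GRAPH = [
    ("upm:Boris", "person:hasColleague", "eurecom:Ghislain"),
    ("eurecom:Raphael", "person:hasColleague", "eurecom:Ghislain"),
    ("upm:Boris", "person:hasName", "Boris Villazón Terrazas"),
]


def match(pattern, graph):
    """Yield one {variable: value} binding per triple that matches `pattern`.

    Terms starting with "?" are variables; all other terms must match exactly.
    """
    for triple in graph:
        binding = {}
        for term, value in zip(pattern, triple):
            if term.startswith("?"):
                # A variable used twice must bind to the same value.
                if binding.setdefault(term, value) != value:
                    break
            elif term != value:
                break
        else:
            yield binding


# Equivalent of: SELECT ?s WHERE { ?s person:hasColleague eurecom:Ghislain . }
pattern = ("?s", "person:hasColleague", "eurecom:Ghislain")
results = [binding["?s"] for binding in match(pattern, GRAPH)]
print(results)  # ['upm:Boris', 'eurecom:Raphael']
```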

Page 12: Lift your data_inspire2012

Evolution of the Linked Open Data

[Figure: Linked Open Data snapshots for 2007, 2008, 2009, and 2010]

Page 13: Lift your data_inspire2012

Linked Open Data

Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/

2011

31 datasets, 19.43% of triples

http://lod-cloud.net/state

Page 14: Lift your data_inspire2012

Agenda

Linked Data

Geospatial LD Datasets

5-star deployment scheme for Linked Open Data


Page 15: Lift your data_inspire2012

GeoData is becoming increasingly relevant

• Datasets in the geospatial domain contribute about one fifth of the RDF triples in the Web of Data (19.43% according to http://lod-cloud.net/state).

• Well-known Geo LD datasets
  • Ordnance Survey
  • GeoLinkedData
  • LinkedGeoData
  • GeoNames
  • DBpedia
  • GADM-RDF

Page 16: Lift your data_inspire2012

Ordnance Survey http://data.ordnancesurvey.co.uk

• Topic: Administrative units

• Datasets: The administrative gazetteer for Great Britain.

• URI pattern: http://data.ordnancesurvey.co.uk/id/{id}

• Vocabulary: Spatial Relations Ontology, Administrative Geography Ontology, WGS84, FOAF, and Gazetteer Ontology

Page 17: Lift your data_inspire2012

GeoLinkedData http://geo.linkeddata.es/

• Topic: Hydrography, Administrative units, statistical information, meteorological features

• Datasets: Statistical, geospatial, and meteorological data.

• URI pattern:
  • http://geo.linkeddata.es/ontology/{concept|property}
  • http://geo.linkeddata.es/resource/{type}/{name}

• Vocabulary: SCOVO, FAO Geopolitical, hydrOntology, WGS84, GeoLinkedData Geometry Model, and Time Ontology.

Page 18: Lift your data_inspire2012

LinkedGeoData http://linkedgeodata.org/

• Topic: Points of interest

• Datasets: OpenStreetMap database

• URI pattern: http://linkedgeodata.org/triplify/{id}

• Vocabulary: LGD Ontology, WGS84, NeoGeo


Page 19: Lift your data_inspire2012

GeoNames http://www.geonames.org/

• Topic: Toponyms

• Datasets: Datasets used by GeoNames

• URI pattern: http://sws.geonames.org/{id}

• Vocabulary: GeoNames Ontology, WGS84

Page 20: Lift your data_inspire2012

DBpedia http://dbpedia.org/

• Topic: General knowledge

• Datasets: Wikipedia

• URI pattern: http://dbpedia.org/resource/{name}

• Vocabulary: DBpedia ontology, WGS84

Page 21: Lift your data_inspire2012

GADM-RDF http://gadm.geovocab.org/

• Topic: Global administrative areas

• Datasets: countries and lower level subdivisions.

• URI pattern: http://gadm.geovocab.org/id/{id}

• Vocabulary: GADM ontology, NeoGeo Ontology, DBpedia ontology, WGS84
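The URI patterns listed for these datasets can be treated as templates. The sketch below fills them in with Python's `str.format`; the pattern strings come from the slides, while the `mint_uri` helper, the percent-encoding step, and the sample identifier are assumptions made for this example.

```python
from urllib.parse import quote

# URI patterns as listed on the dataset slides.
URI_PATTERNS = {
    "Ordnance Survey": "http://data.ordnancesurvey.co.uk/id/{id}",
    "GeoLinkedData": "http://geo.linkeddata.es/resource/{type}/{name}",
    "LinkedGeoData": "http://linkedgeodata.org/triplify/{id}",
    "GeoNames": "http://sws.geonames.org/{id}",
    "DBpedia": "http://dbpedia.org/resource/{name}",
    "GADM-RDF": "http://gadm.geovocab.org/id/{id}",
}


def mint_uri(dataset, **parts):
    """Fill a dataset's URI pattern, percent-encoding each supplied part."""
    safe_parts = {key: quote(str(value), safe="") for key, value in parts.items()}
    return URI_PATTERNS[dataset].format(**safe_parts)


print(mint_uri("DBpedia", name="Madrid"))
# The identifier below is a made-up placeholder, not a real GeoNames id.
print(mint_uri("GeoNames", id=12345))
```

A missing template part raises `KeyError`, which makes an incomplete identifier easy to spot.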

Page 22: Lift your data_inspire2012

Agenda

Linked Data

Geospatial LD Datasets

5-star deployment scheme for Linked Open Data


Page 23: Lift your data_inspire2012

The Five Stars deployment scheme

http://lab.linkeddata.deri.ie/2010/star-scheme-by-example/

Is your data five stars?

Page 24: Lift your data_inspire2012

One Star: Open Data & Open License

http://opendefinition.org/licenses/


Page 25: Lift your data_inspire2012

Two stars: Reusability

• Data are reusable

• Some formats are helpful
  • Excel, CSV, JSON, XML, GML

• Others not really
  • PDF, HTML, MS Word

Credit: Bernadette Hyland: http://www.slideshare.net/3roundstones

Page 26: Lift your data_inspire2012

Three stars: Specialist formats

• Specialist tools often have specialist formats
  • Few people have the tools
  • Expensive
  • Difficult to re-use
  • Geospatial tools, statistical packages, etc.

• Use open standards
  • CSV, JSON, XML, GML, OGC Web services, TSV, RDF

Credit: Richard Cyganiak: http://www.slideshare.net/cygri/edf2012-the-web-of-data-and-its-five-stars
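Moving from an open format such as CSV towards RDF can be sketched as a small "lifting" step: each row becomes a resource with a URI, and each column becomes a property. The sample rows, the `example.org` URIs, and the `lift` function are hypothetical and only illustrate the idea; the sketch also assumes that names contain no double quotes or backslashes.

```python
import csv
import io
from urllib.parse import quote

# Hypothetical sample data and URIs, for illustration only.
CSV_TEXT = "name,country\nMadrid,Spain\nMurcia,Spain\n"
RESOURCE = "http://example.org/resource/"
IS_PART_OF = "http://example.org/ontology/isPartOf"
LABEL = "http://www.w3.org/2000/01/rdf-schema#label"


def lift(csv_text):
    """Turn each CSV row into two N-Triples lines: a label and a typed link."""
    lines = []
    for row in csv.DictReader(io.StringIO(csv_text)):
        name = row["name"]
        subject = RESOURCE + quote(name, safe="")
        country = RESOURCE + quote(row["country"], safe="")
        lines.append("<" + subject + "> <" + LABEL + '> "' + name + '" .')
        lines.append("<" + subject + "> <" + IS_PART_OF + "> <" + country + "> .")
    return lines


for line in lift(CSV_TEXT):
    print(line)
```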

Page 27: Lift your data_inspire2012

Towards INSPIRE (5-Stars) Dataset?

* INSPIRE GeoPortal (to find datasets) http://inspire-geoportal.ec.europa.eu/discovery/

* SKOS Glossary for Spatial Data Infrastructure: http://semanticlab.jrc.ec.europa.eu/

Page 28: Lift your data_inspire2012

http://www.flickr.com/photos/wwworks/4759535950/


Page 29: Lift your data_inspire2012

Some references

Wood, David (Ed.). Linking Government Data. 2011.

Boris Villazón-Terrazas, Luis M. Vilches, Oscar Corcho, Asunción Gómez-Pérez. Methodological Guidelines for Publishing Government Linked Data.

Bernadette Hyland, Boris Villazón-Terrazas, Michael Hausenblas. Best Practices for Publishing Linked Data. W3C Editor’s Draft, Government Linked Data Working Group. https://dvcs.w3.org/hg/gld/raw-file/default/bp/index.html

Bernadette Hyland, Boris Villazón-Terrazas, Sarven Capadisli. Cookbook for Open Government Linked Data. W3C Editor’s Draft, Government Linked Data Working Group. http://www.w3.org/2011/gld/wiki/Linked_Data_Cookbook

Page 30: Lift your data_inspire2012

Slides available at: http://www.slideshare.net/boricles/