#opentourism - linked open data publishing and discovery workshop


Upload: raf-buyle

Post on 20-Mar-2017

TRANSCRIPT

Page 1: #opentourism - Linked Open Data Publishing and Discovery Workshop

#openbelgium15 #opentourism

A community supporting Digital Creativity in Tourism

Page 2: #opentourism - Linked Open Data Publishing and Discovery Workshop

Cause and context

Raf Buyle - Open Knowledge

@rafke #opentourism

Page 3: #opentourism - Linked Open Data Publishing and Discovery Workshop

WHAT IF…

Imagine we could use the same services in different places around the world without needing to search for new service providers or new apps...

Page 4: #opentourism - Linked Open Data Publishing and Discovery Workshop

WHAT IS

Information often can't be reused without analyzing the data or building expensive custom connectors

Page 5: #opentourism - Linked Open Data Publishing and Discovery Workshop

WHAT COULD BE?

Linked Open Data aims to connect these solitary pieces of information to other data sources on the web.

Page 6: #opentourism - Linked Open Data Publishing and Discovery Workshop

WHAT SHOULD WE DO?

We need more Open Data and a common language for machine-readable tourism information.

Page 7: #opentourism - Linked Open Data Publishing and Discovery Workshop

Joined forces with ‘sustainable mobile tourism guides’, Thomas More - iMinds

Page 8: #opentourism - Linked Open Data Publishing and Discovery Workshop

Shared vision: “Open Standard for Tourism” Vocabulary

Ontologies are expected to enable computers to process information “much more effectively in ways that are useful and meaningful to the human user” (Berners-Lee, Hendler, and Lassila 2001).

Page 9: #opentourism - Linked Open Data Publishing and Discovery Workshop

User Group

40+ actors: SMEs and NPOs involved in culture and tourism activities, and representatives of the provinces and the major cities in Flanders

Page 10: #opentourism - Linked Open Data Publishing and Discovery Workshop

Priorities both as data producer and as data consumer

Page 11: #opentourism - Linked Open Data Publishing and Discovery Workshop

“After defining the scope of each sub-domain we organized working groups. Each group was responsible to work out their sub-domain in detail in terms of domain modeling”

Page 12: #opentourism - Linked Open Data Publishing and Discovery Workshop

Data in line with the Open Tourism specification

The West Flanders tourism organization was the first to bring its data in line with the specification

Page 13: #opentourism - Linked Open Data Publishing and Discovery Workshop

Open Tourism hackathon

🗸 DATA 🗸 TOOLS 🗸 APPS 🗸 FEEDBACK

http://www.mobielegidsen.be/hackathon/

Page 14: #opentourism - Linked Open Data Publishing and Discovery Workshop

From ad-hoc activity to a solid community

Our mission is to “support digital creativity concerning tourism” and to host the “Open Standard for Tourism” Vocabulary, a reusable ontology for describing tourism data.

Join the Community Meeting: March 5, 2016, Mechelen

Page 15: #opentourism - Linked Open Data Publishing and Discovery Workshop

Release Open Tourism Community Site

● Mailing list
● Tools and resources
● The Open Standard for Tourism Ecosystems Data Specification
● News on the upcoming events

Page 16: #opentourism - Linked Open Data Publishing and Discovery Workshop

Join our community! tourism.openknowledge.be

Page 18: #opentourism - Linked Open Data Publishing and Discovery Workshop

Vision and strategy…

● Bring the Open Tourism vocabulary to an international level

● Build a bridge to the interoperability programme of the Flemish Government, “Open Standards for Linked Governments”.

● First step: on March 3 our work and vision will be presented at the International e-Gov Conference in Montevideo.

Page 19: #opentourism - Linked Open Data Publishing and Discovery Workshop

What you really should remember from this talk:

● Semantic collaboration is going to be a cornerstone of the next generation of tourism applications

● Semantic standards are a catalyst for better services and Open Data.

● The Open Tourism vocabulary exists: http://w3id.org/ost/spec

● You’re welcome to join our community! http://tourism.openknowledge.be

Page 20: #opentourism - Linked Open Data Publishing and Discovery Workshop

Getting started with the Open Tourism Specification

Bruno Koninckx - Thomas More

@brunokon #opentourism

Page 21: #opentourism - Linked Open Data Publishing and Discovery Workshop

Which domains are covered (for now)?

➔ Offering: What? By whom?

➔ Availability: Where? When?

➔ Experience: How?

➔ Demand

Page 22: #opentourism - Linked Open Data Publishing and Discovery Workshop

Offering

Availability

Experience

Demand

Page 23: #opentourism - Linked Open Data Publishing and Discovery Workshop

Availability

Demand

Offering

Page 24: #opentourism - Linked Open Data Publishing and Discovery Workshop

Experience

Offering

Page 25: #opentourism - Linked Open Data Publishing and Discovery Workshop

Demand

Offering

Page 26: #opentourism - Linked Open Data Publishing and Discovery Workshop

Specifications: http://w3id.org/ost/spec

Questions, remarks, issues on the domain model and specifications via:

https://github.com/openknowledgebe/tourism/issues

Page 27: #opentourism - Linked Open Data Publishing and Discovery Workshop

Publishing and Consuming Linked Tourism Data

Anastasia Dimou - iMinds - Data Science Lab

@natadimou #opentourism

Page 28: #opentourism - Linked Open Data Publishing and Discovery Workshop

RML.io

Generating High Quality Linked Open Data from Open or Not Data

Anastasia Dimou
Data Science Lab, Ghent University - iMinds

@natadimou

Page 29: #opentourism - Linked Open Data Publishing and Discovery Workshop

What is the Semantic Web?

Page 30: #opentourism - Linked Open Data Publishing and Discovery Workshop

The Semantic Web is the extension of the World Wide Web

Page 31: #opentourism - Linked Open Data Publishing and Discovery Workshop

Are you the owner of your data?

OR

is the application that hosts your data?

Page 32: #opentourism - Linked Open Data Publishing and Discovery Workshop

The Semantic Web is the extension of the World Wide Web

enables sharing content beyond the boundaries of applications & websites

Page 33: #opentourism - Linked Open Data Publishing and Discovery Workshop

the Web for humans, thanks to HTML, is understandable & constant

BUT

is the Web for machines too?

Page 34: #opentourism - Linked Open Data Publishing and Discovery Workshop

The Semantic Web is the extension of the World Wide Web

enables sharing content beyond the boundaries of applications & websites

allows machines to understand the meaning of hyperlinked information

Page 35: #opentourism - Linked Open Data Publishing and Discovery Workshop

Semantic Web enabled applications rely on data represented as Linked Data

Page 36: #opentourism - Linked Open Data Publishing and Discovery Workshop

What is Linked (Open) Data?

Page 37: #opentourism - Linked Open Data Publishing and Discovery Workshop

Linked (Open) Data

a standardized way of expressing the relationships between data

Page 38: #opentourism - Linked Open Data Publishing and Discovery Workshop

Linked (Open) Data

a standardized way of expressing the relationships between data

data semantically annotated with different vocabularies or ontologies

Page 39: #opentourism - Linked Open Data Publishing and Discovery Workshop

Linked (Open) Data

a standardized way of expressing the relationships between data

data semantically annotated with different vocabularies or ontologies

describes domain-level knowledge understandable by humans & machines

Page 40: #opentourism - Linked Open Data Publishing and Discovery Workshop

Where does Linked Data come from?

Page 41: #opentourism - Linked Open Data Publishing and Discovery Workshop

Linked (Open) Data is often derived from originally

heterogeneous (semi-)structured data

Page 42: #opentourism - Linked Open Data Publishing and Discovery Workshop

Linked Open Data from Not Open Data

Page 43: #opentourism - Linked Open Data Publishing and Discovery Workshop

Linked Open Data

from Not Open Data
from Open Data

Page 44: #opentourism - Linked Open Data Publishing and Discovery Workshop

Linked Open Data

from Not Open Data
from Open Data
from their combination

Page 45: #opentourism - Linked Open Data Publishing and Discovery Workshop

How is Open Data published?

Page 46: #opentourism - Linked Open Data Publishing and Discovery Workshop

The DataTank
open source RESTful data management system

Page 47: #opentourism - Linked Open Data Publishing and Discovery Workshop

The DataTank
open source RESTful data management system

enables publishing data in Web-readable formats

reads the data and publishes them on the Web using a URI as an identifier

Page 48: #opentourism - Linked Open Data Publishing and Discovery Workshop

The DataTank: supported data formats

CSV and XLS
XML
JSON(-LD)
SHP
RDF
MySQL
SPARQL

Page 49: #opentourism - Linked Open Data Publishing and Discovery Workshop

How is Linked (Open) Data published?

Page 50: #opentourism - Linked Open Data Publishing and Discovery Workshop

Linked (Open) Data is published in the form of RDF datasets

Page 51: #opentourism - Linked Open Data Publishing and Discovery Workshop

Resource Description Framework (RDF) is the prevalent data model for describing Linked (Open) Data

Page 52: #opentourism - Linked Open Data Publishing and Discovery Workshop

Resource Description Framework (RDF) is the prevalent data model for describing Linked (Open) Data

subject → predicate → object

Page 53: #opentourism - Linked Open Data Publishing and Discovery Workshop

Resource Description Framework (RDF) is the prevalent data model for describing Linked (Open) Data

driven by unique identifiers (URIs)

subject → predicate → object

Page 54: #opentourism - Linked Open Data Publishing and Discovery Workshop

Resource Description Framework (RDF) is the prevalent data model for describing Linked (Open) Data

driven by unique identifiers (URIs)
allows establishing a shared meaning

subject → predicate → object
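
As a minimal sketch of the triple model, the Python snippet below stores each statement as a (subject, predicate, object) tuple and serializes it in N-Triples syntax. The `http://ex.com/` namespace and the `works` property are placeholders chosen for illustration, not terms from a published vocabulary; only the `rdfs:label` IRI is a real one.

```python
from typing import NamedTuple

RDFS_LABEL = "http://www.w3.org/2000/01/rdf-schema#label"
EX = "http://ex.com/"  # placeholder namespace (assumption, for illustration only)


class Triple(NamedTuple):
    subject: str              # a URI in this sketch
    predicate: str            # a URI
    obj: str                  # a URI or a plain literal
    is_literal: bool = False  # True when obj is a literal value


def to_ntriples(t: Triple) -> str:
    """Serialize one triple as a single N-Triples line."""
    if t.is_literal:
        # Escape backslashes and double quotes inside the literal.
        escaped = t.obj.replace("\\", "\\\\").replace('"', '\\"')
        obj = f'"{escaped}"'
    else:
        obj = f"<{t.obj}>"
    return f"<{t.subject}> <{t.predicate}> {obj} ."


triples = [
    Triple(EX + "1", RDFS_LABEL, "Anastasia Dimou", is_literal=True),
    Triple(EX + "1", EX + "works", EX + "DSLab"),
]

for t in triples:
    print(to_ntriples(t))
```

Because subjects and predicates are URIs, two datasets that use the same URI are talking about the same thing, which is what makes the shared meaning possible.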

Page 55: #opentourism - Linked Open Data Publishing and Discovery Workshop

How is Linked Data derived from (semi-)structured data?

Page 56: #opentourism - Linked Open Data Publishing and Discovery Workshop

How is Linked Data derived from (semi-)structured data?

id  firstname  lastname  lab    city
1   Anastasia  Dimou     DSLab  Ghent
2   Ruben      Verborgh  DSLab  Ghent
3   Erik       Mannens   DSLab  Ghent

Page 57: #opentourism - Linked Open Data Publishing and Discovery Workshop

Person 1 → label → “Anastasia Dimou”
Person 1 → works → Data Science Lab

Person 2 → label → “Ruben Verborgh”
Person 2 → works → Data Science Lab

Person 3 → label → “Erik Mannens”
Person 3 → works → Data Science Lab

Data Science Lab → located → Ghent

Page 58: #opentourism - Linked Open Data Publishing and Discovery Workshop

Assign unique identifiers (URIs)

Person {id} → label → “{firstname} {surname}”
Person {id} → works → {lab}

Page 59: #opentourism - Linked Open Data Publishing and Discovery Workshop

Assign unique identifiers (URIs)

Person {id} → label → “{firstname} {surname}”
Person {id} → works → {lab}

http://ex.com/{id} → label → “{firstname} {surname}”
http://ex.com/{id} → works → http://ex.com/{lab}

Page 60: #opentourism - Linked Open Data Publishing and Discovery Workshop

Annotate data relationships with ontologies

http://ex.com/{id} → label → “{firstname} {surname}”
http://ex.com/{id} → works → http://ex.com/{lab}

Page 61: #opentourism - Linked Open Data Publishing and Discovery Workshop

Annotate data relationships with ontologies

http://ex.com/{id} → label → “{firstname} {surname}”
http://ex.com/{id} → works → http://ex.com/{lab}

http://ex.com/{id} → rdfs:label → “{firstname} {surname}”
http://ex.com/{id} → ex:works → http://ex.com/{lab}

Page 62: #opentourism - Linked Open Data Publishing and Discovery Workshop

ex:1 → rdfs:label → “Anastasia Dimou”
ex:1 → ex:works → ex:DSLab

ex:2 → rdfs:label → “Ruben Verborgh”
ex:2 → ex:works → ex:DSLab

ex:3 → rdfs:label → “Erik Mannens”
ex:3 → ex:works → ex:DSLab

ex:DSLab → ex:located → ex:Ghent

Page 63: #opentourism - Linked Open Data Publishing and Discovery Workshop

sets of triples of a dataset have repetitive patterns

ex:{id} → rdfs:label → “{firstname} {surname}”
ex:{id} → ex:works → ex:{lab}
ex:{lab} → ex:located → ex:{city}

Page 64: #opentourism - Linked Open Data Publishing and Discovery Workshop

sets of triples of a dataset have repetitive patterns

ex:{id} → rdfs:label → “{firstname} {surname}”
ex:{id} → ex:works → ex:{lab}
ex:{lab} → ex:located → ex:{city}

RDF dataset generation tools base their implementation on repeatedly applying those patterns to the input data
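
The idea of repeatedly applying triple patterns to input data can be sketched in a few lines of Python. This is an illustrative assumption about how such a tool might work internally, not the implementation of any particular generator; the rows reuse the researcher table from the earlier slide, with the name column called `surname` as in the templates.

```python
# Input rows, one dict per record of the researcher table.
ROWS = [
    {"id": "1", "firstname": "Anastasia", "surname": "Dimou", "lab": "DSLab", "city": "Ghent"},
    {"id": "2", "firstname": "Ruben", "surname": "Verborgh", "lab": "DSLab", "city": "Ghent"},
    {"id": "3", "firstname": "Erik", "surname": "Mannens", "lab": "DSLab", "city": "Ghent"},
]

# Each pattern is (subject template, predicate, object template).
PATTERNS = [
    ("ex:{id}", "rdfs:label", '"{firstname} {surname}"'),
    ("ex:{id}", "ex:works", "ex:{lab}"),
    ("ex:{lab}", "ex:located", "ex:{city}"),
]


def generate_triples(rows, patterns):
    """Apply every pattern to every row, skipping triples already produced."""
    seen = set()
    triples = []
    for row in rows:
        for subject_tpl, predicate, object_tpl in patterns:
            triple = (subject_tpl.format(**row), predicate, object_tpl.format(**row))
            if triple not in seen:
                seen.add(triple)
                triples.append(triple)
    return triples


for s, p, o in generate_triples(ROWS, PATTERNS):
    print(s, p, o, ".")
```

The three rows share the same lab and city, so the `ex:located` pattern yields the same triple three times; the sketch keeps it once, since an RDF dataset is a set of triples.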

Page 65: #opentourism - Linked Open Data Publishing and Discovery Workshop

What are the different Linked Data Generation approaches?

Page 66: #opentourism - Linked Open Data Publishing and Discovery Workshop

Linked Data generation approaches

case-specific solutions

Page 67: #opentourism - Linked Open Data Publishing and Discovery Workshop

Linked Data generation approaches

case-specific solutions
OR
format and source specific

Page 68: #opentourism - Linked Open Data Publishing and Discovery Workshop

[Diagram: the data owner / publisher converts each source format (CSV, JSON, XML) to RDF separately.]

Page 69: #opentourism - Linked Open Data Publishing and Discovery Workshop

[Diagram: the data owner / publisher defines R2RML mappings, which an R2RML processor applies to a DB to produce RDF; CSV, JSON and XML sources are each still converted to RDF separately.]

Page 70: #opentourism - Linked Open Data Publishing and Discovery Workshop

RDF Terms (focusing on IRIs) are…

generated independently
disregarding their possible prior definitions

manually replicated
by reconstructing the same URIs (if possible)

manually aligned afterwards
links with other datasets are defined after the RDF terms are published

Page 71: #opentourism - Linked Open Data Publishing and Discovery Workshop

Why not a uniform approach?

Page 72: #opentourism - Linked Open Data Publishing and Discovery Workshop

Uniform and declarative RDF generation from heterogeneous data sources

[Diagram: the data owner / publisher defines mappings; a single processor applies them to DB, CSV, JSON, XML and RDF sources to produce RDF.]

Page 73: #opentourism - Linked Open Data Publishing and Discovery Workshop

RDF Mapping Language (RML)

generic scalable mapping language
for generating and interlinking RDF data from heterogeneous resources
in an integrable and interoperable fashion

superset of the W3C standardized R2RML mapping language

http://rml.io

Page 74: #opentourism - Linked Open Data Publishing and Discovery Workshop

Uniform and declarative RDF generation from heterogeneous data sources

[Diagram: the data owner / publisher defines RML mappings; a single processor applies them to DB, CSV, JSON, XML and RDF sources to produce RDF.]

Page 75: #opentourism - Linked Open Data Publishing and Discovery Workshop

Defining Mappings to generate Linked Data

Retrieving Input Data

Editing Mappings

Page 77: #opentourism - Linked Open Data Publishing and Discovery Workshop

RML describes how to generate RDF from structured data

subject → predicate → object

Page 78: #opentourism - Linked Open Data Publishing and Discovery Workshop

RML describes how to generate RDF from structured data

subject → predicate → object

Subject Map → Predicate Map → Object Map

Page 79: #opentourism - Linked Open Data Publishing and Discovery Workshop

RML describes how to generate RDF from structured data

subject → predicate → object

Subject Map → Predicate Map → Object Map

<#TriplesMap>

Page 80: #opentourism - Linked Open Data Publishing and Discovery Workshop

RDF Mapping Language (RML)

@prefix rr: <http://www.w3.org/ns/r2rml#> .

Subject Map: rr:template “http://ex.com/{id}”
  Predicate Map: rr:constant rdfs:label
  Object Map: rr:template “{firstname} {surname}”; rr:termType rr:Literal
  Predicate Map: rr:constant ex:works
  Object Map: rr:template “http://ex.com/{lab}”

Subject Map: rr:template “http://ex.com/{lab}”
  Predicate Map: rr:constant ex:located
  Object Map: rr:template “http://ex.com/{city}”

Page 81: #opentourism - Linked Open Data Publishing and Discovery Workshop

RDF Mapping Language (RML)

<#ResearcherMap>
Subject Map: rr:template “http://ex.com/{id}”
  Predicate Map: rr:constant rdfs:label
  Object Map: rr:template “{firstname} {surname}”; rr:termType rr:Literal
  Predicate Map: rr:constant ex:works
  Object Map: rr:template “http://ex.com/{lab}”

<#LabMap>
Subject Map: rr:template “http://ex.com/{lab}”
  Predicate Map: rr:constant ex:located
  Object Map: rr:template “http://ex.com/{city}”
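
To make the roles of the Subject Map, Predicate Map and Object Map more concrete, here is a hedged Python sketch that models `<#ResearcherMap>` and `<#LabMap>` as plain data and applies them to one record. The class names `TermMap` and `TriplesMap` and the expansion of `ex:` to `http://ex.com/` are assumptions made for this example; this is not the RML processor and it ignores most of the language (logical sources, joins, datatypes).

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple

RDFS = "http://www.w3.org/2000/01/rdf-schema#"
EX = "http://ex.com/"  # assumed expansion of the ex: prefix used on the slides


@dataclass
class TermMap:
    """Generates one RDF term, from a constant or from a template."""
    template: Optional[str] = None
    constant: Optional[str] = None
    literal: bool = False  # corresponds to rr:termType rr:Literal

    def generate(self, record: dict) -> str:
        if self.constant is not None:
            value = self.constant
        else:
            value = self.template.format(**record)
        return f'"{value}"' if self.literal else f"<{value}>"


@dataclass
class TriplesMap:
    """One subject map plus a list of (predicate map, object map) pairs."""
    subject_map: TermMap
    predicate_object_maps: List[Tuple[TermMap, TermMap]]

    def apply(self, record: dict) -> List[Tuple[str, str, str]]:
        subject = self.subject_map.generate(record)
        return [
            (subject, p.generate(record), o.generate(record))
            for p, o in self.predicate_object_maps
        ]


researcher_map = TriplesMap(
    subject_map=TermMap(template="http://ex.com/{id}"),
    predicate_object_maps=[
        (TermMap(constant=RDFS + "label"),
         TermMap(template="{firstname} {surname}", literal=True)),
        (TermMap(constant=EX + "works"),
         TermMap(template="http://ex.com/{lab}")),
    ],
)

lab_map = TriplesMap(
    subject_map=TermMap(template="http://ex.com/{lab}"),
    predicate_object_maps=[
        (TermMap(constant=EX + "located"),
         TermMap(template="http://ex.com/{city}")),
    ],
)

record = {"id": "1", "firstname": "Anastasia", "surname": "Dimou",
          "lab": "DSLab", "city": "Ghent"}

for triples_map in (researcher_map, lab_map):
    for s, p, o in triples_map.apply(record):
        print(s, p, o, ".")
```

Keeping the mapping as data rather than code is the point of a declarative approach: the same small processor can run any mapping, and the mapping can be edited without touching the processor.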

Page 82: #opentourism - Linked Open Data Publishing and Discovery Workshop

Extraction Module Mapping Module

RML Processor

Page 83: #opentourism - Linked Open Data Publishing and Discovery Workshop

Defining Mappings to generate Linked Data

Editing Mappings

Retrieving Input Data

Page 84: #opentourism - Linked Open Data Publishing and Discovery Workshop

RML describes rules to map any structured data to RDF

RML supports any data independently of
which structure and format they have
where they originally reside
how they are accessed & retrieved

Page 85: #opentourism - Linked Open Data Publishing and Discovery Workshop

Specifying data
which data form a data input
how to reference data input extracts

Accessing & Retrieving data
data input from original source(s)

Page 87: #opentourism - Linked Open Data Publishing and Discovery Workshop

Support data in Heterogeneous Structures

tabular-structured

hierarchical-structured

(semi-)structured

… … …

Page 88: #opentourism - Linked Open Data Publishing and Discovery Workshop

Support data in Heterogeneous Structures and Formats

tabular-structured: tables in DBs or CSV files …

hierarchical-structured: JSON or XML …

(semi-)structured: HTML …

… … …

Page 89: #opentourism - Linked Open Data Publishing and Discovery Workshop

tabular-structured data

id  firstname  surname   lab
1   Anastasia  Dimou     DSLab
2   Ruben      Verborgh  DSLab
3   Erik       Mannens   DSLab

<#ResearcherMap>
Subject Map: rr:template “http://ex.com/{id}”
  Predicate Map: rr:constant rdfs:label
  Object Map: rr:template “{firstname} {surname}”; rr:termType rr:Literal
  Predicate Map: rr:constant ex:works
  Object Map: rr:template “http://ex.com/{lab}”

Page 90: #opentourism - Linked Open Data Publishing and Discovery Workshop

hierarchical-structured data

<labs>
  <lab>
    <short>MMLab</short>
    <title>Multimedia Lab</title>
    <location>
      <city>Ghent</city>
    </location>
  </lab>
  <lab> …. </lab>
  …
</labs>

<#LabMap>
Subject Map: rr:template “http://ex.com/{/labs/lab/short}”
  Predicate Map: rr:constant ex:located
  Object Map: rr:template “http://ex.com/{/labs/lab/location/city}”
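
For hierarchical data the references in the templates are paths rather than column names. The sketch below uses Python's standard `xml.etree.ElementTree` to iterate over each `lab` element and resolve the two paths relative to it. It is an assumption-laden illustration: ElementTree supports only a subset of XPath, the `http://ex.com/located` IRI is an assumed expansion of `ex:located`, and a real RML processor handles iteration and references differently.

```python
import xml.etree.ElementTree as ET

XML = """<labs>
  <lab>
    <short>MMLab</short>
    <title>Multimedia Lab</title>
    <location>
      <city>Ghent</city>
    </location>
  </lab>
</labs>"""

EX = "http://ex.com/"  # assumed expansion of the ex: prefix


def lab_triples(root):
    """Produce one 'located' triple per <lab> element under <labs>."""
    triples = []
    for lab in root.findall("lab"):            # iterates over /labs/lab
        short = lab.findtext("short")          # /labs/lab/short
        city = lab.findtext("location/city")   # /labs/lab/location/city
        triples.append((EX + short, EX + "located", EX + city))
    return triples


root = ET.fromstring(XML)
for s, p, o in lab_triples(root):
    print(f"<{s}> <{p}> <{o}> .")
```

Only the way values are referenced changes compared with the tabular case; the templates and the generated triples have the same shape.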

Page 91: #opentourism - Linked Open Data Publishing and Discovery Workshop

Specifying data
which data form a data input
how to reference data input extracts

Accessing & Retrieving data
data input from original source(s)

Page 92: #opentourism - Linked Open Data Publishing and Discovery Workshop

[Diagram: RML Processor. The Mapping module takes input data and a map doc and produces output RDF.]

Page 93: #opentourism - Linked Open Data Publishing and Discovery Workshop

[Diagram: RML Processor. For each data source and its access interface, the Retrieval module uses a source description to obtain the input data; the Mapping module combines the input data with the map doc and produces output RDF.]

Page 94: #opentourism - Linked Open Data Publishing and Discovery Workshop

Support different Locations and Access Interfaces

Local File(s)

Database connectivity

Web source(s)

RDF source(s)

Page 95: #opentourism - Linked Open Data Publishing and Discovery Workshop

Support different Locations and Access Interfaces

Local File(s)

Database connectivity: D2RQ

Web source(s) (Web API/service): DCAT, CSVW, Hydra, VOiD (Dataset)

RDF source(s): VOiD (Endpoint), SPARQL-SD
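
One way to picture a retrieval step that supports several access interfaces is a dispatcher keyed on a source description. The sketch below is only an assumption about how this could look: the dictionary keys (`type`, `path`, `database`, `query`, `url`) are hypothetical and do not correspond to the D2RQ, DCAT, Hydra or SPARQL-SD vocabularies, and SQLite stands in for a generic database.

```python
import sqlite3
import urllib.request
from contextlib import closing
from pathlib import Path


def retrieve_file(desc: dict) -> str:
    """Read a local file and return its text."""
    return Path(desc["path"]).read_text(encoding="utf-8")


def retrieve_database(desc: dict) -> list:
    """Run a query against a SQLite database and return the rows as dicts."""
    with closing(sqlite3.connect(desc["database"])) as conn:
        conn.row_factory = sqlite3.Row
        return [dict(row) for row in conn.execute(desc["query"])]


def retrieve_web(desc: dict) -> str:
    """Fetch a Web resource over HTTP(S) and return its body as text."""
    with urllib.request.urlopen(desc["url"]) as response:
        return response.read().decode("utf-8")


# Hypothetical source types mapped to the function that knows how to access them.
RETRIEVERS = {
    "file": retrieve_file,
    "database": retrieve_database,
    "web": retrieve_web,
}


def retrieve(desc: dict):
    """Pick the retriever that matches the source description and run it."""
    try:
        retriever = RETRIEVERS[desc["type"]]
    except KeyError:
        raise ValueError(f"unsupported source type: {desc.get('type')!r}")
    return retriever(desc)
```

A call such as `retrieve({"type": "file", "path": "labs.xml"})` would return the file content for the mapping step, so the mapping itself never needs to know where the data came from.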

Page 96: #opentourism - Linked Open Data Publishing and Discovery Workshop

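[Diagram: RML Processor with a Retrieval module and a Mapping module. Using a source description, the Retrieval module accesses file.xml, a Web API (DCAT), a data repo behind a Web API (Hydra), a database (JDBC, D2RQ) and a triplestore (SPARQL). The resulting XML, JSON and tabular data, together with the map doc, feed the Mapping module, which outputs RDF.]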

Page 97: #opentourism - Linked Open Data Publishing and Discovery Workshop

Defining Mappings to generate Linked Data

Retrieving Input Data

Editing Mappings

Page 98: #opentourism - Linked Open Data Publishing and Discovery Workshop

Semantic Web experts Vs. Data specialists

Page 99: #opentourism - Linked Open Data Publishing and Discovery Workshop

Semantic Web experts Vs. Data specialists

Modeling Domain Knowledge as Linked (Open) Data is not straightforward for Data Specialists

Page 100: #opentourism - Linked Open Data Publishing and Discovery Workshop

Semantic Web experts Vs. Data specialists

Modeling Domain Knowledge as Linked (Open) Data is not straightforward for Data Specialists

Data context is not straightforward for Semantic Web experts

Page 101: #opentourism - Linked Open Data Publishing and Discovery Workshop

Semantic Web experts Vs. Data specialists

Data Specialists should be able to specify the mappings, modify and extend them at any time

Page 102: #opentourism - Linked Open Data Publishing and Discovery Workshop
Page 103: #opentourism - Linked Open Data Publishing and Discovery Workshop
Page 104: #opentourism - Linked Open Data Publishing and Discovery Workshop
Page 105: #opentourism - Linked Open Data Publishing and Discovery Workshop

Defining Mappings to generate Linked Data

Editing Mappings

Retrieving Input Data

Page 106: #opentourism - Linked Open Data Publishing and Discovery Workshop

The five stars of the Linked Open Data scheme should not be approached as a set of consecutive steps

Page 107: #opentourism - Linked Open Data Publishing and Discovery Workshop

Well-considered policy regarding mapping and interlinking of data in the context of a certain knowledge domain

Page 108: #opentourism - Linked Open Data Publishing and Discovery Workshop

GEO > OPEN

Geraldine Nolf Flemish Information Agency

@natadimou #opentourism

Page 109: #opentourism - Linked Open Data Publishing and Discovery Workshop

The way to open geographical data in Flanders?

> Geographical data in Flanders
  ▪ Must follow its obligations, regulations, standards, …

> E.g. SDI-Flanders partnership (decree, implementing decree, commandments, …)

> MUSTs for describing metadata:
  ▪ ISO > 19115 (datasets/series), 19119 (services), 19139 (implementation), 19110 (feature catalogue)
  ▪ INSPIRE > Directive: Implementing Rules on Metadata + Technical Guidelines on Metadata
  ▪ SDI-Flanders > Best Practices on Metadata v1.0 (metadata for datasets, dataset series, services, feature catalogues)
  ▪ OGC / INSPIRE > Catalogue Service for the Web (OGC Catalogue Services Specification 2.0.2) = Discovery Service (INSPIRE) = Zoekdienst (SDI-Flanders)

> Geoportal: http://www.geopunt.be

Page 110: #opentourism - Linked Open Data Publishing and Discovery Workshop

The way to open geographical data in Flanders?

> Open data portal in Flanders:
  ▪ SHOULD follow its manual

> SHOULD HAVEs for describing metadata:
  ▪ DCAT-AP

> Portal: http://opendata.vlaanderen.be/
  ▪ CKAN

Page 111: #opentourism - Linked Open Data Publishing and Discovery Workshop

The way to open geographical data in Flanders?

> So, do I have to describe my data on both portals?

> The power of metadata:
  ▪ Describe it only once
  ▪ Create multiple uses

Page 112: #opentourism - Linked Open Data Publishing and Discovery Workshop

The way to open geographical data in Flanders?

> Follow the SDI-track to describe metadata (MUST)
  ▪ Appear on http://www.geopunt.be
  ▪ AND appear on http://opendata.vlaanderen.be/

> Because of:
  ▪ Set keyword “Open data in Flanders” in the metadata record
  ▪ Via CSW harvest of Geopunt metadata
  ▪ With filter on keyword “Open data in Flanders”
  ▪ Mapping to DCAT (XSLT: ISO 19139 to DCAT)
  ▪ Mapping to CKAN
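
The "describe once, use many times" flow can be sketched as a filter followed by a mapping. The real pipeline harvests ISO 19139 XML over CSW and converts it with XSLT; the Python below replaces all of that with plain dictionaries, so the record fields (`id`, `title`, `keywords`) and the sample records are hypothetical and serve only to show the keyword filter and a DCAT-style output.

```python
OPEN_KEYWORD = "Open data in Flanders"

# Made-up stand-ins for harvested metadata records (not real Geopunt data).
HARVESTED = [
    {"id": "rec-1", "title": "Example dataset A",
     "keywords": ["addresses", "Open data in Flanders"]},
    {"id": "rec-2", "title": "Example dataset B",
     "keywords": ["orthophoto"]},
]


def is_open(record: dict) -> bool:
    """Keep only records tagged with the open-data keyword."""
    return OPEN_KEYWORD in record.get("keywords", [])


def to_dcat(record: dict) -> dict:
    """Map one simplified metadata record to a DCAT-style description."""
    return {
        "@type": "dcat:Dataset",
        "dct:identifier": record["id"],
        "dct:title": record["title"],
        "dcat:keyword": list(record["keywords"]),
    }


def harvest_open_datasets(records):
    """Filter the harvested records on the keyword, then map them to DCAT."""
    return [to_dcat(r) for r in records if is_open(r)]


for dataset in harvest_open_datasets(HARVESTED):
    print(dataset["dct:identifier"], "-", dataset["dct:title"])
```

The data owner only sets one keyword in the geo metadata record; everything after that is automated, which is why the dataset shows up on both portals without being described twice.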

Page 113: #opentourism - Linked Open Data Publishing and Discovery Workshop


Page 114: #opentourism - Linked Open Data Publishing and Discovery Workshop

Open Discussion

Bruno Koninckx - Thomas More

@brunokon #opentourism

Page 115: #opentourism - Linked Open Data Publishing and Discovery Workshop

Open Discussion

Page 116: #opentourism - Linked Open Data Publishing and Discovery Workshop

Anastasia @natadimou

Raf @rafke

Bruno @BrunoKon

Page 117: #opentourism - Linked Open Data Publishing and Discovery Workshop

Join our community! http://tourism.openknowledge.be

#opentourism |raf{dot}buyle{at}okfn{dot}be

Page 118: #opentourism - Linked Open Data Publishing and Discovery Workshop

A world where knowledge creates power for the many, not the few.
A world where data frees us - to make informed choices about how we live, what we buy and who gets our vote.
A world where information and insights are accessible - and apparent - to everyone.
This is the world we choose.

#openbelgium16 #opentourism