lod2 review meeting
TRANSCRIPT
Introducing a New Product
Creating Knowledge out of Interlinked Data
LOD2 Presentation . 02.09.2010 . Page
http://lod2.eu
[email protected]
[email protected]@i2g.pl
Luxembourg, Sep 14, 2012WP9: Use Case 3: LOD2 for Citizens
Creating Knowledge out of Interlinked Data
LOD2 Event . 06.09.2010 . Page
http://lod2.eu
Year 2 Deliverables (OKFN)D 9.1.1. Report on first release of the Publicdata.eu website Improvements to Publicdata.eu during the past year
D 9.3.1. Presentation on publishing Linked Data
D 9.3.2. Guide to publishing Linked Data
D 9.4 Report on publication of eGovernment Linked Open Data
Addressing Y1 Review Comments
Next steps
D 9.2.1. Further technical improvements to Publicdata.eu (personalization features)
Community engagement with Publicdata.eu
Year 2 Deliverables (Serbian CKAN, Instytut Informatyki Gospodarczej)D 9.5.1. Establishment of the Serbian CKAN
D 9.6. Requirements and Resources used by the Polish Ministry of Economy
Next stepsD 9.7.1. Adaptation of the LOD2 stack for Polish Ministry of Economy
Agenda
Hello everyone,My name is . and I am the new PM overseeing some sections of WP9 from OKFN's side. This presentation includes the work that was spearheaded by OFKFN, as well as Serbian CKAN (9.5.) and the the Requirements and Resources for the Polish Ministry of Economy (9.6.)The slides were drafted to highlight some of the work that was done for WP9 during the course of this year. Slide 2 showcases the deliverables that are part of this WP
Creating Knowledge out of Interlinked Data
LOD2 Event . 06.09.2010 . Page
http://lod2.eu
WP9 Objectives
The purpose of this PublicData.eu use case is to increase public access to high-value, machine-readable datasets generated by the European, national as well as regional governments and public administrations.
Hello everyone,My name is . and I am the new PM overseeing some sections of WP9 from OKFN's side. This presentation includes the work that was spearheaded by OFKFN, as well as Serbian CKAN (9.5.) and the the Requirements and Resources for the Polish Ministry of Economy (9.6.)The slides were drafted to highlight some of the work that was done for WP9 during the course of this year. Slide 2 showcases the deliverables that are part of this WP
Creating Knowledge out of Interlinked Data
LOD2 Presentation . 02.09.2010 . Page
http://lod2.eu
Year 2 Deliverables (OKFN)
Creating Knowledge out of Interlinked Data
LOD2 Event . 06.09.2010 . Page
http://lod2.eu
D 9.1.1. First release of Publicdata.eu
Submitted a thorough report summarizing our work on Publicdata.eu it's existing features, previous launches and plans for future improvements: http://svn.aksw.org/lod2/D9.1.1/
Year 2 Deliverables
Creating Knowledge out of Interlinked Data
LOD2 Event . 06.09.2010 . Page
http://lod2.eu
Year 2 Deliverables
Publicdata.eu Overview
PublicData.eu is pan-European data catalogue and federation mechanism, developed by OKF as part of WP9. Based on the CKAN open-source data portal software, the site is a use case for the citizen aiming to make data as accessible and re-usable as possible. It is a read-only aggregation of both official and community data portals across the EU.
Creating Knowledge out of Interlinked Data
LOD2 Event . 06.09.2010 . Page
http://lod2.eu
6
Year 2 Deliverables
Key Stats
Publicdata.eu provides robust search, filtering and previewing tools
It currently houses 17027 data sets, harvested from 18 data catalogues, and it provides the option to browse data sets by top level categories
Creating Knowledge out of Interlinked Data
LOD2 Event . 06.09.2010 . Page
http://lod2.eu
6
Technical improvements to Publicdata.eu during the past year
In March 2012 we upgraded PublicData.eu to CKAN version 1.6, adding the data preview functionality (powered by Recline), improvements to search, interface improvements to dataset pages, newly added resource (file) pages and group pages.
We also re-ran all the harvesters to have the most up to date set of datasets. Some catalogues have been migrated to groups on thedatahub.org and therefore can't currently be harvested without also including non EU datasets. In the future we may resolve this by extending the harvester to allow us to specify which groups or tags should be harvested. This would allow us to import relevant datasets from thedatahub.org without importing non EU datasets.
Many new CKAN instances have recently been launched by various countries, which we plan to include in publicdata.eu for the intermediate launch (August 2013)
Year 2 Deliverables
Creating Knowledge out of Interlinked Data
LOD2 Event . 06.09.2010 . Page
http://lod2.eu
6
Year 2 Deliverables
Amount of RDF data in publicdata.euWe added the Serbian CKAN to Publicdata.eu bringing their RDF data
Since Publicdata.eu is currently only a read-only portal, we have focused on encouraging the source catalogs to increase their RDF dataour deliverables of 9.3.1 and 9.3.2 facilitate this (presentation & guide on publishing linked open data)
worked with consortium partners to produce more RDF data for the eGovernment report (will cover in next slides)
In future launches we will allow users to add data, meaning we can add converted datasets to be accessible through publicdata.eu
Additionally we plan to improve our harvesting, allowing us to harvest groups to increase the opportunities for what datasets can be added to publicdata.eu (including groups on thedatahub.org)
Addressing Year 1 Review meeting comments
Creating Knowledge out of Interlinked Data
LOD2 Event . 06.09.2010 . Page
http://lod2.eu
6
Year 2 Deliverables
D 9.3.1. Presentation on publishing Linked Open Data
Overview
In Feb 2012, OKFN put together a best practices presentation regarding the publishing, linking and utilizing Open Data. The presentation is easy accessible for the non technical eye and details the economic, transparency, policy, and efficiency benefits for Governments to publish open data. Other aspects such as licensing, registering and getting the data online are also included in this presentation.
This is aimed to be a detailed resource that anybody can use when referring to Linked Open Data.
Current stateThe presentation can be found here:
[http://svn.aksw.org/lod2/D9.3/D9.3.1/D9.3.1-presentation.pdf]
Creating Knowledge out of Interlinked Data
LOD2 Event . 06.09.2010 . Page
http://lod2.eu
6
Year 2 Deliverables
OverviewThe Guide [http://svn.aksw.org/lod2/D9.3/D9.3.2/] is written in clear, non-technical language and introduces the reader to the concepts, rationale and tools of Linked Open Data, as well as providing a high-level overview of the publishing process. It has a particular focus on public-sector data, and aims to arm decision-makers with an understanding of Linked Data and the steps necessary to start publishing it.
Expected Impact
The guide will be published on the OKFN website [http://lod2.okfn.org/] where it is hoped that it will become a standard reference document, helping organizations that need to make decisions about whether and how to publish Linked Open Data.
D 9.3.2. Guide to publishing Linked Open Data
Section 1 rehearses the arguments of Open Data (how Governments are moving towards making their data available freely) whereas section 2 provides a full non technicalexplanation of Linked Data (concepts such as the 5 stars of LOD are presented) . Section 3 refers to the LOD life cycle (explains high level concepts such RDF, schemas, triple stores aso). Section 4 describes a step by step way to publish LOD and describes the tools in the LOD2 Stack. Step 5 presents some of the case studies of LOD2. Most known are the EC Financial Transparency System (all grants from the EC since 2007), the Global Health Observatory data set (stats for monitoring public health), Digital Agenda Scoreboard shows progression of countries in relation to DAE, Legal Thesauri by Wolters Kluewer (commercial publisher of legal info)
Creating Knowledge out of Interlinked Data
LOD2 Event . 06.09.2010 . Page
http://lod2.eu
6
Year 2 Deliverables
D 9.4. Report on publication of eGovernment Linked Open Data
OverviewThe report [http://svn.aksw.org/lod2/D9.4/] summarizes our assessment of the current state of linked data publishing by European Governments and organizations;
Additionally it highlights some of the work that LOD2 partners have been doing to publish more linked data (Publink initiative, Guides and Documentation)
The report details some of the benefits of publishing linked open data as well as the current technical and legal barriers preventing the publishing of more linked data and our proposed approach to increasing the amount of high quality linked data published during the next phase of the LOD2 project.
The report contains two Appendices:
Appendix A - a collection of 9 use cases, showcasing the benefits of LODAppendix B - presenting theLODStats system developed for high performance statistical analysis
Several public authorities(such as: UK Government White Paper, EC commissioner Neelie Kroes) are acknowledging the benefits of LOD (Organizations meet the transparency requirements, and more meeting is provided to data sets by placing them in context with other datasets); in this respect the EC also funded the LATC project that converted approx 20 sets over the past years.
DatasetProject URLTriples
WHOs Global Health Observatoryhttp://gho.aksw.org/273k
European Digital Agenda Scoreboardhttp://data.lod2.eu/scoreboard/127k
National Accounts Linked Data for the UK and Serbiahttp://ukstatistics.lod2.eu/
http://rs.ckan.net/dataset/rzs-national-accounts645k (UK)
10 million (Serbia)
World Bank Data as Linked Datahttp://csarven.ca/statistical-linked-dataspaces165 million
German Labour Law & Courts Thesaurihttp://vocabulary.wolterskluwer.de/150k
German Federal Ministry of Financehttp://data.lod2.eu/gfmf/ 2 million
UK public data setshttp://thedatahub.org/en/dataset/uk-gdp-since-1948http://thedatahub.org/en/dataset/epims-lod2http://thedatahub.org/en/dataset/uk-criminal-justice12 million (total)
LinkedGeoDatahttp://linkedgeodata.org 20 billion
Wiktionaryhttp://wiktionary.dbpedia.org 100 million
Czech tender datahttp://ld.opendata.cz:8900/sparql 1071859
Appendix A Open Data releases
Creating Knowledge out of Interlinked Data
LOD2 Presentation . 02.09.2010 . Page
http://lod2.eu
Next Steps (OKFN)
Creating Knowledge out of Interlinked Data
LOD2 Event . 06.09.2010 . Page
http://lod2.eu
Next Steps
D 9.2.1. Further technical improvements to Publicdata.eu
Improvements scheduled for the Dec 2012 release (Further personalization features)
Datasets ratings
Allow users to add/revise their own data sets
User tools to enable mash-ups and visualization of data
App marketplace for users to upload their own visualizations, stories and apps
Allow user commenting on datasets
Activity streams and follow support (i.e. allowing users to subscribe to activity updates)
Social / sharing buttons
Creating Knowledge out of Interlinked Data
LOD2 Event . 06.09.2010 . Page
http://lod2.eu
Next Steps
D 9.2.1. Further technical improvements to Publicdata.eu
Improvements scheduled for the Aug 2013 interim release
CKAN core technology improvements (Harvesting)Optimize & automate the harvesting process
Add further harvesters (to increase number of data and coverage)
Ability to only harvest changed data
Ability to harvest part of a site (e.g. a particular group vs whole catalog)
Additional featuresAdding more advanced multilingual capabilities to the portal to support its Europe-wide coverage
Add upgraded triple store and SPAQRL endpoint
Creating Knowledge out of Interlinked Data
LOD2 Event . 06.09.2010 10 Page
http://lod2.eu
Next Steps
Community Engagement for Publicdata.eu
We can divide our community building and engagement strategy around PublicData.eu into two main clusters: supply & demand
Objectives on the supply side:
Engage more with data publishers by building (a) a stronger community of official representatives and data catalogue maintainers around PublicData.eu and (b) consensus around key legal and technical standards (e.g. making metadata explicitly open, enabling data catalog interoperability)
Establish datacatalogs.org as the de facto place to go to find out about data catalogs around the world - and encourage data catalog maintainers and other official contact points to maintain up to date information about national, regional and local catalogs, and lists of catalogs
Creating Knowledge out of Interlinked Data
LOD2 Event . 06.09.2010 11 Page
http://lod2.eu
Next Steps
Community Engagement for Publicdata.eu
Objectives on the demand side:
To build a stronger and better connected community of open data re-users from across EU27 around PublicData.eu
Continue to identify and pursue opportunities to engage with the Linked Data community and to use the LOD2 Stack to publish Linked Data derived from PublicData.eu.
We are hoping to achieve this by performing the following activities:
-Organize 2-4 OKFN Labs sprints per year on a variety of different topics and disseminateresults via press releases, media contacts and partners -Promote PublicData.eu at events, workshops and hackdays across EU27 -Dissemination via blogs, guest posts and articles on third party sites, and press releases
Creating Knowledge out of Interlinked Data
LOD2 Presentation . 02.09.2010 . Page
http://lod2.eu
Year 2 Deliverables (Serbian CKAN, Instytut Informatyki Gospodarczej)
SORSServer 1
LOD2
Online
dissemination
DB
XSLT
Server 2Serbian CKAN
RDFCKAN
publishing
search
Server 3Publicdata.euCKAN
import
search
Code listsLOD2
http://elpo.stat.gov.rs/lod2/RS-DATAhttp://elpo.stat.gov.rs/lod2/RS-DIC http://rs.ckan.net search
D 9.5.1 Establishing Serbian CKAN - Infrastructure for Public Sector Information
Year 2 Deliverables
National accounts
Prices
Usage of ICT
Science, Technology and Innovations
Goal
Identify the requirements of Polish Ministry of Economy for publication of the data
Analyze changes of data over time, temporal and topical scope
Prepare for adoption of LOD2 Stack for publication of Ministry data
Status
Delivered on time for M20
Work continues on Task 9.7
D9.6 Requirements and resources used by the Polish Ministry of Economy
Year 2 Deliverables
http://data.gov.pl Current State
Year 2 Deliverables
Need for data
Requirements Querying
INSIGOSInternet System for Business Information
Access to statistical data concerning economy and foreign trade
POLGOS - presentation of comparative data concerning Polish economy
HZ - information about Polish foreign trade
ENERGY mission: energy security
Challenges - Multidimensional database (data ware House) - Possible linking to source for drilling-down - Not up to date probably needs a supplementing process - ENERGY many files in ugly-formatted Excel files
D9.6 Key Points
Year 2 Deliverables
CEIDGCentral Register and Information on Economic Activity
access to data concerning natural persons businesses
references to other registries
ca. 2.9 million records
Challenges -data is not clean -available via API -dynamic data set: ~1000/1000 applications for de-/registration daily -snapshots evolution phase of LOD2 LifecyclePublic procurement data - Pulished in XML, volume of data in 2011 alone: 828MB
D9.6 Key Points
Year 2 Deliverables
No sophisticated tools used at MoE
Groups of requirements
Data Acquisition 7 requirements
Data Processing/Transformation 2 requirements
Publication 3 requirements
Data Analysis 2 requirements
Alignment with LOD2 Life Cycleall 8 phases seem to be important but
Alignment with LOD2 Stackcrucial components identified
D2R/Triplify, Virtuoso, CKAN, PoolParty, Ontowiki, Silk, Sigma
D9.6 Requirements - Summary
Year 2 Deliverables
Creating Knowledge out of Interlinked Data
LOD2 Presentation . 02.09.2010 . Page
http://lod2.eu
Next Steps (Instytut Informatyki Gospodarczej)
Task 9.7 Adoption of the LOD2 Stack for Polish economy data
(I2G)
Goaladaptation of the LOD2 Stack to the requirements of Polish Ministry of Economy
identification of crucial components and how to configure and link them
Statusfirst deliverable D9.7.1 scheduled for M30
identification of existing functionalities in the working infrastructure
first vocabularies linked using Silk Workbench
Next stepsfinishing and cleaning vocabulary
design of data model using SDMX vocabulary
filling in the model
Establishing the Polish CKAN
Next Steps
Creating Knowledge out of Interlinked Data
LOD2 Presentation . 02.09.2010 . Page
http://lod2.eu
Thank you for your attention!
EU-FP7 LOD2 WP10 22.-23.9.2011. 02.09.2010 . Page http://lod2.eu
Creating Knowledge out of Interlinked Data
EU-FP7 LOD2 WP6 13.-14.09.2012. Page http://lod2.eu
Creating Knowledge out of Interlinked Data
Click to edit the outline text formatSecond Outline LevelThird Outline LevelFourth Outline LevelFifth Outline LevelSixth Outline LevelSeventh Outline LevelEighth Outline Level
Ninth Outline LevelMastertextformat bearbeitenZweite Ebene Dritte Ebene
Vierte Ebene
Fnfte Ebene
Click to edit the title text formatMastertitelformat bearbeiten
EU-FP7 LOD2 Project Overview . Page http://lod2.eu
Creating Knowledge out of Interlinked Data
EU-FP7 LOD2 Project Overview . Page http://lod2.eu
Creating Knowledge out of Interlinked Data
Click to edit the outline text formatSecond Outline LevelThird Outline LevelFourth Outline LevelFifth Outline LevelSixth Outline LevelSeventh Outline LevelEighth Outline Level
Ninth Outline LevelMastertextformat bearbeitenZweite Ebene Dritte Ebene
Vierte Ebene
Fnfte Ebene
Click to edit the title text formatMastertitelformat bearbeiten