opencube project general presentation

Post on 01-Jul-2015

286 Views

Category:

Presentations & Public Speaking

3 Downloads

Preview:

Click to see full reader

DESCRIPTION

A general introduction to the OpenCube project: - background - objectives - developed tools - partners - pilots

TRANSCRIPT

OpenCube Publishing and Enriching Linked Open Statistical Data for the

Development of Data Analytics and Enhanced Visualization Services

▪More than 180 Open Government Data portals around the globe provide data that “can be freely used, reused and redistributed by anyone”

2

OpenCube background - OGD

▪A big portion of Open Government Data concerns statistics such as population figures, economic and social indicators

▪For example, the majority (5867 out of 6098 datasets) of the data published on the EU Open Data Portal are of statistical nature

3

OpenCube background - Statistics

4

Example: Flemish Government

▪Linked data is a well established paradigm for publishing open data on the Web.

▪ It enables semantically enriching data and linking data that resides in disparate sources.

▪Employing linked data introduces complexity in data publishing

5

OpenCube background – Linked Data

▪ Linked data could facilitate performing data analytics on top of combined datasets that were previously closed or isolated, hence potentially providing: ▪ Unexpected results

▪ Unexplored relationships

▪ However, dealing (publishing/reusing) with linked statistical data introduces complexity and raises the barrier for reusing open data. ▪ e.g. Modelling multi-dimensional

data

6

OpenCube background – Linked Statistical Data

▪The ultimate goal of OpenCube project is to facilitate ▪ publishing of high-quality linked statistical data

▪ reusing distributed linked statistical datasets to perform advanced data analytics and visualizations

▪OpenCube will support the whole lifecycle of statistical linked open data.

7

OpenCube Aim

8

OpenCube Publish and Reuse

OpenCube Lifecycle

9

▪Objective I: To derive and analyse users’ requirements regarding publishing and reusing of linked open statistical data (LOSD)

▪Objective II: To design the OpenCube reference architecture

▪Objective III: To develop the OpenCube toolkit comprising open-source standalone tools for publishing and reusing LOSD

▪Objective IV: To develop the OpenCube extensions of two popular linked data management platforms

▪ Swirrl’s PublishMyData

▪ fluidOps’ Information Workbench

▪Objective V: To validate and evaluate the research results by developing three proof-of-concept pilots

10

OpenCube Objectives

11

OpenCube Partners

Swiss Bank

▪OpenCube project will develop software tools ▪ Open source OpenCube Toolkit based on fluidOps’ Information

Workbench (open source).

▪ Extension for Swirrl’s PublishMyData

▪ Extension for fluidOps’ Information Workbench (commercial)

12

OpenCube Tools

Information Workbench Linked Data and Semantic Technologies in the Enterprise

!▪ Open standards and technologies

• Semantic Wiki based frontend (Using SMW Syntax) !

• Supporting W3C standards (OWL, RDF, SPARQL, …)!• Community Edition (Open Source) + Enterprise Edition

(Commercial)

▪ Semantics- & Linked Data-based integration of private and public data sources based on data providers • Generic and specific providers for various data

formats and sources • Supports established mapping frameworks (e.g.

R2RML, SILK, …) • Named graphs for managing contexts and

provenance ▪ Intelligent Data Access and Analytics

• Flexible self-service UI • Visualization, exploration, dashboarding and

reporting • Semantic search

▪ Collaboration and knowledge management • Curation & authoring • Collaborative workflows

▪Data Cube vocabulary ▪http://www.w3.org/TR/vocab-data-cube/

▪SKOS and XKOS ▪http://www.w3.org/2004/02/skos/ ▪http://www.ddialliance.org/Specification/RDF/XKOS

14

Using standards

▪A W3C Recommendation for publishing statistical data and metadata according to the Linked Data principles, based on SDMX

15

Data Cube vocabulary

16

SDMX

Data Cube Scope▪Data structure definition

▪ Specification of dimensions, measures, attributes

▪ Specification of code lists

▪ Specification of concepts

▪Dataset

▪ Observation data

▪ Including dimensions, measures, attributes

▪ “Slices” (time series or cross-sections of lower dimensionality within the cube)

17

The model

18

Life expectancy within Welsh Unitary authorities

Data Cube Vocabulary and Linked Data

▪Support use of existing URI sets and SKOS vocabularies as code lists

▪Make code lists available on the web for other purposes (as SKOS vocabularies)

▪Concepts, properties et al. can be shared between data sets

▪Everything has a URI and can be linked to and annotated

20

SKOS

20

SKOS

21

OpenCube Pilots

Swiss Bank

▪http://www.opencube-project.eu ▪Twitter: @OpenCubeProject !▪With presentations to be found at: ▪Slideshare: http://www.slideshare.net/OpenCubeProject

22

OpenCube stay up to date

top related