opencube project general presentation
DESCRIPTION
A general introduction to the OpenCube project: - background - objectives - developed tools - partners - pilotsTRANSCRIPT
OpenCube Publishing and Enriching Linked Open Statistical Data for the
Development of Data Analytics and Enhanced Visualization Services
▪More than 180 Open Government Data portals around the globe provide data that “can be freely used, reused and redistributed by anyone”
2
OpenCube background - OGD
▪A big portion of Open Government Data concerns statistics such as population figures, economic and social indicators
▪For example, the majority (5867 out of 6098 datasets) of the data published on the EU Open Data Portal are of statistical nature
3
OpenCube background - Statistics
4
Example: Flemish Government
▪Linked data is a well established paradigm for publishing open data on the Web.
▪ It enables semantically enriching data and linking data that resides in disparate sources.
▪Employing linked data introduces complexity in data publishing
5
OpenCube background – Linked Data
▪ Linked data could facilitate performing data analytics on top of combined datasets that were previously closed or isolated, hence potentially providing: ▪ Unexpected results
▪ Unexplored relationships
▪ However, dealing (publishing/reusing) with linked statistical data introduces complexity and raises the barrier for reusing open data. ▪ e.g. Modelling multi-dimensional
data
6
OpenCube background – Linked Statistical Data
▪The ultimate goal of OpenCube project is to facilitate ▪ publishing of high-quality linked statistical data
▪ reusing distributed linked statistical datasets to perform advanced data analytics and visualizations
▪OpenCube will support the whole lifecycle of statistical linked open data.
7
OpenCube Aim
8
OpenCube Publish and Reuse
OpenCube Lifecycle
9
▪Objective I: To derive and analyse users’ requirements regarding publishing and reusing of linked open statistical data (LOSD)
▪Objective II: To design the OpenCube reference architecture
▪Objective III: To develop the OpenCube toolkit comprising open-source standalone tools for publishing and reusing LOSD
▪Objective IV: To develop the OpenCube extensions of two popular linked data management platforms
▪ Swirrl’s PublishMyData
▪ fluidOps’ Information Workbench
▪Objective V: To validate and evaluate the research results by developing three proof-of-concept pilots
10
OpenCube Objectives
11
OpenCube Partners
Swiss Bank
▪OpenCube project will develop software tools ▪ Open source OpenCube Toolkit based on fluidOps’ Information
Workbench (open source).
▪ Extension for Swirrl’s PublishMyData
▪ Extension for fluidOps’ Information Workbench (commercial)
12
OpenCube Tools
Information Workbench Linked Data and Semantic Technologies in the Enterprise
!▪ Open standards and technologies
• Semantic Wiki based frontend (Using SMW Syntax) !
• Supporting W3C standards (OWL, RDF, SPARQL, …)!• Community Edition (Open Source) + Enterprise Edition
(Commercial)
▪ Semantics- & Linked Data-based integration of private and public data sources based on data providers • Generic and specific providers for various data
formats and sources • Supports established mapping frameworks (e.g.
R2RML, SILK, …) • Named graphs for managing contexts and
provenance ▪ Intelligent Data Access and Analytics
• Flexible self-service UI • Visualization, exploration, dashboarding and
reporting • Semantic search
▪ Collaboration and knowledge management • Curation & authoring • Collaborative workflows
▪Data Cube vocabulary ▪http://www.w3.org/TR/vocab-data-cube/
▪SKOS and XKOS ▪http://www.w3.org/2004/02/skos/ ▪http://www.ddialliance.org/Specification/RDF/XKOS
14
Using standards
▪A W3C Recommendation for publishing statistical data and metadata according to the Linked Data principles, based on SDMX
15
Data Cube vocabulary
16
SDMX
Data Cube Scope▪Data structure definition
▪ Specification of dimensions, measures, attributes
▪ Specification of code lists
▪ Specification of concepts
▪Dataset
▪ Observation data
▪ Including dimensions, measures, attributes
▪ “Slices” (time series or cross-sections of lower dimensionality within the cube)
17
The model
18
Life expectancy within Welsh Unitary authorities
Data Cube Vocabulary and Linked Data
▪Support use of existing URI sets and SKOS vocabularies as code lists
▪Make code lists available on the web for other purposes (as SKOS vocabularies)
▪Concepts, properties et al. can be shared between data sets
▪Everything has a URI and can be linked to and annotated
20
SKOS
20
SKOS
21
OpenCube Pilots
Swiss Bank
▪http://www.opencube-project.eu ▪Twitter: @OpenCubeProject !▪With presentations to be found at: ▪Slideshare: http://www.slideshare.net/OpenCubeProject
22
OpenCube stay up to date