putting sdmx into practice - unescap.org concepts... · 4/30/2014 2 sdmx • standard formats for...
TRANSCRIPT
4/30/2014
1
SDMX – Concepts and toolsPutting SDMX into practice
ADB/ESCAP SDMX Capacity Building InitiativeBangkok, 28-30 April 2014
Why do we need SDMX ?
• Common language for statistical d t d t d t hdata and metadata exchange
– Variety of formats used within NSI
– International data exchange
• Common data transmission format for statistical data and metadata
4/30/2014
2
SDMX
• Standard formats for data and metadata
• Content guidelines
• IT architecture for exchange of data and metadata
Organizations are free to make use of whichever elements of SDMX are most
appropriate in a given case
Statistical data
4/30/2014
3
Dimensions, attributes and measures
• Dimensions describe the data and form a the id tifi (ID) f th l t d d tidentifier (ID) of the related data
• Attributes provide additional information to qualify the data (typically: unit, status of the data (provisional, estimate…)
• Measure: value of the phenomenon observedMeasure: value of the phenomenon observed
Code list
• Most of dimension values are defined in code li tlist
– Makes SDMX language independent
• Attributes can be defined in a code list or as free text
4/30/2014
4
Example of code list
4/30/2014
5
Metadata
• Structural metadata
– Concepts used in the description and identification of statistical data and metadata
• Reference metadata
– Additional explanatory metadata, for example onAdditional explanatory metadata, for example on the methodology used or quality aspects
Structural Metadata
• Identify and describe the data
• Corresponds to a dimension in a data cube
• Arrange in Structure Definitions
– Data Structure Definitions (DSD)
– Metadata Structure Definitions (MSD)
4/30/2014
6
Reference Metadata
• Conceptual metadata, describing the concepts used and their practical implementationused and their practical implementation;
• Methodological metadata, describing methods used for the generation of the data;
• Quality metadata, describing the different quality aspects of the statistical dataquality aspects of the statistical data.
• Structured according to a Metadata Structure Definition (MSD)
4/30/2014
7
Content oriented guidelines
• Guidelines with the scope of SDMX
– To achieve better interoperability between organizations
– Use is encouraged
• 3 areas
– Cross‐domain conceptsCross domain concepts
– Statistical subject‐matter domains
– A Metadata common vocabulary
Cross‐domain concepts
• List of statistical concepts related to statistical d d t litprocess and data quality
Example
4/30/2014
8
Statistical subject‐matter domains
• high level classification of statistical areas
• starting point for organizing the exchange of statistical data and metadata
4/30/2014
9
Metadata Common Vocabulary
• contains concepts and related definitions used i t t l d f t d t fin structural and reference metadata of international organizations and national data producing agencies.
– General metadata concepts
– Metadata terms describing statistical gmethodologies and data quality
– Terms referring specifically to data and metadata exchange
Metadata Common Vocabulary
4/30/2014
10
IT Architecture
IT architecture
4/30/2014
11
XML
• Extensible Mark‐up Language
• used to describe the content and structure of data in a document
• XML is not designed for use directly by people, but is instead intended to deliver documents to computer applications over the Internetto computer applications over the Internet.
• Uses tags to describe the meaning and hierarchical structure of data
4/30/2014
12
Web service
Mapping process
• Data described differently by data providers d b d t ll tand by data collectors
• Need to harmonize structural metadata
• Mapping is necessary when local concepts are different from the corresponding concept in global DSDsglobal DSDs
4/30/2014
13
Ressources on SDMX
• www.sdmx.org
• https://webgate.ec.europa.eu/fpfis/mwikis/sdmx/index.php/Main_Page
SDMX Reference InfrastructureSDMX‐RI
• Universal framework for modern data provision d hand exchange
• Set of pick‐and‐choose reusable building blocks allowing a statistical office to expose data to the external world based on access rights
• Designed to provide data and structuralDesigned to provide data and structural metadata based on mappings to each organization's dissemination data warehouse
• Uses SDMX standards incl. one for Web Services
4/30/2014
14