![Page 1: Creating a National Data Architecture for Evidence-Based Policyggim.un.org/meetings/2020/WG-GI-Mexico-City/documents/5.Andrea... · Database 05.4 Implementation Stages Standardization](https://reader034.vdocument.in/reader034/viewer/2022051810/60168100a1f2356a0f3f2e17/html5/thumbnails/1.jpg)
Creating a National Data Architecture for Evidence-Based Policy
Andrea Fernandez
Deputy Director of Dissemination
March 10, 2020
Table of Contents
01 Business Problem
02 Basic Descriptions
03 Statistical Data Architecture
04 Geospatial Data Architecture
05 A platform for Evidence-Based Decision Making
01 Business Problem
❖ From data to knowledge
❖ From data producer to data user (policy maker)
01 General Approach
02 Basic Descriptions
❖ National Institute of Statistics and Geography (INEGI)
❖ The National Statistical System
❖ Production of Statistical Information
❖ Production of Geographical Information
❖ Metadata
❖ Analysis taxonomy
❖ 02.1 National Institute of Statistics and Geography
A Federal Government Organization
50+ offices nationwide
16,000+ employees
150+ production lines
Technically Autonomous Institution
● Headed by five board members (1 President, 4 Vice Presidents) appointed by the Senate.
4 Divisions that produce information:
● Household
● Business
● Government and Security
● Geographic Information
Two roles
1. Coordination of the National Statistical and Geographical System (Information produced by government agencies that support the design and evaluation of public policy.)
2. Production of Official Statistics and Geographical Information
❖ 02.2 The National Statistical System
National Statistics Coordinator: INEGI
● Treasury: Statistical Division
● Interior Ministry: Statistical Division
● Education Ministry: Statistical Division
● Other Ministries: Statistical Division
Mandate: To produce information that supports the design and evaluation of public policy.
Status quo: Multiple data producers with dispersed systems of records, various types of data, and conceptual frameworks.
❖ 02.3 Production of Statistical Information
Data: Census, business registries, other data
Generic Statistical Business Process
Process Infrastructure
Information: Charts, tables, data marks
❖ 02.4 Production of Geographical Information
Data: Hard copy, Digital form
Geographical Information Systems (Standardized following GSBPM)
Information: Hard copy, Digital form
Lines, Points, Areas, Complex
Maps, Attributes
Images, Lidar, GPS
❖ 02.5 Metadata
Metadata describes data lineage and relevant information that provides context, contents, and meaning.
❖ Statistics ❖ GIS
ISO 19100
Attribute standards (metadata) to produce and disseminate information.
No standards for simultaneous production of statistical and geographical objects.
03 Statistical Data Architecture
❖ Dominant Schema
❖ Standardized production line (life cycle).
❖ Statistical Data Domains
❖ Aggregate Data Architecture levels
❖ Statistical Metadata
❖ Opportunities
❖ 03.1 Dominant Schema
[Diagram: four separate production lines, each with the same phases, informant data, and product.]
Specify needs → Design → Build → Collect → Process → Analyse → Disseminate → Evaluate
Informant data
Product
❖ 03.2 Standardized Production Line
Specify needs → Design → Build → Collect → Process → Analyse → Disseminate → Evaluate
Data object
● Documentation of Specify needs
● Collected data
● Processed data
● Dissemination principles
● Aggregated data
● Main Indicators
● Other Indicators
● Additional Content
● Information Set
● Product
● Informant data
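As a minimal sketch of the standardized production line, the eight phases named on the slide can be modeled as an ordered enumeration, with a check that a production line has documented a deliverable for each phase. The `ProductionLine` class, the `missing_phases` helper, and the sample deliverables are hypothetical illustrations, not part of the presentation.

```python
from dataclasses import dataclass, field
from enum import IntEnum
from typing import Dict, List


class Phase(IntEnum):
    """The eight phases named on the slide, in order."""
    SPECIFY_NEEDS = 1
    DESIGN = 2
    BUILD = 3
    COLLECT = 4
    PROCESS = 5
    ANALYSE = 6
    DISSEMINATE = 7
    EVALUATE = 8


@dataclass
class ProductionLine:
    """Hypothetical record of what one production line delivers per phase."""
    name: str
    deliverables: Dict[Phase, str] = field(default_factory=dict)

    def missing_phases(self) -> List[Phase]:
        """Return the phases with no documented deliverable, in phase order."""
        return [p for p in Phase if p not in self.deliverables]


# Illustrative data only: a line that has documented three phases so far.
census = ProductionLine(
    name="census",
    deliverables={
        Phase.COLLECT: "collected data",
        Phase.PROCESS: "processed data",
        Phase.DISSEMINATE: "aggregated data",
    },
)
missing = census.missing_phases()
print([p.name for p in missing])
```

Using one shared enumeration for every production line is one way to make lines comparable; a line that skips a phase shows up immediately.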
❖ 03.4 Aggregate Data Architecture levels
❖ 03.5 Statistical Metadata
Metadata Standards
Conceptual Metadata
● Purpose
● Coverage
● Analysis unit
● Concepts
● Universe
● Variables
● Classifications
● Categories
Data collection Metadata
● Methodology
● Sampling
● Collection strategy
Processing metadata
● Entry
● Coding
● Editing
● Derivation
● Weighting
Support: Data discovery, Data analysis, Data distribution, Data access, Data availability
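The three metadata groups on this slide can be sketched as plain records whose fields mirror the slide's bullets. The class names, the `missing_items` helper, and the sample values are assumptions made for illustration; the slide does not prescribe a storage format.

```python
from dataclasses import dataclass, fields
from typing import List, Optional


@dataclass
class ConceptualMetadata:
    purpose: Optional[str] = None
    coverage: Optional[str] = None
    analysis_unit: Optional[str] = None
    concepts: Optional[str] = None
    universe: Optional[str] = None
    variables: Optional[str] = None
    classifications: Optional[str] = None
    categories: Optional[str] = None


@dataclass
class CollectionMetadata:
    methodology: Optional[str] = None
    sampling: Optional[str] = None
    collection_strategy: Optional[str] = None


@dataclass
class ProcessingMetadata:
    entry: Optional[str] = None
    coding: Optional[str] = None
    editing: Optional[str] = None
    derivation: Optional[str] = None
    weighting: Optional[str] = None


def missing_items(block) -> List[str]:
    """List the metadata items of a block that have not been filled in."""
    return [f.name for f in fields(block) if getattr(block, f.name) is None]


# Hypothetical, partially documented survey.
conceptual = ConceptualMetadata(
    purpose="measure household income",
    coverage="national",
    analysis_unit="household",
)
print(missing_items(conceptual))
print(missing_items(CollectionMetadata()))
```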
❖ 03.5 Opportunities with statistical information
• Despite the heterogeneity in the production of information, all collected statistical data are either georeferenced or geocoded.
• Other statistical information attributes (metadata) can be stored and managed in a geospatial infrastructure.
• Consolidation in a Geospatial Infrastructure enables analysis across disciplinary domains, enhancing traditional statistical analysis.
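One way to picture statistical attributes stored alongside geometry is a GeoJSON Feature, where the location goes in `geometry` and the statistical value and its metadata go in `properties`. This is only a sketch: the `to_feature` helper, the property names, and the sample values are hypothetical, and the presentation does not say which format is used.

```python
import json
from typing import Any, Dict


def to_feature(lon: float, lat: float, value: Any, metadata: Dict[str, Any]) -> Dict[str, Any]:
    """Wrap one geocoded statistical observation as a GeoJSON Feature.

    GeoJSON stores coordinates in [longitude, latitude] order.
    """
    return {
        "type": "Feature",
        "geometry": {"type": "Point", "coordinates": [lon, lat]},
        "properties": {"value": value, "metadata": metadata},
    }


# Illustrative observation; all values are made up.
feature = to_feature(
    lon=-99.13,
    lat=19.43,
    value=42,
    metadata={"variable": "households", "period": "2020", "source": "hypothetical survey"},
)
encoded = json.dumps(feature)
print(encoded)
```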
04 Geographical Data Architecture
❖ Geographical Information Systems (GIS)
❖ Geocoding vs. Georeferencing
❖ Schemas for Geospatial Data
❖ Grid-based representation
❖ Administrative boundaries
❖ Geographical Metadata
❖ 04.1 Geographic Information System (GIS)
A framework for gathering, managing, and analyzing data.
● Imagery
● Transportation
● Addresses
● Water features
● Boundaries
● Elevation
❖ 04.2 Geocoding vs. georeferencing
Geocoding
● Requires the latitude and longitude of every statistical object
● Substantial historical data is not geocoded
Georeferencing
● Fixed polygons
● Widely available
● Less flexible
● Most historical data was georeferenced
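The difference in flexibility can be sketched in a few lines: a geocoded record carries its own coordinates, so it can be assigned to any set of polygons after the fact, whereas a georeferenced record only carries the code of a fixed polygon. The sketch below uses a standard ray-casting point-in-polygon test; the polygons, area codes, and the `georeference` helper are hypothetical.

```python
from typing import Dict, List, Optional, Tuple

Point = Tuple[float, float]


def point_in_polygon(x: float, y: float, polygon: List[Point]) -> bool:
    """Ray casting: count how many polygon edges a ray to the right of (x, y) crosses."""
    inside = False
    n = len(polygon)
    for i in range(n):
        x1, y1 = polygon[i]
        x2, y2 = polygon[(i + 1) % n]
        # Only edges that straddle the horizontal line through y can be crossed.
        if (y1 > y) != (y2 > y):
            x_cross = x1 + (y - y1) * (x2 - x1) / (y2 - y1)
            if x < x_cross:
                inside = not inside
    return inside


def georeference(lon: float, lat: float, areas: Dict[str, List[Point]]) -> Optional[str]:
    """Return the code of the first area containing the point, or None."""
    for code, polygon in areas.items():
        if point_in_polygon(lon, lat, polygon):
            return code
    return None


# Hypothetical fixed polygons, vertices in (longitude, latitude) order.
areas = {
    "A": [(0.0, 0.0), (2.0, 0.0), (2.0, 2.0), (0.0, 2.0)],
    "B": [(2.0, 0.0), (4.0, 0.0), (4.0, 2.0), (2.0, 2.0)],
}

# A geocoded record can be mapped onto the polygons at any time.
geocoded_record = {"lon": 3.0, "lat": 1.0, "value": 7}
area_code = georeference(geocoded_record["lon"], geocoded_record["lat"], areas)
print(area_code)
```

The reverse step is not possible: a record that stores only an area code cannot be re-assigned to different polygons, which is one reading of "less flexible" on the slide.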
❖ 04.3 Schemas for Geospatial data
Grid-Based
● Requires geocoding of every statistical observation
● Some historical data was not geocoded
Administrative boundaries
● Fixed polygons
● Widely available
● Less flexible
❖ 04.4 Grid-based representation
• Spatial units with equal size and even distribution.
• It offers flexibility in size.
• It is not population-centric.
• It can be applied across boundaries.
• Suitable for overlaying and spatial analysis.
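A small sketch of the grid idea: each geocoded observation is assigned to a square cell by flooring its coordinates, and values are summed per cell. Changing `cell_size` re-aggregates the same observations at another resolution, which illustrates the "flexibility in size" point. The function names, the planar coordinates, and the sample observations are assumptions for illustration.

```python
import math
from collections import Counter
from typing import Dict, Iterable, Tuple

Cell = Tuple[int, int]


def grid_cell(x: float, y: float, cell_size: float) -> Cell:
    """Return the (column, row) index of the square cell containing (x, y)."""
    return (math.floor(x / cell_size), math.floor(y / cell_size))


def aggregate(points: Iterable[Tuple[float, float, int]], cell_size: float) -> Dict[Cell, int]:
    """Sum the value of every (x, y, value) observation per grid cell."""
    totals: Counter = Counter()
    for x, y, value in points:
        totals[grid_cell(x, y, cell_size)] += value
    return dict(totals)


# Made-up observations in an arbitrary planar coordinate system.
observations = [(0.5, 0.5, 10), (0.7, 0.2, 5), (1.5, 0.5, 3)]

fine = aggregate(observations, cell_size=1.0)
coarse = aggregate(observations, cell_size=2.0)
print(fine)
print(coarse)
```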
❖ 04.5 Administrative Boundaries
● Country
● State
● Municipality
● Region
● Enumeration Area
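Because administrative units nest inside one another, values recorded at the lowest level can be rolled up to higher levels. The sketch below assumes a hypothetical hierarchical code of the form state-municipality-enumeration area; the code layout, the `roll_up` helper, and the counts are illustrative only and do not reproduce any official coding scheme.

```python
from collections import defaultdict
from typing import Dict

# Hypothetical codes: "SS-MMM-EEEE" = state, municipality, enumeration area.
counts_by_area = {
    "01-001-0001": 120,
    "01-001-0002": 80,
    "01-002-0001": 50,
    "02-001-0001": 200,
}


def roll_up(counts: Dict[str, int], levels: int) -> Dict[str, int]:
    """Sum values over the first `levels` parts of each hyphen-separated code."""
    totals: Dict[str, int] = defaultdict(int)
    for code, value in counts.items():
        key = "-".join(code.split("-")[:levels])
        totals[key] += value
    return dict(totals)


by_municipality = roll_up(counts_by_area, levels=2)
by_state = roll_up(counts_by_area, levels=1)
country_total = sum(counts_by_area.values())
print(by_municipality, by_state, country_total)
```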
❖ 04.6 Geographical Metadata
• Identification
  • Creation date, data author, contact information, source agency, map projection and coordinate system, scale, error, explanation of symbology and attributes, data dictionary, data restrictions, licensing.
• Assessment
  • Use constraints, access constraints, data quality, availability
• Access
  • Online, order, contact
05 Geostatistical Approach
❖ Evolution to support evidence-based analysis
❖ Geospatial Data Infrastructure
❖ Metadata Approach
❖ Implementation stages
❖ 05.1 Evolution to support evidence-based analysis
National Statistics Data → Statistical Analysis
National Geospatial Data → Geospatial Analysis
Geostatistical National Data Architecture → Geostatistical Analysis
❖ 05.2 Geostatistical Data Infrastructure
National Data Architecture: Geospatial Database
Statistical Data: Stat Data Layers
Geospatial Data: Geo Data Layers
● Imagery
● Transportation
● Addresses
● Water features
● Boundaries
● Elevation
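As a rough sketch of statistical and geospatial layers sitting in one database, each layer below is keyed by a shared spatial unit identifier so that any unit can be queried across all layers at once. The `Layer` class, the `overlay` helper, the unit identifiers, and all values are hypothetical; the slide does not describe the actual storage model.

```python
from dataclasses import dataclass
from typing import Any, Dict, List


@dataclass
class Layer:
    """One named layer; `values` maps a spatial unit id to that layer's attribute."""
    name: str
    kind: str  # "statistical" or "geospatial"
    values: Dict[str, Any]


def overlay(layers: List[Layer], unit_id: str) -> Dict[str, Any]:
    """Collect every layer's attribute for one spatial unit, skipping layers without it."""
    return {layer.name: layer.values[unit_id] for layer in layers if unit_id in layer.values}


# Made-up layers sharing the same unit ids.
database = [
    Layer("population", "statistical", {"u1": 1200, "u2": 300}),
    Layer("businesses", "statistical", {"u1": 45}),
    Layer("elevation_m", "geospatial", {"u1": 2240, "u2": 15}),
]

print(overlay(database, "u1"))
print(overlay(database, "u2"))
```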
❖ 05.3 Metadata Approach
Time, Place, Theme
Geospatial Metadata
Statistical Metadata
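One possible reading of this slide is that time, place, and theme act as descriptors shared by statistical and geospatial metadata, so that datasets of both kinds can be matched. The sketch below follows that assumption: it pairs statistical and geospatial catalog entries that describe the same time and place. The `Descriptor` class, the `linkable` helper, and the catalog entries are hypothetical.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class Descriptor:
    """Catalog entry described by the three shared dimensions."""
    title: str
    kind: str  # "statistical" or "geospatial"
    time: str
    place: str
    theme: str


def linkable(catalog: List[Descriptor]) -> List[Tuple[str, str]]:
    """Pair statistical and geospatial entries that share the same time and place."""
    stats = [d for d in catalog if d.kind == "statistical"]
    geos = [d for d in catalog if d.kind == "geospatial"]
    return [
        (s.title, g.title)
        for s in stats
        for g in geos
        if (s.time, s.place) == (g.time, g.place)
    ]


# Made-up catalog entries.
catalog = [
    Descriptor("household survey", "statistical", "2020", "state-01", "income"),
    Descriptor("business registry", "statistical", "2019", "state-01", "economy"),
    Descriptor("road network", "geospatial", "2020", "state-01", "transportation"),
]
pairs = linkable(catalog)
print(pairs)
```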
❖ 05.4 Implementation Stages
0 Silos
1 Standardization of processes and deliverables
2 Integration into a Geospatial Infrastructure
3 Geospatial analysis
4 Geospatial Network
❖ 05.4 Implementation Stages
0 Silos
[Diagram: four separate production lines, each with its own informant data and product.]
Ad Hoc consolidation
● Geographic Information
● Statistical Information
● Data Warehouse
● Geospatial Database
❖ 05.4 Implementation Stages
1 Standardization of processes and deliverables
Geographic Information
Statistical Information
Both with standardized:
1) Output
2) Metadata
3) Paradata
❖ 05.4 Implementation Stages
2 Integration into a Geospatial Infrastructure
Geographic Information
Statistical Information
Geospatial Infrastructure
❖ 05.4 Implementation Stages
3 Geospatial analysis
● Grid Base
● Administrative Boundaries
● Representation
● Mapping
● Analysis
❖ 05.4 Implementation Stages
4 Geospatial Network
Implement the schema in every Federal Statistical Institution...
❖ 05.4 Implementation Stages
4 Geospatial Network
A virtual web geospatial portal to access ALL available data
Summary
Statistical production has been supporting traditional evidence-based policymaking.
Most of the statistical production has been taking place within independent silos.
Metadata standards can enable common interfaces, but they do not provide a framework to define shared storage and use.
GIS provide a pool to concentrate statistical and geographical data into a common Geospatial Infrastructure.
The existence of standardized statistical and geographical metadata allows the consolidation of data and enhances the capabilities for representation, mapping and geospatial analysis.