corporate data model (cdm) underlying principle:4 datastores 1.input-raw data 2.clean unit-cleaned...

9
Corporate Data Model (CDM) Underlying principle: 4 datastores 1. INPUT - raw data 2. CLEAN UNIT - cleaned data 3. AGGREGATE - aggregated data 4. DISSIMINATION - published data CDM was seen as active DWH

Upload: stephany-harvey

Post on 19-Jan-2016

213 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Corporate Data Model (CDM) Underlying principle:4 datastores 1.INPUT-raw data 2.CLEAN UNIT-cleaned data 3.AGGREGATE-aggregated data 4.DISSIMINATION-published

Corporate Data Model (CDM)

Underlying principle: 4 datastores

1. INPUT - raw data

2. CLEAN UNIT - cleaned data

3. AGGREGATE - aggregated data

4. DISSIMINATION - published data

CDM was seen as ≈ active DWH

Page 2: Corporate Data Model (CDM) Underlying principle:4 datastores 1.INPUT-raw data 2.CLEAN UNIT-cleaned data 3.AGGREGATE-aggregated data 4.DISSIMINATION-published

Corporate Data Model (CDM)

Main characteristics:• All (statistical) processes must use the

4 datastores• Processing systems interact on the data stores• At some moments: snap shots,

which build next data store• It is possible to work further on the same

(snap shotted) data store• Simultanious updating of / on data

is mainly organisational issue

Page 3: Corporate Data Model (CDM) Underlying principle:4 datastores 1.INPUT-raw data 2.CLEAN UNIT-cleaned data 3.AGGREGATE-aggregated data 4.DISSIMINATION-published

Corporate Data Model CSO - Ireland

INPUTCLEANED

DATASETSAGGREGATEDATASETS DISSEMINATION

DATA

MANAGEMENT

STORE

ADMINISTRATIVE

DATA CENTRE

2 OPERATIONALIMPLEMENTATIONS

Surveys

Admin data

Page 4: Corporate Data Model (CDM) Underlying principle:4 datastores 1.INPUT-raw data 2.CLEAN UNIT-cleaned data 3.AGGREGATE-aggregated data 4.DISSIMINATION-published

Data Management Store (DMS)• First implementation of CDM• Mainly survey data• Data tables are created and populated through

the DMS applications.• Metadata must be entered as the data tables

are created.• Metadata capturing = minimal

bottleneck• BR outside DMS (stand alone)

Page 5: Corporate Data Model (CDM) Underlying principle:4 datastores 1.INPUT-raw data 2.CLEAN UNIT-cleaned data 3.AGGREGATE-aggregated data 4.DISSIMINATION-published

Corporate Data Model CSO - Ireland

DA

TA

C

OL

LE

CT

ION

A

CT

IVIT

IES

INPUTCLEANED

DATASETSAGGREGATEDATASETS DISSEMINATION

D

M

S

APP – layer, incl. I/O interfaces

DMS meta layer – Basic descriptions

SHAREDINPUT

SHAREDCLEANED UNIT

AGGREGATESTORE

SNAPSHOTS

B

I

SYS 1

SYS 2

SYS n

Mainly surveys

Page 6: Corporate Data Model (CDM) Underlying principle:4 datastores 1.INPUT-raw data 2.CLEAN UNIT-cleaned data 3.AGGREGATE-aggregated data 4.DISSIMINATION-published

Administrative Data Centre (ADC)• Developed for organisational reasons• Only Admin data• A catalyst to exploit administrative data for

statistical purposes• Interface with public authorities on admin data

flows to CSO• Clearing house inside CSO for admin data• Data governance with respect to admin data

Page 7: Corporate Data Model (CDM) Underlying principle:4 datastores 1.INPUT-raw data 2.CLEAN UNIT-cleaned data 3.AGGREGATE-aggregated data 4.DISSIMINATION-published

Administrative Data Centre (ADC)• Has analysis layer• R&D on available data• To develop new datasets• Without specific needs / demands

from statistics

Page 8: Corporate Data Model (CDM) Underlying principle:4 datastores 1.INPUT-raw data 2.CLEAN UNIT-cleaned data 3.AGGREGATE-aggregated data 4.DISSIMINATION-published

Corporate Data Model CSO - Ireland

INPUTCLEANED

DATASETSAGGREGATEDATASETS DISSEMINATION

A

D

C

ADC meta layer

B

I

SYS 1

SYS 2

SYS n

DA

TA

C

OL

LE

CT

ION

A

CT

IVIT

IES

SOURCES DataProducts

E

T

L

ADC

Front

Door

LEAN INTERFACE

Only Admin Data

Page 9: Corporate Data Model (CDM) Underlying principle:4 datastores 1.INPUT-raw data 2.CLEAN UNIT-cleaned data 3.AGGREGATE-aggregated data 4.DISSIMINATION-published

Corporate Data Model CSO - Ireland

DA

TA

C

OL

LE

CT

ION

A

CT

IVIT

IES

INPUTCLEANED

DATASETSAGGREGATEDATASETS DISSEMINATION

D

M

S

A

D

C

APP – layer, incl. I/O interfaces

DMS meta layer – Basic descriptions

ADC meta layer

SHAREDINPUT

SHAREDCLEANED UNIT

AGGREGATESTORE

SNAPSHOTS

B

I

SYS 1

SYS 2

SYS n

DA

TA

C

OL

LE

CT

ION

A

CT

IVIT

IES

SOURCESData

Products

E

T

L

ADC

Front

Door

LEAN INTERFACE