data vault

34
Datavault Hennie de

Upload: henniedenooijer

Post on 19-Jan-2015

1.219 views

Category:

Technology


9 download

DESCRIPTION

 

TRANSCRIPT

Page 1: Data vault

Datavault

Hennie de Nooijer

Page 2: Data vault

Dan Linstedt

Data modeling

All data, all the time

Method of design

Data Vault

Page 3: Data vault

10-04-2023 3

Agenda

PositionDefinitionArchitectureModelingMethodologyQuestions?

Page 4: Data vault

Information provisioning

10-04-2023 4

Page 5: Data vault

Controllled information provisioning

DWH

Information provisioning

10-04-2023 5

Page 6: Data vault

10-04-2023 6

Business Intelligence

Data warehouse

ETL

RDBMS

Hardware

Page 7: Data vault

10-04-2023 7

Definition

The Data Vault is a detail oriented, historical tracking and uniquely linked set of normalized tables that support one or more functional

areas of business.

The Data Vault is a detail oriented, historical tracking and uniquely linked set of normalized tables that support one or more functional areas of business.

Page 8: Data vault

10-04-2023 8

Detailoriented

Page 9: Data vault

10-04-2023 9

Historical tracking

Page 10: Data vault

10-04-2023 10

Uniquely linked set normalized tables

Page 11: Data vault

10-04-2023 11

Functional areas of business

Page 12: Data vault

10-04-2023 12

But there are more aspects…..

Page 13: Data vault

10-04-2023 13

Auditable

Page 14: Data vault

10-04-2023 14

Scalable

Page 15: Data vault

10-04-2023 15

Adaptable

Page 16: Data vault

10-04-2023 16

Active

Page 17: Data vault

10-04-2023 17

Metadata

Page 18: Data vault

10-04-2023 18

MDM aware

Page 19: Data vault

10-04-2023 19

Agenda

PositionDefinitionArchitectureModelingMethodologyQuestions?

Page 20: Data vault

DWH

Conventional architecture

TRANSFORM

Integration Storage Presentation

Business

Informatio

n Model

Current Business Demands/Wishes

STAGE

Page 21: Data vault

STAGE

source

DWH

business

DWH

Modern architecture

TRANSFORM

Integration Storage PresentationStorage

Current B

usiness

Informatio

n Model

Current Business Demands/Wishes

ALL DATA, A

LL THE TIME

Page 22: Data vault

Business

Information

Model

Leveringscondities

Leverancier

Materiaalsoort

Materiaalbehoeftemagazijn

Bestelling

Levering

Magazijn

omvangwerkdag

werkdag

omvang

Is geplaatst onder/betreft

Is bereid te leveren/kan geleverd worden door

Ontvangt/Is geplaatst bij

Verplicht tot/Is realisatie van

op

heeft

Bestaat uit/zit in

Bestaat uit/komt voor in

Betreft de bereidhied tot het levereren aan een/kan conform worden geleverd aan

Voorziet in/wordt in voorzien door

Wordt ontvangen door/ontvangt

Bestaat uit

Komt voor inmet Moet in voorzien worden voor

Page 23: Data vault

10-04-2023 23

Architecture (detail)

Bron nBron 2Bron 1

Raw Datavault 1

Business Datavault

Datamarts

Frond end

Raw Datavault 2

Raw Datavault n

Replicatielaag

Patient

Patient Customer

Patient Customer

Patient

Patient

KNA1

KNA1

Page 24: Data vault

10-04-2023 24

Enterprise Service Bus (Biztalk/Cloverleaf/SOA)

Architecture (Advanced)

Bron nBron 2Bron 1

Datamarts

Datavault

Frond end tools

Page 25: Data vault

10-04-2023 25

Manage and enforce Compliance (SOX, HIPPA en BASEL II).Reduces Business cycle time.Enabling Master Data management.CMM Level 5 compliant.

Repeatable, consistent and redundant.Trace all data back to source systems.Flexibility.Scalability.Consistent.Adaptable.Possible automatic generation of the DDL and ETL.Supports VLDBDesigned for EDW

Benefits

Page 26: Data vault

10-04-2023 26

Agenda

PositionDefinitionArchitectureModelingMethodologyQuestions?

LinkHub

Satellite

Satellite

Satellite

Satellite

Hub

Satellite

Satellite

Satellite

Patient

Treat

Treatment

Page 27: Data vault

10-04-2023 27

Hub

Represents the business key.

A surrogate key as the primary key.

Load date timestamp (when did it get there?)

Record source (where did it come from?)

Hub

Patient_ID

Patient_CodePatient_NamePatient_DescPatient_CategoryPatient_SubCategoryPatient_AddressPatient_Gender

Patient_Key

Patient_CodeLoad_DateRecord_Source

Hub_PatientPatient

Page 28: Data vault

10-04-2023 28

Satellite Satellite

Descriptive items of a hub or a link

A surrogate key as the primary key.

Load date timestamp (when did it get there?)

Record source (where did it come from?)

Patient_ID

Patient_CodePatient_NamePatient_DescPatient_CategoryPatient_SubCategoryPatient_AddressPatient_Gender

Patient_KeyLoad_Date

Patient_NamePatient_DescPatient_CategoryPatient_SubCategoryPatient_AddressPatient_Gender

SAT_PatientPatient

Patient_KeyLoad_Date

Patient_NamePatient_DescPatient_AddressPatient_Gender

Patient_KeyLoad_Date

Patient_CategoryPatient_SubCategory

SAT_Patient

SAT_PatientCategory

Page 29: Data vault

10-04-2023 29

Links two or more hubs

Own surogate key.

Keys from the hub

Load date time stamp

Record source

Link Link

Patient_Key

Patient_CodeLoad_DateRecord_Source

Hub_Patient

Treat_Key

Treat_CodeLoad_DateRecord_Source

Hub_Treat

Treatment_Key

Patient_KeyTreat_KeyLoad_DateRecord_Source

Link_Treatment

Page 30: Data vault

10-04-2023 30

Bron datamodel

Page 31: Data vault

10-04-2023 31

Analyse datamodel

Page 32: Data vault

10-04-2023 32

Datavault datamodel

Page 33: Data vault

10-04-2023 33

Datavault

Point in Time views (PIT).‘truth’ at a certain moment.Helper table?

Bridge.Same as Point in Time but then a range.

Page 34: Data vault

10-04-2023 34

Questions?