data vault
Datavault
Hennie de Nooijer
Dan Linstedt
Data modeling
All data, all the time
Method of design
Data Vault
10-04-2023 3
Agenda
Position
Definition
Architecture
Modeling
Methodology
Questions?
Information provisioning
Controlled information provisioning
DWH
Information provisioning
Business Intelligence
Data warehouse
ETL
RDBMS
Hardware
Definition
The Data Vault is a detail oriented, historical tracking and uniquely linked set of normalized tables that support one or more functional areas of business.
Detail oriented
Historical tracking
Uniquely linked set of normalized tables
Functional areas of business
But there are more aspects...
Auditable
Scalable
Adaptable
Active
Metadata
MDM aware
Conventional architecture
[Figure: DWH with source, STAGE, TRANSFORM and business; layers Integration, Storage, Presentation; labels "Business Information Model" and "Current Business Demands/Wishes"]
Modern architecture
[Figure: DWH with TRANSFORM; layers Integration, Storage, Presentation, Storage; labels "Current Business Information Model", "Current Business Demands/Wishes" and "ALL DATA, ALL THE TIME"]
Business Information Model
[Figure: entity-relationship diagram, labels translated from Dutch]
Entities: Delivery conditions, Supplier, Material type, Warehouse material requirement, Order, Delivery, Warehouse, Workday, Size
Relationship labels:
Is placed under / concerns
Is willing to deliver / can be delivered by
Receives / is placed with
Obliges to / is realization of
on
has
Consists of / is contained in
Consists of / occurs in
Concerns the willingness to deliver to a / can be delivered accordingly to
Provides for / is provided for by
Is received by / receives
Consists of
Occurs in
with
Must be provided for
Architecture (detail)
[Figure: Source 1, Source 2, ... Source n; Replication layer; Raw Datavault 1, Raw Datavault 2, ... Raw Datavault n; Business Datavault; Datamarts; Front end. Example objects shown in the layers: Patient, Customer, KNA1]
Architecture (Advanced)
[Figure: Source 1, Source 2, ... Source n; Enterprise Service Bus (Biztalk/Cloverleaf/SOA); Datavault; Datamarts; Front end tools]
Benefits
Manage and enforce compliance (SOX, HIPAA and Basel II).
Reduces business cycle time.
Enables Master Data Management.
CMM Level 5 compliant.
Repeatable, consistent and redundant.
Trace all data back to source systems.
Flexibility.
Scalability.
Consistent.
Adaptable.
Possible automatic generation of the DDL and ETL.
Supports VLDB.
Designed for EDW.
[Figure: Data Vault building blocks: Hub Patient and Hub Treat connected by Link Treatment, with Satellites attached]
Hub
Represents the business key.
A surrogate key as the primary key.
Load date timestamp (when did it get there?)
Record source (where did it come from?)
Source table Patient: Patient_ID, Patient_Code, Patient_Name, Patient_Desc, Patient_Category, Patient_SubCategory, Patient_Address, Patient_Gender
Hub_Patient: Patient_Key, Patient_Code, Load_Date, Record_Source
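The hub rules above can be illustrated with a small in-memory sketch. It assumes that the business key (Patient_Code) is stored once per hub and that the surrogate key is a simple counter; the class names `HubPatientRow` and `HubPatient` and the source names "HIS" and "CRM" are hypothetical and not taken from the slides.

```python
from dataclasses import dataclass
from datetime import datetime, timezone
from typing import Dict, List, Optional


@dataclass(frozen=True)
class HubPatientRow:
    patient_key: int      # surrogate key, primary key of the hub
    patient_code: str     # business key
    load_date: datetime   # when did it get there?
    record_source: str    # where did it come from?


class HubPatient:
    """In-memory stand-in for the Hub_Patient table (illustrative only)."""

    def __init__(self) -> None:
        self._by_code: Dict[str, HubPatientRow] = {}

    def load(self, patient_code: str, record_source: str,
             load_date: Optional[datetime] = None) -> HubPatientRow:
        # Assumption: a business key that is already present is not loaded again.
        existing = self._by_code.get(patient_code)
        if existing is not None:
            return existing
        row = HubPatientRow(
            patient_key=len(self._by_code) + 1,  # next surrogate key
            patient_code=patient_code,
            load_date=load_date or datetime.now(timezone.utc),
            record_source=record_source,
        )
        self._by_code[patient_code] = row
        return row

    def rows(self) -> List[HubPatientRow]:
        return list(self._by_code.values())


hub = HubPatient()
first = hub.load("P-001", "HIS")
again = hub.load("P-001", "CRM")   # same business key from another source
second = hub.load("P-002", "HIS")
```

In this sketch the second load of "P-001" returns the row created by the first load, so the hub keeps the load date and record source of the first arrival.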
Satellite
Descriptive items of a hub or a link
The surrogate key of the hub or link plus the load date as the primary key.
Load date timestamp (when did it get there?)
Record source (where did it come from?)
Source table Patient: Patient_ID, Patient_Code, Patient_Name, Patient_Desc, Patient_Category, Patient_SubCategory, Patient_Address, Patient_Gender
SAT_Patient (one satellite): Patient_Key, Load_Date, Patient_Name, Patient_Desc, Patient_Category, Patient_SubCategory, Patient_Address, Patient_Gender
Or split into two satellites:
SAT_Patient: Patient_Key, Load_Date, Patient_Name, Patient_Desc, Patient_Address, Patient_Gender
SAT_PatientCategory: Patient_Key, Load_Date, Patient_Category, Patient_SubCategory
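The satellite layout can be sketched as follows, with rows keyed by Patient_Key and Load_Date as in the SAT_Patient tables. The sketch assumes that a new row is written only when the descriptive attributes differ from the most recent row, which is a common way to get historical tracking but is not stated on the slide; the class name `SatPatient` and the sample values are hypothetical.

```python
from datetime import datetime
from typing import Dict, List, Optional, Tuple

# Satellite rows are keyed by (Patient_Key, Load_Date).
SatKey = Tuple[int, datetime]


class SatPatient:
    """In-memory stand-in for the SAT_Patient table (illustrative only)."""

    def __init__(self) -> None:
        self._rows: Dict[SatKey, Dict[str, str]] = {}

    def latest(self, patient_key: int) -> Optional[Dict[str, str]]:
        # Most recent descriptive attributes for one hub key, if any.
        dates = [d for (k, d) in self._rows if k == patient_key]
        if not dates:
            return None
        return self._rows[(patient_key, max(dates))]

    def load(self, patient_key: int, load_date: datetime,
             attributes: Dict[str, str]) -> bool:
        # Assumption: unchanged attributes do not produce a new row.
        if self.latest(patient_key) == attributes:
            return False
        self._rows[(patient_key, load_date)] = dict(attributes)
        return True

    def history(self, patient_key: int) -> List[Tuple[datetime, Dict[str, str]]]:
        # All versions for one hub key, oldest first.
        versions = [(d, a) for (k, d), a in self._rows.items() if k == patient_key]
        return sorted(versions, key=lambda pair: pair[0])


sat = SatPatient()
old = {"Patient_Name": "A. Jansen", "Patient_Address": "Old Street 1"}
new = {"Patient_Name": "A. Jansen", "Patient_Address": "New Street 2"}
inserted_1 = sat.load(1, datetime(2023, 1, 1), old)
inserted_2 = sat.load(1, datetime(2023, 2, 1), old)   # nothing changed
inserted_3 = sat.load(1, datetime(2023, 3, 1), new)   # address changed
```

Splitting the attributes over SAT_Patient and SAT_PatientCategory would mean two such structures, so that a category change does not create a new version of the name and address row.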
Link
Links two or more hubs
Own surrogate key.
Keys from the hubs
Load date timestamp
Record source
Hub_Patient: Patient_Key, Patient_Code, Load_Date, Record_Source
Hub_Treat: Treat_Key, Treat_Code, Load_Date, Record_Source
Link_Treatment: Treatment_Key, Patient_Key, Treat_Key, Load_Date, Record_Source
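A link row for Link_Treatment can be sketched in the same way. The sketch assumes that each combination of hub keys is stored once and that the link's own surrogate key is a counter; the class names `LinkTreatmentRow` and `LinkTreatment` and the source name "HIS" are hypothetical.

```python
from dataclasses import dataclass
from datetime import datetime, timezone
from typing import Dict, Optional, Tuple


@dataclass(frozen=True)
class LinkTreatmentRow:
    treatment_key: int    # the link's own surrogate key
    patient_key: int      # key from Hub_Patient
    treat_key: int        # key from Hub_Treat
    load_date: datetime
    record_source: str


class LinkTreatment:
    """In-memory stand-in for the Link_Treatment table (illustrative only)."""

    def __init__(self) -> None:
        self._by_pair: Dict[Tuple[int, int], LinkTreatmentRow] = {}

    def load(self, patient_key: int, treat_key: int, record_source: str,
             load_date: Optional[datetime] = None) -> LinkTreatmentRow:
        # Assumption: one row per unique combination of hub keys.
        pair = (patient_key, treat_key)
        existing = self._by_pair.get(pair)
        if existing is not None:
            return existing
        row = LinkTreatmentRow(
            treatment_key=len(self._by_pair) + 1,  # next surrogate key
            patient_key=patient_key,
            treat_key=treat_key,
            load_date=load_date or datetime.now(timezone.utc),
            record_source=record_source,
        )
        self._by_pair[pair] = row
        return row


link = LinkTreatment()
t1 = link.load(patient_key=1, treat_key=10, record_source="HIS")
t2 = link.load(patient_key=1, treat_key=11, record_source="HIS")
t1_again = link.load(patient_key=1, treat_key=10, record_source="HIS")
```

The link holds only keys, load date and record source; descriptive details of a treatment would go into a satellite on the link.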
Source data model
Analysis data model
Data Vault data model
Datavault
Point in Time (PIT) views: the 'truth' at a certain moment. Helper table?
Bridge: same as Point in Time, but for a range.
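One way to read the Point in Time idea is as a helper row that records, per satellite, which load date was valid at a given moment. The sketch below assumes that reading; the function names `load_date_as_of` and `pit_row`, the column name Snapshot_Date and the sample dates are hypothetical.

```python
from bisect import bisect_right
from datetime import datetime
from typing import Dict, List, Optional


def load_date_as_of(load_dates: List[datetime],
                    moment: datetime) -> Optional[datetime]:
    """Return the latest load date at or before `moment`, or None."""
    ordered = sorted(load_dates)
    position = bisect_right(ordered, moment)
    return ordered[position - 1] if position else None


def pit_row(patient_key: int, moment: datetime,
            satellites: Dict[str, Dict[int, List[datetime]]]) -> Dict[str, object]:
    """Build one PIT row: the valid load date of each satellite at `moment`."""
    row: Dict[str, object] = {"Patient_Key": patient_key, "Snapshot_Date": moment}
    for name, dates_by_key in satellites.items():
        dates = dates_by_key.get(patient_key, [])
        row[name + "_Load_Date"] = load_date_as_of(dates, moment)
    return row


# Load dates per satellite and per Patient_Key (sample data).
satellites = {
    "SAT_Patient": {1: [datetime(2023, 1, 1), datetime(2023, 3, 1)]},
    "SAT_PatientCategory": {1: [datetime(2023, 2, 1)]},
}
row = pit_row(1, datetime(2023, 2, 15), satellites)
early = pit_row(1, datetime(2022, 12, 1), satellites)
```

A query can then join each satellite on Patient_Key and the load date stored in the PIT row, instead of searching for the right version per satellite. A bridge would apply the same idea to a range of moments.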
Questions?