© 2013 kms technology. agile data warehouse design dao vo confidential2

38
© 2013 KMS Technology

Upload: hugo-strickland

Post on 13-Dec-2015

217 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: © 2013 KMS Technology. AGILE DATA WAREHOUSE DESIGN Dao Vo Confidential2

© 2013 KMS Technology

Page 2: © 2013 KMS Technology. AGILE DATA WAREHOUSE DESIGN Dao Vo Confidential2

Confidential 2

AGILE DATA WAREHOUSE DESIGNDao Vo

Page 3: © 2013 KMS Technology. AGILE DATA WAREHOUSE DESIGN Dao Vo Confidential2

Confidential 3

AGENDA

• Overview of data warehousing • Designing and implementing a data

warehouse• Waterfall BI/WH development• Agile BI/WH development framework• Q&A

Page 4: © 2013 KMS Technology. AGILE DATA WAREHOUSE DESIGN Dao Vo Confidential2

Confidential 4

OVERVIEW OF DATA WAREHOUSINGWhat is a data warehouse?

Page 5: © 2013 KMS Technology. AGILE DATA WAREHOUSE DESIGN Dao Vo Confidential2

Confidential 5

OVERVIEW OF DATA WAREHOUSING

• The business problem• What is a data warehouse?• BI/WH Architectures

Page 6: © 2013 KMS Technology. AGILE DATA WAREHOUSE DESIGN Dao Vo Confidential2

THE BUSINESS PROBLEM

• Key business data is distributed across multiple systems

Page 7: © 2013 KMS Technology. AGILE DATA WAREHOUSE DESIGN Dao Vo Confidential2

THE BUSINESS PROBLEM

• Finding the information required for business decision making is time-consuming and error-prone

Page 8: © 2013 KMS Technology. AGILE DATA WAREHOUSE DESIGN Dao Vo Confidential2

THE BUSINESS PROBLEM

• Fundamental business questions are hard to answer

Page 9: © 2013 KMS Technology. AGILE DATA WAREHOUSE DESIGN Dao Vo Confidential2

WHAT IS A DATA WAREHOUSE?

Page 10: © 2013 KMS Technology. AGILE DATA WAREHOUSE DESIGN Dao Vo Confidential2

WHAT IS A DATA WAREHOUSE?

• A centralized store of business data for reporting and analysis

• Typically, a data warehouse:– Contains large volumes of historical data– Is optimized for querying data (as

opposed to inserting or updating)– Is incrementally loaded with new

business data at regular intervals– Provides the basis for enterprise

business intelligence solutions

Page 11: © 2013 KMS Technology. AGILE DATA WAREHOUSE DESIGN Dao Vo Confidential2

Confidential 11

DESIGNING AND IMPLEMENTING A DATA WAREHOUSE

How to design a data warehouse and BI solution?

Page 12: © 2013 KMS Technology. AGILE DATA WAREHOUSE DESIGN Dao Vo Confidential2

Confidential 12

DESIGN AND IMPLEMENT WH

• Introduction to Dimensional Modeling• Star Schemas• Considerations for Dimension Tables• Considerations for Fact Tables• Snowflake Schemas

Page 13: © 2013 KMS Technology. AGILE DATA WAREHOUSE DESIGN Dao Vo Confidential2

Confidential 13

WAREHOUSE MODELING

Page 14: © 2013 KMS Technology. AGILE DATA WAREHOUSE DESIGN Dao Vo Confidential2

INTRODUCTION TO DIMENSIONAL MODELING

• Business questions focus on measures that are aggregated by business dimensions

• Measures are facts about the business

• Dimensions are ways in which the measures can be aggregated

Product Line

Product Line

SalespersonSales

person ProductProduct

TimeTime

CustomerCustomerRegionRegion

QuantityRevenue

CostProfit

Page 15: © 2013 KMS Technology. AGILE DATA WAREHOUSE DESIGN Dao Vo Confidential2

STAR SCHEMAS

• Group related dimensions into dimension tables

• Group related measures into

fact tables• Relate fact tables

to dimension tables by using foreign keys

DimSalesPersonSalesPersonKeySalesPersonNameStoreNameStoreCityStoreRegion

DimSalesPersonSalesPersonKeySalesPersonNameStoreNameStoreCityStoreRegion

DimProductProductKeyProductNameProductLineSupplierName

DimProductProductKeyProductNameProductLineSupplierName

DimCustomerCustomerKeyCustomerNameCityRegion

DimCustomerCustomerKeyCustomerNameCityRegion

FactOrdersCustomerKeySalesPersonKeyProductKeyShippingAgentKeyTimeKeyOrderNoLineItemNoQuantityRevenueCostProfit

DimDateDateKeyYearQuarterMonthDay

DimDateDateKeyYearQuarterMonthDay

DimShippingAgentShippingAgentKeyShippingAgentName

DimShippingAgentShippingAgentKeyShippingAgentName

Page 16: © 2013 KMS Technology. AGILE DATA WAREHOUSE DESIGN Dao Vo Confidential2

SNOWFLAKE SCHEMASDimSalesPersonSalesPersonKeySalesPersonNameStoreKey

DimSalesPersonSalesPersonKeySalesPersonNameStoreKey

DimProductProductKeyProductNameProductLineKeySupplierKey

DimProductProductKeyProductNameProductLineKeySupplierKey

DimCustomerCustomerKeyCustomerNameGeographyKey

DimCustomerCustomerKeyCustomerNameGeographyKey

FactOrdersCustomerKeySalesPersonKeyProductKeyShippingAgentKeyTimeKeyOrderNoLineItemNoQuantityRevenueCostProfit

DimDateDateKeyYearQuarterMonthDay

DimDateDateKeyYearQuarterMonthDay

DimShippingAgentShippingAgentKeyShippingAgentName

DimShippingAgentShippingAgentKeyShippingAgentName

DimProductLineProductLineKeyProductLineName

DimProductLineProductLineKeyProductLineName

DimGeographyGeographyKeyCityRegion

DimGeographyGeographyKeyCityRegion

DimSupplierSupplierKeySupplierName

DimSupplierSupplierKeySupplierName

DimStoreStoreKeyStoreNameGeographyKey

DimStoreStoreKeyStoreNameGeographyKey

Page 17: © 2013 KMS Technology. AGILE DATA WAREHOUSE DESIGN Dao Vo Confidential2

Confidential 17

WAREHOUSE MODELING

Page 18: © 2013 KMS Technology. AGILE DATA WAREHOUSE DESIGN Dao Vo Confidential2

Confidential 18

WATERFALL BI/WH DEVELOPMENTTraditional SDLC to develop a BI/WH product

Page 19: © 2013 KMS Technology. AGILE DATA WAREHOUSE DESIGN Dao Vo Confidential2

Confidential 19

WATERFALL BI/WH DEVELOPMENT

• SDLC Overview

Page 20: © 2013 KMS Technology. AGILE DATA WAREHOUSE DESIGN Dao Vo Confidential2

Confidential 20

WATERFALL BI/WH DEVELOPMENT

Page 21: © 2013 KMS Technology. AGILE DATA WAREHOUSE DESIGN Dao Vo Confidential2

Confidential 21

SDLC OVERVIEW

Page 22: © 2013 KMS Technology. AGILE DATA WAREHOUSE DESIGN Dao Vo Confidential2

Confidential 22

AGILE BI/WH DEVELOPMENT FRAMEWORK

Incremental development framework for BI/WH product

Page 23: © 2013 KMS Technology. AGILE DATA WAREHOUSE DESIGN Dao Vo Confidential2

Confidential 23

AGILE BI/WH DEVELOPMENT

FRAMEWORK• Agile BI/WH life cycle• Agile DW design overview• Agile ETL Solution

Page 24: © 2013 KMS Technology. AGILE DATA WAREHOUSE DESIGN Dao Vo Confidential2

Confidential 24

AGILE BI/WH LIFE CYCLE

Page 25: © 2013 KMS Technology. AGILE DATA WAREHOUSE DESIGN Dao Vo Confidential2

Confidential 25

AGILE BI/WH LIFE CYCLE

Page 26: © 2013 KMS Technology. AGILE DATA WAREHOUSE DESIGN Dao Vo Confidential2

Confidential 26

AGILE DW DESIGN OVERVIEWHow to design to answer business question?

Page 27: © 2013 KMS Technology. AGILE DATA WAREHOUSE DESIGN Dao Vo Confidential2

Confidential 27

AGILE DW DESIGN OVERVIEW

• How do we ask question?• The 7Ws framework• Design using natural language• Straightforward methodology• Model storming• BEAM methodology

Page 28: © 2013 KMS Technology. AGILE DATA WAREHOUSE DESIGN Dao Vo Confidential2

Confidential 28

HOW DO WE ASK QUESTION?

• Events/Transactions– A immutable "fact" that occurs in a time

and place

• Interrogatives:–Who, What, When, Where, Why– Descriptive context that fully describes

the event– A set of “dimensions" that describe

events

Page 29: © 2013 KMS Technology. AGILE DATA WAREHOUSE DESIGN Dao Vo Confidential2

Confidential 29

THE 7WS FRAMEWORK

WhyWhere

How

WhoWhen

What

HowMany

Page 30: © 2013 KMS Technology. AGILE DATA WAREHOUSE DESIGN Dao Vo Confidential2

THE 7WS FRAMEWORK

HOW – FACTsMuchManyOften£$€

WhoCustomerEmployee

SellerOrganization

WhatProductService

TransactionsBooking

Event

WhyCausal

PromotionReason

WeatherCompetition

WhereLocation

GeographicStore

Ship toHospital

WhenTimeDay

MonthYear

Page 31: © 2013 KMS Technology. AGILE DATA WAREHOUSE DESIGN Dao Vo Confidential2

Confidential 31

DESIGN USING NATURAL LANGUAGE

• Verbs – Events – Relationships – Fact Tables

• Nouns – Details – Entities – Dimensions

• Main Clause – Subject-Verb-Object• Prepositions – connect additional

details to the main clause• Interrogatives – The 7Ws – Dimension

Types

Page 32: © 2013 KMS Technology. AGILE DATA WAREHOUSE DESIGN Dao Vo Confidential2

STRAIGHTFORWARD METHODOLOGY

Confidential

Who

What

When

Where

How (many)

Why

How

1

3

11111

4

5

2

6

7

8

Declare Event Type

Subject-Verb-Object

Quantities - Facts

Sufficient Detail Fact Granularity

Initial Data Examples

9

Page 33: © 2013 KMS Technology. AGILE DATA WAREHOUSE DESIGN Dao Vo Confidential2

Confidential 33

DESIGN USING NATURAL LANGUAGE

• Verbs – Events – Relationships – Fact Tables

• Nouns – Details – Entities – Dimensions

• Main Clause – Subject-Verb-Object• Prepositions – connect additional

details to the main clause• Interrogatives – The 7Ws – Dimension

Types

Page 34: © 2013 KMS Technology. AGILE DATA WAREHOUSE DESIGN Dao Vo Confidential2

Confidential 34

BUSINESS EVENT ANALYSIS AND MODELING (BEAM✲)

An agile approach to dimensional modeling

Page 35: © 2013 KMS Technology. AGILE DATA WAREHOUSE DESIGN Dao Vo Confidential2

MODEL STORMING

Confidential 35

Quick

Data Modeler BI Stakeholders

Inclusive

Interactive

Fun

Page 36: © 2013 KMS Technology. AGILE DATA WAREHOUSE DESIGN Dao Vo Confidential2

Confidential 36

BEAM ✲ METHODOLOGYStructured, non-technical, collaborative working conversation directly with BI

Users

• BI User’s Business Process, Organizational, Hierarchical, and Data Knowledge• Focused Data

Profiling

• Logical and Physical Dimensional Data Models

• Example data• Detailed and

Testable ETL Specification• DW

Prototype

BEAM✲

Data

Modeler

BI Stakeholders

Page 37: © 2013 KMS Technology. AGILE DATA WAREHOUSE DESIGN Dao Vo Confidential2

37

Q&A

Page 38: © 2013 KMS Technology. AGILE DATA WAREHOUSE DESIGN Dao Vo Confidential2

© 2013 KMS Technology

THANK YOU.