guidelines_for_getting_started.pdf

24
A Discussion Paper on Guidelines for Getting Started with Teradata Page 1 27-05-2012 15:27 1. Management Summary ................................................................................................................................. 1 2. Getting Started............................................................................................................................................... 2 3. What Have We Learned? ............................................................................................................................. 11 4. The Author ................................................................................................................................................... 11 Appendix A. Data Model Templates ................................................................................................................ 12 1. Management Summary There are three Phases in the lifecycle of a typical Teradata Data Warehouse :- 1. Getting Started 2. Assessment of an operational Data Warehouse 3. Business As Usual (BAU) – changes and enhancements This Paper discusses Phase 1 – Getting Started – and has the objective of promoting discussion of Best Practice. The later two Phases are described in a separate Paper which is available on demand from Database Answers at [email protected] . We use Transportation as the Application to discuss our Approach but it applies equally well to any other area for which a Teradata Industry Data Model is available. Phase 1 – Getting Started This incorporates a Blueprint and will incorporate any appropriate Teradata Industry Data Model. It will then involve design of a Conceptual High-level Model and Subject Areas Models. This Chapter provides material that is suitable for presentations, preparation of a business case, clarification of the requirements or to establish the scope of a Data Warehouse. Presentations can be prepared for specific Industry Areas using this material, supplemented with ‘Kick-Start’ Data Model Templates in Appendix A for Banking, Communications, Retail, Transport and Web Visit Statistics.

Upload: stephanie-armstrong

Post on 16-Dec-2015

1 views

Category:

Documents


0 download

TRANSCRIPT

  • A Discussion Paper on Guidelines for Getting Started with Teradata

    Page 1 27-05-2012 15:27

    1. Management Summary ................................................................................................................................. 1

    2. Getting Started ............................................................................................................................................... 2

    3. What Have We Learned? ............................................................................................................................. 11

    4. The Author ................................................................................................................................................... 11

    Appendix A. Data Model Templates ................................................................................................................ 12

    1. Management Summary There are three Phases in the lifecycle of a typical Teradata Data Warehouse :-

    1. Getting Started

    2. Assessment of an operational Data Warehouse

    3. Business As Usual (BAU) changes and enhancements

    This Paper discusses Phase 1 Getting Started and has the objective of promoting discussion of Best Practice.

    The later two Phases are described in a separate Paper which is available on demand from Database Answers at [email protected].

    We use Transportation as the Application to discuss our Approach but it applies equally well to any other area for which a Teradata Industry Data Model is available.

    Phase 1 Getting Started

    This incorporates a Blueprint and will incorporate any appropriate Teradata Industry Data Model.

    It will then involve design of a Conceptual High-level Model and Subject Areas Models.

    This Chapter provides material that is suitable for presentations, preparation of a business case, clarification of the requirements or to establish the scope of a Data Warehouse.

    Presentations can be prepared for specific Industry Areas using this material, supplemented with Kick-Start Data Model Templates in Appendix A for Banking, Communications, Retail, Transport and Web Visit Statistics.

  • A Discussion Paper on Guidelines for Getting Started with Teradata

    Page 2 27-05-2012 15:27

    2. Getting Started 2.1 What is this ? This Chapter describes how to get started with a Teradata Project.

    2.2 Why is it important ? It is important because it helps us to understand the overall process and concepts of a Data Warehouse.

    2.3 A Blueprint

    This is a Teradata Blueprint for an Architecture where rooms become Subject Areas in a Data Model.

  • A Discussion Paper on Guidelines for Getting Started with Teradata

    Page 3 27-05-2012 15:27

    2.4 A Transport Enterprise Data Model

    This shows an extract of the Teradata Transport and Logistics Data Model.

    It acted as a starting-point for the creation of a version of the Model for Maersk, which was the Teradata Customer.

    The yellow area in this diagram shows an EDM with separate Subject Areas :-

    Agreement, Asset, Event, Geography, Party, Service, Shipment and Transportation. These are shown in larger sizes around the yellow rectangle.

    TOPTOPTOPTOP----LEVEL BUSINESS DATA MODEL FOR MAERSK LINE LEVEL BUSINESS DATA MODEL FOR MAERSK LINE LEVEL BUSINESS DATA MODEL FOR MAERSK LINE LEVEL BUSINESS DATA MODEL FOR MAERSK LINE

    SERVICESERVICESERVICESERVICE

    GEOGRAPHYGEOGRAPHYGEOGRAPHYGEOGRAPHY

    ExampleGeo Types :

    TRANSPORTATIONTRANSPORTATIONTRANSPORTATIONTRANSPORTATION

    AGREEMENTAGREEMENTAGREEMENTAGREEMENT

    Example Example Example Example Party Party Party Party Roles Roles Roles Roles ::::

    PARTYPARTYPARTYPARTY

    ExampleExampleExampleExamplePartyPartyPartyPartyRoles Roles Roles Roles ::::

    ASSETASSETASSETASSET SHIPMENTSHIPMENTSHIPMENTSHIPMENTEVENTEVENTEVENTEVENT

    Business Process Management

    Work Process Specification

    Workflow:- Actual- Planned

    Business Metric

    Work Process Actual

    Business Process Management

    Work Process Specification

    Workflow:- Actual- Planned

    Work Process Actual

    Financial Planning, Analysis, Management and Consuklting

    Financial information:- Actual- Multiple Plans

    Geographic Defined Area

    Site

    Country

    City

    Zone

    Service

    Geography

    Transportation

    Asset

    Event

    Party

    Agreement

    Shipment

    Dated Voyage

    Shipment Route Link

    Shipment Route Point

    Shipment Route

    Dated Voyage

    Shipment Route Link

    Shipment Route Point

    Shipment Route

    Transport Plan

    AgreementTariff

    Service Contract

    String Role

    PartySegment

    Vendor

    Customer

    Employee

    Internal Organisation

    Carrier

    Vessel

    Asset

    Equipment

    Container

    Equipment Size/TYpe

    Shipment

    Transport Document

    Shipment Equipment Assignment Cargo Package

    Cargo

    Cargo Conditioning RequirementEvent

    Dated Site Call

    Equipment Event

    Business Process Workflow Business Process Workflow SLABusiness Process Workflow GL Account Chart of Accounts Balance

    Account Transaction

    Actual Balance

    Financial Plan Balance

    Allocation (Rule)

    Communication Mode

    Calendar & Time

    Currency

    Unit of Measure

    Commodity

    Service

    StringZone Deployment Plan

    Transport Mode Transport Link

    Non-Dated Zone Call

    Rotation

    Rotation Service Assignment

    Service

    StringZone Deployment Plan

    Transport Mode Transport Link

    Non-Dated Zone Call

    Rotation

    Rotation Service Assignment

    BPM Data KPI Data

    Key Perfpormance Indicator

    Financial Data Reference Data

    Commodity

  • A Discussion Paper on Guidelines for Getting Started with Teradata

    2.5 A Data Architecture and BIReady

    This shows six Layers in a Data Architecture for Data Warehouses.

    On the left-hand side we show the layers in a Teradata Data Architecture.

    On the right-hand side, we use a product called BIReady that provides the functionality within the dotted lines which corresponds to the Data Integration and Data Marts Stages in the Road Map.

    In other words, BIReady automates the implementation of ETL and ELT functions, the Data Warehouse and the Data Marts.

    BIReady facilities are shown in the dotted square

    Data Marts Semantic Data Model Business Intelligence (BI)

    Data Governance

    3NF Data Warehouse Dimensional Models Operational Data Stores Page 4 27-05-2012 15:27

  • A Discussion Paper on Guidelines for Getting Started with Teradata

    2.6 Stages in loading Data into a Warehouse

    There is a logical sequence to the Stages of loading data into a Warehouse.

    The lowest levels of data provide a foundation. The lowest level is Reference Data, such as Currencies and Languages.

    Then, after these have been identified, sources and loaded we can move to the Master Data.

    This includes Products, Services, Customers and so on.

    Then we come to the more dynamic data such as Sales, Transactions and so on.

    Best Practice states that we should set up load procedures that can be run so that data at each Stage can be deleted and replaced repetitively.

    This shows three Stages of data in a typical Data Warehouse.

    Stage 3) Customer Sales (Adjustments, Transactions, etc.) Page 5

    Stage 2) Master Data Products/Services,

    Stage 1) Reference Data (Currencies, Languages ,etc.) Customers etc. 27-05-2012 15:27

  • A Discussion Paper on Guidelines for Getting Started with Teradata

    Page 6

    2.7 Business Intelligence (Top-down)

    Key Performance Indicators (KPIs) provide an excellent starting-point for Business Intelligence.

    Basic KPIs can be defined for Profitability and Operational Performance.

    These can be enhanced in collaboration with the business users to establish commitment.

    Profitability can iniitially be defined as a basic difference between Profits and Revenue.

    Performance can be measurement of Deliveries being On-Time against plans agreed in Service Level Agreements (SLAs).

    The initial Data Architecture will include this basic structure of KPI and Source Data.

    This will provi ework which can be

    We will use this as a starting-point for discussions to establish the requirements with

    Profitability

    Revenue minus Costs

    Revenue Data Source

    Costs Data Source

    Performance

    Deliveries against Plans

    Dates and Deliveries

    Contracts and SLAs enhanced in a controlled manner. the business users. de a simple but flexible fram27-05-2012 15:27

  • A Discussion Paper on Guidelines for Getting Started with Teradata

    Page 7 27-05-2012 15:27

    2.8 Build the Data Warehouse Incrementally

    A Common Data Model can be used to design a Data Warehouse in a series of incremental Steps that provide a validation with each Step.

    2.8.1 Canonical Data Model

    This shows a Common Data Model that provides an excellent Starting Point for building an Enterprise Data Warehouse.

    2.8.2 Event Message

    We can define Events with corresponding Messages.

    For each Message, we can add to the design of our Data Warehouse.

    The strength of this approach is that it validates the design of the Data Warehouse.

    This Common Data Model defines a Message for each Event.

  • A Discussion Paper on Guidelines for Getting Started with Teradata

    Page 8 27-05-2012 15:27

    2.8.3 Example Event Message (1)

    For example, a Customer purchases one Product at a Store :-

    Date Store ID Customer Name Product Quantity Amount

    The Business Rules would say :-

    1. Each Sale is made to one Customer

    2. Each Sale can involve just one Product

    3. Each Sale is made at a specific Store

    4. Each Sale is made at a specific valid Date and Time which is defined in a Calendar

    Therefore we can define this fragment of a normalised Data Model :-

    Stores

    Sales

    Products

    Calendar Customers

  • A Discussion Paper on Guidelines for Getting Started with Teradata

    Page 9

    2.8.4 Event Message (2)

    In this Event, the Customer buys two Products. Therefore our initial Data Model must be enhanced to have a separate Table for Products Sold.

    When we continue to include more Events, we enhance our Model until it is complete.

    Date Store ID Customer Name

    Product 1 Quantity 1 Amount 1 Product 2 Quantity 2 Amount 2 Total Amount

    This Event changes one of the Business Rules :-

    1. Each Sale can involve multiple Products

    Therefore we add a separate entity called Products Sold to our Data Model :-

    Product Customers Stores Calendar Sales Products Sold 27-05-2012 15:27

  • A Discussion Paper on Guidelines for Getting Started with Teradata

    Page 10 2

    2.9 Generic Dimensional Model

    The existence of Dimensions and Facts makes it possible to define a Generic Design for a range of Dimensional Models.

    This example shows six Dimensions and six Facts.

    Each of these Dimensions and Facts can be translated from the generic names to specifics, such as Customer, Product, Location, Time Period and so on.

    2.10 Simple Data Architecture

    A Data Architecture should be composed of separate Components with separate and clearly-definedThe 3NF Data Warehouse implements data integration and provides a Single View of the Truth (SVOThe objective is to show the major Building Blocks in a simple clean Architecture.

    A more detailed version is shown in the Data Warehouse Bus Architecture in Chapter 3.

    2.11 Sample Kick-Start Data Warehouses

    Some sample Data Warehouses are included in Appendix A for a starting-point in discussion

    ODS Staging Area

    3NF Data Warehouse)

    Dimensional Models (Data Marts) 7-05-2012 15:27

    functionality. T).

    s.

    Semantic Layer

  • A Discussion Paper on Guidelines for Getting Started with Teradata

    Page 11 27-05-2012 15:27

    3. What Have We Learned? We have learned some very important rules that will help us to get started in an organized manner and put in place a sound foundation that we can build on in a safe and stable manner.

    4. The Author Barry Williams is the founder and principal consultant with Database Answers.

    His company has been providing advice and assistance to blue-chip clients for over 20 years.

    His particular interest is in advancing the role of data models as a way of improving communication between the business user community and data management professionals.

    As part of this role he publishes best practice on his Database Answers Web site at

    http://www.databaseanswers.org/

  • A Discussion Paper on Guidelines for Getting Started with Teradata

    Page 12 27-05-2012 15:27

    Appendix A. Data Model Templates A.1 What is this ? These Templates provide introductory Data Models for some specific Applications.

    A.2 Why is it important ? These Templates provide Kick-Start samples to get up and running quickly.

    They also provide samples that can be used to assess existing Data Models for fit for purpose.

    A.3 Banking

    A.3.1 Data Warehouse

    This shows theTemplate EDW for the analysis of retail Banking.

  • A Discussion Paper on Guidelines for Getting Started with Teradata

    Page 13 27-05-2012 15:27

    A.3.2 Dimensional Model

    This shows theTemplate Dimensional Model for retail Banking.

  • A Discussion Paper on Guidelines for Getting Started with Teradata

    Page 14 27-05-2012 15:27

    A.3.3 Operational Data Stores (ODS)

    This shows the ODS for the analysis of retail Banking.

  • A Discussion Paper on Guidelines for Getting Started with Teradata

    Page 15 27-05-2012 15:27

    A.4 Communications

    A.4.1 Data Warehouse

    This shows theTemplate EDW for Communications and, in particular, Telephone Billing .Systems.

  • A Discussion Paper on Guidelines for Getting Started with Teradata

    Page 16 27-05-2012 15:27

    A.4.2 Dimensional Model

    This shows theTemplate Dimensional Model for Communications.

  • A Discussion Paper on Guidelines for Getting Started with Teradata

    Page 17 27-05-2012 15:27

    A.4.3 Operational Data Stores (ODS)

    This shows the ODS for Communications.

  • A Discussion Paper on Guidelines for Getting Started with Teradata

    Page 18 27-05-2012 15:27

    A.5 Retail Sales

    A.5.1 Data Warehouse

    This shows the EDW for the analysis of Retail Sales.

  • A Discussion Paper on Guidelines for Getting Started with Teradata

    Page 19 27-05-2012 15:27

    A.5.2 Dimensional Model

    This shows the Dimensioanl Model for the Retail Sales.

    A.5.3 Operational Data Stores (ODS)

    This diagram shows a sample of extract data for Retail Sales.

  • A Discussion Paper on Guidelines for Getting Started with Teradata

    Page 20 27-05-2012 15:27

    A.6 Transport

    A.6.1 Data Warehouse

    This shows the EDW for the analysis of Transport.

  • A Discussion Paper on Guidelines for Getting Started with Teradata

    Page 21 27-05-2012 15:27

    A.6.2 Dimensional Model

    This shows the Dimensional Model for the Transport Subject Area.

  • A Discussion Paper on Guidelines for Getting Started with Teradata

    Page 22 27-05-2012 15:27

    A.6.3 Operational Data Stores (ODS)

    This diagram shows a sample of extract data for Transport.

  • A Discussion Paper on Guidelines for Getting Started with Teradata

    Page 23 27-05-2012 15:27

    A.7 Web Visit Statistics

    A.7.1 Data Warehouse

    This shows the EDW for the analysis of Transport.

    A.7.2 Dimensional Model

    This diagram shows a Dimensional Model for Web Visit Stats.

  • A Discussion Paper on Guidelines for Getting Started with Teradata

    Page 24 27-05-2012 15:27

    A.7.3 Operational Data Stores (ODS)

    This diagram shows a sample of extract data for Web Visit Stats.

    1. Management Summary2. Getting Started2.3 A Blueprint2.4 A Transport Enterprise Data Model2.5 A Data Architecture and BIReady2.6 Stages in loading Data into a Warehouse2.7 Business Intelligence (Top-down)2.8 Build the Data Warehouse Incrementally2.8.1 Canonical Data Model2.8.2 Event Message2.8.3 Example Event Message (1)2.8.4 Event Message (2)

    2.9 Generic Dimensional Model2.10 Simple Data Architecture2.11 Sample Kick-Start Data Warehouses

    3. What Have We Learned?4. The AuthorAppendix A. Data Model TemplatesA.3 BankingA.3.1 Data WarehouseA.3.2 Dimensional ModelA.3.3 Operational Data Stores (ODS)

    A.4 CommunicationsA.4.1 Data WarehouseA.4.2 Dimensional ModelA.4.3 Operational Data Stores (ODS)

    A.5 Retail SalesA.5.1 Data WarehouseA.5.2 Dimensional ModelA.5.3 Operational Data Stores (ODS)

    A.6 TransportA.6.1 Data WarehouseA.6.2 Dimensional ModelA.6.3 Operational Data Stores (ODS)

    A.7 Web Visit StatisticsA.7.1 Data WarehouseA.7.2 Dimensional ModelA.7.3 Operational Data Stores (ODS)