data warehousing presentation

Upload: praveenrao

Post on 07-Apr-2018

TRANSCRIPT

  • 8/4/2019 Data Warehousing Presentation

    1/32

    Business Intelligence

    David Cheemakuti

    2/32

    Review of Previous Presentation

    Data warehousing concept

    BW Architecture

    3/32

    Data Warehousing

    A data warehouse is the main repository of an organization's historical data. It contains the raw material for management's decision support system.

    The critical factor leading to the use of a data warehouse is that a data analyst can perform complex queries and analysis, such as data mining, on the information without slowing down the operational systems.

    4/32

    Sales Analysis: A Business Scenario

    Suppose that you are a sales manager who is responsible for planning and implementing sales strategy. Your tasks include the following:

    Monitoring and forecasting sales demands and pricing trends

    Managing sales objectives and coordinating the sales force and distributors

    Reviewing the sales activities of each representative, office, and region

    5/32

    6/32

    7/32

    8/32

    To extract, transfer, transform, and load (ETTL) data from diverse data sources into the data warehouse
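
    A minimal sketch of the steps named above, assuming an in-memory list as the data source and SQLite as a stand-in for the warehouse. The table, field names, and values are hypothetical, and the transfer step is collapsed because everything runs in one process.

```python
import sqlite3

# Hypothetical source records; names and values are illustrative only.
SOURCE_ROWS = [
    {"region": " east ", "amount": "100.50"},
    {"region": "West", "amount": "200"},
    {"region": "EAST", "amount": "49.50"},
]


def extract(rows):
    """Extract: read raw records from a source (here, an in-memory list)."""
    return list(rows)


def transform(rows):
    """Transform: normalize region names and convert amounts to numbers."""
    return [(r["region"].strip().title(), float(r["amount"])) for r in rows]


def load(conn, rows):
    """Load: write the cleaned rows into a warehouse table."""
    conn.execute("CREATE TABLE IF NOT EXISTS sales (region TEXT, amount REAL)")
    conn.executemany("INSERT INTO sales VALUES (?, ?)", rows)
    conn.commit()


# SQLite in memory stands in for the data warehouse (an assumption).
conn = sqlite3.connect(":memory:")
load(conn, transform(extract(SOURCE_ROWS)))

# Total sales per region after loading.
totals = dict(conn.execute("SELECT region, SUM(amount) FROM sales GROUP BY region"))
```

    Keeping each step in its own function makes it possible to test or replace one step without touching the others.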

    9/32

    Online transaction processing, or OLTP, refers to a class of systems that facilitate and manage transaction-oriented applications, typically for data entry and retrieval transaction processing.

    Online Analytical Processing, or OLAP, is an approach to quickly provide answers to analytical queries that are multidimensional in nature.
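
    As a rough illustration of the difference, assuming a single SQLite table with hypothetical names and values: an OLTP-style operation reads or updates one record by key, while an OLAP-style query summarizes many records across dimensions (here, region and year).

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE orders ("
    "order_id INTEGER PRIMARY KEY, region TEXT, year INTEGER, amount REAL)"
)
# Hypothetical order data.
conn.executemany(
    "INSERT INTO orders (region, year, amount) VALUES (?, ?, ?)",
    [
        ("East", 2017, 100.0),
        ("East", 2018, 150.0),
        ("East", 2018, 50.0),
        ("West", 2017, 80.0),
        ("West", 2018, 120.0),
    ],
)

# OLTP-style: change and read back a single record, addressed by its key.
conn.execute("UPDATE orders SET amount = ? WHERE order_id = ?", (110.0, 1))
single_order = conn.execute(
    "SELECT region, year, amount FROM orders WHERE order_id = ?", (1,)
).fetchone()

# OLAP-style: aggregate all records along two dimensions.
by_region_year = conn.execute(
    "SELECT region, year, SUM(amount) FROM orders "
    "GROUP BY region, year ORDER BY region, year"
).fetchall()
```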

    10/32

    Business Intelligence (BI) is the broad category of applications and technologies for gathering, storing, analyzing, and providing access to data to help users make better business decisions.

    BI applications include the activities of decision support systems, query and reporting, OLAP, forecasting and data mining.

    11/32

    SAP BW Architecture

    12/32

    Source Systems

    mySAP Business Suite: BI is fully integrated into mySAP Business Suite. It functions as the central data warehousing tool. Predefined extraction structures and programs are delivered by SAP. These allow source data from mySAP Business Suite to be loaded directly into the warehouse.

    SAP XI: Data can be sent from SAP and non-SAP sources to BI using SAP XI (Exchange Infrastructure). The data transfer here is based on SOAP (Simple Object Access Protocol).

    13/32

    Source Systems

    Non-SAP Systems: A big advantage of BI is its open architecture. It is possible to use BI as a consolidated data basis for reporting that covers the entire organization, particularly in a heterogeneous system landscape.

    External Data Providers: Target-oriented data can be provided to BI by external data providers such as ACNielsen and Dun & Bradstreet. They provide research data that can be loaded into BI for benchmarking and measured against the organization's own operative data.

    14/32

    DB Connect: It allows all relational databases to be accessed directly. It is used to set up a connection to the database management system (DBMS) of the external database. By importing metadata and original data, the necessary structures can be generated in BI and data can be loaded easily and efficiently.

    15/32

    UD Connect: It is used to access all relational and multidimensional data sources. UD Connect converts and transfers multidimensional data as flat data. This technology runs on the J2EE Engine and supports the J2EE Connector Architecture.
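
    The idea of transferring multidimensional data as flat data can be sketched as follows. This is plain Python, not the UD Connect interface; the nested dictionary and its values are assumptions made for illustration.

```python
# Hypothetical multidimensional data: revenue indexed by region, then by year.
cube = {
    "East": {2017: 100.0, 2018: 150.0},
    "West": {2017: 80.0},
}


def flatten(cube):
    """Turn nested {region: {year: value}} data into flat (region, year, value) rows."""
    return [
        (region, year, value)
        for region, years in cube.items()
        for year, value in sorted(years.items())
    ]


flat_rows = flatten(cube)
```

    Each flat row carries its dimension values explicitly, so it can be stored in an ordinary relational table.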

    16/32

    BI Service API: It connects the SAP source systems to BI. The Service API is a technology package in SAP source systems of BI that facilitates a high level of integration for data transfer from SAP source systems to BI.

    17/32

    File: SAP supports automatic import of flat files in CSV or ASCII format.
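
    A generic sketch of reading a CSV flat file, using Python's standard csv module rather than any SAP tool. The column names and rows are hypothetical.

```python
import csv
import io

# Hypothetical flat-file content; a real load would read this from disk.
FLAT_FILE = (
    "customer_id,region,revenue\n"
    "C001,East,1200.50\n"
    "C002,West,830.00\n"
)


def read_flat_file(text):
    """Parse CSV text into a list of records with typed values."""
    reader = csv.DictReader(io.StringIO(text))
    return [
        {
            "customer_id": row["customer_id"],
            "region": row["region"],
            "revenue": float(row["revenue"]),  # convert text to a number
        }
        for row in reader
    ]


records = read_flat_file(FLAT_FILE)
```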

    18/32

    Web Service: The Simple Object Access Protocol (SOAP) is used to read XML data and to load it into a BI server.
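
    A minimal sketch of reading XML data from a SOAP message with Python's standard library. The envelope namespace is the SOAP 1.1 one; the payload elements (SalesRecords, Record, Region, Amount) are hypothetical and not an SAP format.

```python
import xml.etree.ElementTree as ET

# SOAP 1.1 envelope namespace.
SOAP_NS = "http://schemas.xmlsoap.org/soap/envelope/"

# Hypothetical message; the payload element names are assumptions.
MESSAGE = """<soap:Envelope xmlns:soap="http://schemas.xmlsoap.org/soap/envelope/">
  <soap:Body>
    <SalesRecords>
      <Record><Region>East</Region><Amount>100.5</Amount></Record>
      <Record><Region>West</Region><Amount>200</Amount></Record>
    </SalesRecords>
  </soap:Body>
</soap:Envelope>"""


def parse_soap_records(message):
    """Return (region, amount) tuples found in the SOAP Body."""
    root = ET.fromstring(message)
    body = root.find(f"{{{SOAP_NS}}}Body")
    if body is None:
        raise ValueError("no SOAP Body element found")
    return [
        (rec.findtext("Region"), float(rec.findtext("Amount")))
        for rec in body.iter("Record")
    ]


rows = parse_soap_records(MESSAGE)
```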

    19/32

    Business Application Programming Interface (BAPI): It is used as an interface between legacy applications, such as Oracle, and BI.

    20/32

    Persistent Staging Area: It stores data in the original format while being imported from the source system. The PSA allows for quality checks before the data are loaded into their destinations, such as ODS Objects or InfoCubes.
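
    The staging idea can be sketched as follows, assuming hypothetical order records and two simple checks. The staged rows are kept exactly as received; only rows that pass the checks would move on to a destination.

```python
# Hypothetical staged rows, stored in their original (text) format.
psa = [
    {"order_id": "1001", "amount": "250.00"},
    {"order_id": "", "amount": "99.00"},    # missing key
    {"order_id": "1003", "amount": "abc"},  # amount is not a number
]


def is_valid(row):
    """Quality check: the key must be present and the amount must be numeric."""
    if not row["order_id"]:
        return False
    try:
        float(row["amount"])
    except ValueError:
        return False
    return True


# The staging list itself is left untouched; rows are only sorted into two groups.
accepted = [row for row in psa if is_valid(row)]
rejected = [row for row in psa if not is_valid(row)]
```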

    21/32

    Data Warehouse: It holds very detailed data for longer periods.

    22/32

    Operational Data Store: It contains detailed data for low-level operational needs. Data is not kept for a long time.

    23/32

    Data Marts: They provide summarized or aggregated data with the database objects for long-term reference. Most of the BI data come from this area.

    24/32

    Administrator Workbench maintains meta-data and all BW objects. It has two components:

    BW Scheduler, for scheduling jobs to load data

    BW Monitor, for monitoring the status of data loads

    25/32

    BW Business Explorer (BEx):

    BEx Analyzer is Microsoft Excel with a BW add-in.

    BEx Browser works much like an information center, allowing users to organize and access all kinds of information.

    26/32

    Staging Engine: It implements data mapping and transformation. Triggered by BW Scheduler, it sends requests to a source system for data loading. The source system then selects and transfers data into BW.

    27/32

    ODS (Operational Data Store) Objects allow us to build a multilayer structure for operational data reporting. They are not based on the star schema and are used primarily for detail reporting, rather than for dimensional analysis.

    InfoCubes are the fact tables and their associated dimension tables in a star schema.
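
    A small star schema, sketched in SQLite under assumed table and column names. It shows one fact table joined to two dimension tables; it is not the table layout that BW generates for an InfoCube.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    -- Dimension tables describe the "who" and "when" of each fact.
    CREATE TABLE dim_customer (customer_key INTEGER PRIMARY KEY, region TEXT);
    CREATE TABLE dim_time (time_key INTEGER PRIMARY KEY, year INTEGER);

    -- The fact table holds the measures plus a key into each dimension.
    CREATE TABLE fact_sales (
        customer_key INTEGER REFERENCES dim_customer(customer_key),
        time_key INTEGER REFERENCES dim_time(time_key),
        revenue REAL
    );

    -- Hypothetical sample data.
    INSERT INTO dim_customer VALUES (1, 'East'), (2, 'West');
    INSERT INTO dim_time VALUES (1, 2017), (2, 2018);
    INSERT INTO fact_sales VALUES
        (1, 1, 100.0), (1, 2, 150.0), (2, 2, 120.0), (1, 2, 50.0);
    """
)

# Dimensional analysis: revenue by region and year, via joins to the dimensions.
revenue_by_region_year = conn.execute(
    """
    SELECT c.region, t.year, SUM(f.revenue)
    FROM fact_sales AS f
    JOIN dim_customer AS c ON c.customer_key = f.customer_key
    JOIN dim_time AS t ON t.time_key = f.time_key
    GROUP BY c.region, t.year
    ORDER BY c.region, t.year
    """
).fetchall()
```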

    28/32

    Data Manager maintains data in ODS Objects and InfoCubes and tells the OLAP Processor what data are available for reporting.

    29/32

    Metadata Repository contains information about the data warehouse. Meta-data comprise data about data.

    Metadata Repository contains two types of meta-data: business-related (definitions and descriptions used for reporting) and technical (structure and mapping rules used for data extraction and transformation). We use Metadata Manager to maintain Metadata Repository.

    30/32

    BDS (Business Document Services) stores documents. The documents can appear in various formats, such as Microsoft Word, Excel, PowerPoint, PDF, and HTML. BEx Analyzer saves query results, or MS Excel files, as workbooks in the BDS.

    User Roles are a concept used in SAP authorization management. BW organizes BDS documents according to User Roles. Only users assigned to a particular User Role can access the documents associated with that User Role.
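
    The role-based access rule can be sketched like this. The role names, users, and document names are hypothetical, and this is not the SAP authorization API.

```python
# Hypothetical mapping of roles to the documents associated with them.
role_documents = {
    "SALES_MANAGER": {"sales_report.xls"},
    "FINANCE": {"budget.pdf"},
}

# Hypothetical assignment of users to roles.
user_roles = {
    "alice": {"SALES_MANAGER"},
    "bob": {"FINANCE"},
}


def can_access(user, document):
    """A user may open a document only if one of the user's roles includes it."""
    return any(
        document in role_documents.get(role, set())
        for role in user_roles.get(user, set())
    )
```

    For example, can_access("alice", "sales_report.xls") is true, while can_access("alice", "budget.pdf") is false because alice holds no role that includes that document.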

    31/32

    OLAP Processor is the analytical processing engine. It retrieves data from the database, and it analyzes and presents those data according to users' requests.

    32/32

    Concepts covered:

    What is data warehousing (decision support system)

    Concept of data warehousing (ETL process)

    Introduction to Star Schema

    BW architecture