1 o.l.a.p. (online analytical processing) by creating doubt you may find certainties. certainties do...
TRANSCRIPT
1
O.L.A.P.
(Online Analytical Processing)
By creating doubt you may find certainties.Certainties do not create enterprise. Doubt and questions do.
Dedicated to Dr. Ing. Andrea Fraschetti, my uncle, a Ferrari man who personally circuit tested,
because he had doubts, a racing car he designed and died doing so
Release 13 August 2012
2
Thank you to the following reviewers of this presentation
Mr. Dean Tallam – Senior Manager of SciFinance, Inc"...SciFinance takes complex mathematical models and translates them into something a computer can solve, allowing banks to flexibly change pricing
models as they introduce new products." Newsweek International
Ing. Filippo Heilpern - Consultant in BD & International, Corporate ExecutiveDr. Ignazio Palau – Consultant in BD & International, Corporate Executive
My son Lorenzo
Some, among others, sources
Introduction to OLAP - Slice, Dice and Drill! - Hari MailvaganamBUSINESS INTELLIGENCE for DUMMIES – Swain Scheps (2008)
Data Warehousing Part 1 : OLAP and OLTP – Mike BruntOLAP Workshop : Basic overview of OLAP Concepts – Keith Laker
MS SQL Server 7.0 OLAP Services – Microsoft Inc.http://whatis.techtarget.com
3
Your multidimensional business query
Given what are my needs, where can I find in 3 areas/regions (France, Europe, South America) and from 2 countries (India and China) the offers that reflect the needs of the industry whom I can fulfill with my acquired educational skills ?
This personal question describes both the data that you need to examineand the way you need the data structured
Some of the questions contained in the above query :
What is my product ? (“…my needs…”) Where can I sell it ? (“…3 areas/regions…and from 2 countries..”) Who wants to buy it (“… the offers…”) How much ? (you are too green, forget about it for the moment….)
YOUR ANSWER TO THIS QUERY ?
FOOD FOR YOUR THOUGHTS
4
OLAP
is
working with data & information - in business terms - without needing to understand the underlying storage mechanism
as well as
having the ability of intelligently and transparently working with the different types of business rules that exist within any organisation and sustain/support them
5
It has also been defined as
Fast Analysis of Shared Multidimensional Information
FastDelivers information to the user at a fairly constant rate. Most queries should be delivered to the user in five seconds or less.
AnalysisPerforms basic numerical and statistical analysis of the data, predefined by an application developer or defined ad hoc by the user.
SharedImplements the security requirements necessary for sharing potentially confidential data across a large user population.
Multidimensional
Not bi-dimensional, not tri-dimensional, multidimensionalInformationAccesses all the data and information necessary and relevant for the application, wherever it may reside and not limited by volume
6
Keep well embedded in your mind the two terms
SHARED
MULTIDIMENTIONAL
7
Multidimentional Data Model
DIMENSIONS descriptive cathegories MEASURES
quantitative values
SHARED
MULTIDIMENTIONAL
8
O.L.A.P is an approach that may quickly provide answers to analytical queries that are multi-dimensional in nature.
Think at the queries you have about your future :
What do I need ?What do I want ?What does the market offer ?What is my offer to the market ?What are the skills that I can bring to the market ?How can I match these with the offer ?How do I find the sources of the offer ? When and how do we “tango”, the offer and me ?
The typical applications of OLAP are in business reporting for sales, marketing, management reporting, business process management (BPM), budgeting and forecasting, financial reporting, etc.
In your case finding a challenge which you will love !
9
It is used extensively by Intelligence Services and Intelligence Agencies (a prime example, the E.C.H.E.L.O.N evesdropping program from the N.S.A. in the US, that along with the F.B.I., just detected massive intrusions in Obama’s and McCain’s campaigns data bases)
10
Databases configured for OLAP employ a multidimensional data model, allowing for complex analytical and ad-hoc queries with a rapid execution time.
They borrow aspects of navigational databases and hierarchical databases that are speedier than their relational kin (proche).
The output of an OLAP query is typically displayed in a matrix (or pivot) format.
The dimensions form the rows and columns of the matrix; the measures, the values.
11
OLAP Data Model
In an OLAP data model, information is conceptually viewed as cubes,which consist of descriptive categories (dimensions) and quantitative values (measures).
The multidimensional data model makes it simple for users to formulate complex queries, arrange data on a report, switch from summary to detail data, and filter or slice data into meaningful subsets.
Cubes is an easy expression to describe a form.
In the real business world OLAP can be multi-dimentional & multifaceted with 5,6,7,…x… dimensions and measures
12
To simplify
Dimension is What TimeGeographyProductChannelOrganizationScenario (budget or
actual)
Measure is How Much € SalesUnit SalesInventoryHead countsIncomeExpensesProfits/Losses
13
Multidimentional Data Model
MEASURES
:
quantitative DIMENSIONS : values descriptive cathegories
14
OLAP environment is centred around use of the term “business intelligence” where the emphasis is on
“online” or active access
“dynamic”
“analytical” in terms of the reports that are generated.
15
online WHAT ?
dynamic WHAT ?
analytical WHAT ?
16
Online
a. Live access to data rather than static reporting.
b. Analytic queries are submitted against the database in real time, and the results are returned in real time.
Analytical processing
i. Easily navigate multidimensional data to perform unpredictable ad hoc queries and display the results in a variety of different layouts
ii. Transparently manage business rules across dimensions and cubes
iii.“Drill through” levels of detail to uncover significant aspects of data
iv. Rapidly and efficiently obtain the results of sophisticated data calculation and selection across multiple dimensions of data
17
A few definitions
A metadata repository is a database of data about data (metadata). The purpose of the metadata repository is to provide a consistent and reliable means of access to data. The repository itself may be stored in a physical location or may be a virtual database, in which metadata is drawn from separate sources. Metadata may include information about how to access specific data, or more detail about it, among a myriad of possibilities.
A data warehouse is an Enterprise reporting solution. It will typically hold all historical data for the company for all time.
A datamart is a smaller version of the data warehouse. It's going to hold a year or two's worth of information, and may not hold all the tables in the data warehouse.
While the data warehouse is for the enterprise, a datamart is typically for a department’s use.
Source http://whatis.techtarget.com
18
Online
Analytical processing
Output
19
One standard transactional report or query will ask the following question :
When was order number 84305 shipped?
This simple, down-to-earth, two-dimensional query reflects basicmechanics/data of doing business.
a. Date of shipmentb. Order Number
It involves simple data selection and little or no calculation processing.
It can be answered directly from the transactional system without any impact
other operations.
No organisation can survive without this basic level of information.
20
OLAP systems – on the other hand - allow an organization’s to answer a
much broader multi-dimentional range of business queries about the data they
are collecting in their transactional systems:
i. How do same quarter sales for our top 10 most profitable products across EMEA Region for this quarter compare with sales a year ago?
ii. What are the differences in the product-sales mix between Regions Scandinavia, North, Central and South Europe , in context to the global European sales mix?
iii.What are our forecast units, unit price per service, unit cost per product, sales, cost trends, and profit for the next 12 months?
iv. In what ways does the mix vary by salesperson, and what is the relative performance of our salespeople?
v. What are , year to date, the products making up to 40% of our gross profit for each Region over the period 2006 to 2008?
21
Two illustrations of OLAP scenarios/architecture that
can allow broad multi-dimentional business queries
22
Figure 1. Data Model for OLTP
23
24
Two dimentional transactional query vs. broader multi-dimentional queries
The main differences between a simple two dimentional transactional query
and broader multi-dimentional queries are :
i. the fact that the latter are much more analytical and quite complex,
ii. that the answer to one question often leads immediately to another question as the user follows a train of thought in addressing a business problem/issue or scouting for an opportunity.
25
OLAP is designed to make it easy for end users to ask broader multi-dimentional range of analytical queries and enhance its day-to-day use without requiring:
Assistance from the IT department
Programming skills
Technical knowledge about the organization of the database
The results of queries also need to be rapid so that the analyst’s train of thought is not interrupted and the value of the analysis is not diminished.
Time and reaction time is of essence in any business scenario. Information is old the minute it is generated.
If it is generated late it could be obsolete.
26
A typical multidimensional business query
For each region of France, what was the percentage change in revenue for our top 15% products, over a rolling three-month period this year starting March compared to the same period last year?
This rather simple business question describes both the data that the user wants to
examine and they way he wants the data structured (i.e.: structural form of that data).
Business users typically want to answer questions that include terms such aswhat, where, who, when and, above all, how much !
You find the following essential questions contained in the above query :
What products are selling best? (“…top 15%…”) Where are they selling? (“…each region France…”) When have they performed the best? (“…over a rolling period….starting
March…”) How much ? (“…percentage change in revenue…”)
27
Your multidimensional business query
Given what are my needs, where can I can find in 3 areas/regions (France, Europe, South America) and from 2 countries (India and China) the offers that reflect the needs of the industry whom I can fulfill with my acquired educational skills ?
This personal question describes both the data that you need to examineand the way you need the data structured
Some of the questions contained in the above query :
What is my product ? (“…my needs…”) Where can I sell it ? (“…3 areas/regions…and from 2 countries..”) Who wants to buy them (“… the offers…”) How much ? (you are too green, forget about it for the moment….)
YOUR ANSWER ????? FOOD FOR YOUR THOUGHTS
GOOD HUNT WOLF PACK !