session2: implementing sap hana dr. bjarne berg associate professor computer science lenoir-rhyne...

43
Session2: Implementing SAP HANA Dr. Bjarne Berg Associate professor Computer Science Lenoir-Rhyne University

Upload: christal-foster

Post on 24-Dec-2015

224 views

Category:

Documents


4 download

TRANSCRIPT

Page 2: Session2: Implementing SAP HANA Dr. Bjarne Berg Associate professor Computer Science Lenoir-Rhyne University

Introduction

2

This second session is more technical as we explore what HANA is and how it works for real-time data warehousing and fast access to big data.

We will look inside HANA, examine its capabilities and interfaces, and see how to size, plan, and install HANA based on examples from several real companies.

You will also see several HANA demos on how to create tables, how to quickly load data, and how to execute hyper-fast reports using BusinessObjects Analysis, WebIntelligence, dashboards, Crystal Report, BI Self-Service through modules, and BI Workspaces on HANA.

Page 3: Session2: Implementing SAP HANA Dr. Bjarne Berg Associate professor Computer Science Lenoir-Rhyne University

EDW Design Vs. Evolution

An organization has two fundamental choices:

1. Build a new well architected EDW

2. Evolve the old EDW or reporting system

Both solutions are feasible, but organizations that selects an evolutionary approach should be self-aware and monitor undesirable add-ons and ‘workarounds”.

Failure to break with the past can be detrimental to an EDW’s long-term success…

Page 4: Session2: Implementing SAP HANA Dr. Bjarne Berg Associate professor Computer Science Lenoir-Rhyne University

What is Analytics?

• Discovery and communication of meaningful patterns in data. Applied to business data, to describe, predict, and improve business

performance.

• Analytic applications often favor data visualization to communicate insight.

• Intelligent data Behavioral data, sensor data, transactional data, market research, …

• Questions (e.g., marketing): Who are my core customers? What drives their behavior? Can I predict change? Can I segment? What has led up to this point?

Page 5: Session2: Implementing SAP HANA Dr. Bjarne Berg Associate professor Computer Science Lenoir-Rhyne University

Session Agenda

5

• Introduction• The Components of SAP HANA• Sizing, planning and installing HANA • New EDW Data Design In HANA• Demo: Performance dashboard from from real implementation• Demo: HANA building tables, data loads & deploying

BusinessObjects • Demo: Deploying Analyics with HANA and BOBJ• Questions and Answers

Page 6: Session2: Implementing SAP HANA Dr. Bjarne Berg Associate professor Computer Science Lenoir-Rhyne University

Persistence Layer

Looking Inside SAP HANA — In-Memory Computing Engine

Disk Storage

Data Volumes Page Mgmt.

BusinessObjects Data Services

Log

Volumes

Logger

Metadata ManagerAuthorization

Manager

Transaction

Manager

Relational Engine

-Row Store-Column Store

Load Controller

SQL Script

Calculation

Engine

Replication Server

SQL Parser

MDX

Session Manager

Inside the Computing Engine of SAP HANA, we have many different components that manage the access and storage of the data. This includes MDX and SQL access, as

well as Load Controller (LC) and the Replication Server.

Inside the Computing Engine of SAP HANA, we have many different components that manage the access and storage of the data. This includes MDX and SQL access, as

well as Load Controller (LC) and the Replication Server.

Page 7: Session2: Implementing SAP HANA Dr. Bjarne Berg Associate professor Computer Science Lenoir-Rhyne University

Session Agenda

7

• Introduction• The Components of SAP HANA• Sizing, planning and installing HANA • New EDW Data Design In HANA• Demo: Performance dashboard from from real implementation• Demo: HANA building tables, data loads & deploying

BusinessObjects • Demo: Deploying Analyics with HANA and BOBJ• Questions and Answers

Page 8: Session2: Implementing SAP HANA Dr. Bjarne Berg Associate professor Computer Science Lenoir-Rhyne University

A T-Shirt Model for Sizing HANA on BW

A T-shirt model is a quick way to get some basic ideas on what a system may look like.

While very inaccurate for sizing, it provides basic information for those just staring considering SAP HANA

Data -Compression (from)

Working Memory

Processors SAS/SSD (for data)

Replication Speed (per hour)

Extra Large (XXL)

7000–-100,000 GB

3072 GB

to 20+ TB

(multi node)

12+ Intel E7 2.4 Ghz

10+ TB 20+ GB

Very Large (XL)

3500–-7000GB

2048 GB

(multi node)

8+ Intel E7 2.4 Ghz

5 – 10 TB 20+ GB

Large (L) 2000–-3500GB

1024GB 4 x Intel E7 2.4 Ghz

4 - 5 TB 5–20GB

Medium (M)

1250–2000GB

512GB 4 x Intel E7 2.0+ Ghz

2048GB 5–20 GB

Small (S) 500–1250GB

256GB 2 x Intel E7 2.0+ Ghz

1024GB 5GB

Extra Small (XS)

256–500GB

128GB 2 x Intel E7 2.0+ Ghz

1024GB 5GB

The number of processors are largely driven by the number of users and usage patterns. Serious consideration should be made before buying hardware.

Page 9: Session2: Implementing SAP HANA Dr. Bjarne Berg Associate professor Computer Science Lenoir-Rhyne University

A Rule-Of-Thumb Approach to Sizing HANA - Memory

Memory can be estimated by taking the current system size, and running the programs in ”get_size.zip” in SAP Note 1637145 to get row and column store sizes for your system.

The 50 GB is for HANA services and caches. The 1.5 is the compression expected for rowstore tables and the 4 is the compression expected for column store tables. The 2-factor refers to the space needed for run-time objects and temporary result sets in HANA. Finally the term ‘existing DB compression’ is to account for any compression already done in your system (if any).

Memory = 50 GB + [ (rowstore tables footprint / 1.5) + (colstore tables footprint * 2 / 4) ] * Existing DB Compression

Remember, these are quick rules-of-thumb, so don’t rely on it for finalized budgeting and hardware purchases.

Page 10: Session2: Implementing SAP HANA Dr. Bjarne Berg Associate professor Computer Science Lenoir-Rhyne University

A Rule-Of-Thumb Approach to Sizing HANA - Disk

The next item you need is disk space, which can be estimated by the following

In this example, you need 4 x 710 GB disk for the persistence layer and about 710 GB for the logs. This equals around 3.5TB (don’t worry, disk space of this size is now almost “cheap”).

The persistence layer is the disk that keeps the system secure and provides for redundancy if there are any memory failures, so it’s important not to underestimate this.

Disk for persistence layer = 4 MemoryDisk for the log = 1 Memory

Remember, these are quick rules-of-thumb, so don’t rely on it for finalized budgeting and hardware purchases.

Page 11: Session2: Implementing SAP HANA Dr. Bjarne Berg Associate professor Computer Science Lenoir-Rhyne University

A Rule-Of-Thumb Approach to Sizing HANA - CPU

The CPUs are based on the number of cores that you include. For example 10 core CPUs now exist (depending on when you bought your system).

If you have a single node with 4 x 10 cores, you will have 40 cores and can handle 200 active users on that hardware node, and quite a larger number of named users.

CPU = 0.2 CPU cores per active user

Remember, these are quick rules-of-thumb, so don’t rely on it for finalized budgeting and hardware purchases.

Page 12: Session2: Implementing SAP HANA Dr. Bjarne Berg Associate professor Computer Science Lenoir-Rhyne University

SAP QuickSizer tool for HANA

There are three versions of the tool for each versions of SAP HANA.

SAP’s Quick Sizer for SAP HANA, is available at http://service.sap.com/quicksizer.

The last is for those who want to use SAP HANA as a standalone platform for in-memory data (i.e., using SAP Data Services to load data to).

The second Quick Sizer version is for SAP HANA on SAP NetWeaver BW

The Quick Sizer for the Rapid Deployment Solutions (RDS) allows you to size for specific tools.

Page 13: Session2: Implementing SAP HANA Dr. Bjarne Berg Associate professor Computer Science Lenoir-Rhyne University

An Alternative SAP BW on HANA Sizing Tool

SAP has released a new ABAP based tool that generates a report significantly better sizing fro SAP BW than using just the QuickSizer above. This program takes into consideration existing database compression, different table types and also include the effects of non-active data on the HANA system.

The higher precision you run the estimate at the longer the program is going to run.

To increase speed, you can also suppress analysis tables with less than 1 MB size.

With 14 parallel processors and 8Tb data warehouse, it is not unusual to see 45-75 minutes run time.

Page 14: Session2: Implementing SAP HANA Dr. Bjarne Berg Associate professor Computer Science Lenoir-Rhyne University

SAP BW on HANA Sizing Tool

Since timeouts are common when running the sizing program, you can temporarily change the parameter in rdisp/max_wprun_time to 0 in BW transaction RZ11. Finally, you estimate the growth for the system as a percentage, or as absolute growth.

This program is attached to SAP Note: 1736976 on SAP Marketplace

After you have downloaded and installed the program, and-selected the parameters above, you can go to SE38 and run SDF/HANA_BW_SIZING as a background job.

The output is stored in the file you specified and the file can now be email emailed to hardware vendors for sizing input and hardware selection.

Page 15: Session2: Implementing SAP HANA Dr. Bjarne Berg Associate professor Computer Science Lenoir-Rhyne University

Summary of HANA Sizing Approaches

Approach Quality of Estimate Effort RequiredT-Shirt Sizing Sort of ‘OK” Very LowRule-of-Thumb Better accuracy LowSAP QuickSizer Much better HighSizing for BW program Excellent Moderate

• Work with your preferred vendor before ordering your hardware or finalizing your budgets.

Doing a correct HANA sizing effort is essential for not buying a too small, or too big HANA system.

Page 16: Session2: Implementing SAP HANA Dr. Bjarne Berg Associate professor Computer Science Lenoir-Rhyne University

Staffing a HANA Migration Project – Small Team

System ProfileRaw data size: 2.7 TBComplexity: MediumDataStores: 87InfoCubes: 63Queries: 409

Duration: 14 weeksEnvironments: 4+1Risk aversion: MediumOther usage: IP

This organization is using BWA and will be retiring it as part of the HANA migration

This assumes that the test team is dedicated for 3 weeks during the migration of QA and Prod environments

The test team from the business is already experienced users of the BW system and need minimal training

HANA Optimization of InfoCubes and DSOs are currently for SD only for this organization.

Real Example

Page 17: Session2: Implementing SAP HANA Dr. Bjarne Berg Associate professor Computer Science Lenoir-Rhyne University

Staffing a HANA Migration Project – Medium Team

System ProfileRaw data size: 5.6 TBComplexity: MediumDataStores: 439InfoCubes: 603Queries: 1,300+(incl. BOBJ)

Duration: 18 weeksEnvironments: 4Risk aversion: HIGHOther usage: None

This assumes testing of core queries in BEx and WebIntelligence is done by the business

The data reconciliation and process chain testing is done by dedicated resources in each team.

This team must be staffed with experienced resources. HANA training for team members and hardware installs must be in place prior to project start.

Real Example

Page 18: Session2: Implementing SAP HANA Dr. Bjarne Berg Associate professor Computer Science Lenoir-Rhyne University

Staffing a HANA Migration Project – Very Large Team

System ProfileRaw data size: 38TBComplexity: HighDataStores: 1,300+InfoCubes: 1,720+Queries: 2,200+

Duration: 5 mosEnvironments: 4Risk aversion: HIGHOther usage: APO, IP, BPC

This assumes minimal additional functional optimization

Real Example

Page 19: Session2: Implementing SAP HANA Dr. Bjarne Berg Associate professor Computer Science Lenoir-Rhyne University

Budgeting a HANA Migration Project - Systems

There are a set of items you need to budget for. From a system perspective you will need to consider:

• Hardware quotes Give at least two vendors your sizing estimate and ask for quotes.

• Vendor Support Make sure your hardware vendor include 3-years support in your purchase

• Upgrades Plan and budget for any DW upgrades required before going to HANA

Do DW data cleanup, archiving and move non-needed data to Near-Line Storgae, and then conduct a sizing effort, before requesting a hardware quote.

Page 20: Session2: Implementing SAP HANA Dr. Bjarne Berg Associate professor Computer Science Lenoir-Rhyne University

Hardware Options for HANA

There are currently 7 different certified HANA hardware vendors with 13 different products.

Some boxes can be used as single nodes with others are intended for scale-out solutions for large multi-node systems

Hardware Memory

128GB 256GB 512GB 1024GB

Cisco C260 X X

Cisco C460 X X

Cisco B440 X X+

Dell R910 X X X X

Hitachi CB 2000 X X X

NEC Express 5800 X X X+

Fujitsu RX 600 S5 X X X

Fujitsu RX 900 S2 X X+

HP DL 580 G7 X X X

HP DL 980 G7 X X

HP BL 680 X X X X+

IBM x3690 X5 X X X

IBM x3950 X5 X X X+

Page 21: Session2: Implementing SAP HANA Dr. Bjarne Berg Associate professor Computer Science Lenoir-Rhyne University

Small Example HW Quotes - Dell

This is example is a quote for a smaller 128 GB Memory Box with 2 x 10 cores is based on Dell’s R910 platform for a HANA sidecar usage for less then $40,000 (including tax!)

Most of the smaller HANA systems from the other vendors are similarly prices and depends on the number of boxes you buy, existing discount agreements and the size of the deals you are requesting.

Expect competitive bids for larger systems and similar vendor pricing for similar capabilities

Real Example

Page 22: Session2: Implementing SAP HANA Dr. Bjarne Berg Associate professor Computer Science Lenoir-Rhyne University

Mid-Size Budgeting Example HW Quotes - HP

This example quote is for a mid-sized 512 GB memory box with 4 x 10 cores CPUs and 7 TB disks based on Hewlett-Packard's high-end DL-980 Box.

Including all services and support agreements, this quote is only $150,000

Certified HANA vendors such as HP, IBM, Dell, Cisco, NEC, Hitachi and Fujitsu has dedicated staff to help you get a detailed quote in matter of days.

Real Example

Page 23: Session2: Implementing SAP HANA Dr. Bjarne Berg Associate professor Computer Science Lenoir-Rhyne University

Large Example HW Quotes - Fujitsu

This is example is a quote for a Large 1 TB Memory Box for only $105,000

Real Example

Page 24: Session2: Implementing SAP HANA Dr. Bjarne Berg Associate professor Computer Science Lenoir-Rhyne University

Budgeting a HANA Migration Project - People

Remember to budget for HANA training for your employees before the project starts

Class schedules are found at: training.sap.com

On average plan for $3,000 to $6,000 to train each team member on average plus travelling costs.

Page 25: Session2: Implementing SAP HANA Dr. Bjarne Berg Associate professor Computer Science Lenoir-Rhyne University

Budgeting a HANA Migration Project - People

• Experienced HANA consultants are in very high demand, so budget $1,600 to $2,300 per day for these resources (US)

• Testers with BW experience and some HANA training can be found for more normal consulting rates.

• Solid hands-on migration experience with the latest versions is key a HANA migrations project.

When staffing your HANA project, don’t schedule the start date before you get your staff. You want

the best resources, not whoever is available.

Page 26: Session2: Implementing SAP HANA Dr. Bjarne Berg Associate professor Computer Science Lenoir-Rhyne University

On-Going Support Tasks and Staff Required

Major on-going support tasks consists of: User and role maintenance Security maintenance Backup and disaster recovery Load balancing, monitoring and hardware maintenance Software patches and notes for HANA, BW and Components Cleanup, NLS, Archiving, log deletions Transports, table copies, system copies and data copies Periodic system upgrades

While most tasks are similar to the old relational database systems, the way we do this is quite different. Make sure your HANA support staff is onboarded early and trained before cut-over to production of your migration project.

Page 27: Session2: Implementing SAP HANA Dr. Bjarne Berg Associate professor Computer Science Lenoir-Rhyne University

Session Agenda

27

• Introduction• The Components of SAP HANA• Sizing, planning and installing HANA • New EDW Data Design In HANA• Demo: Performance dashboard from from real implementation• Demo: HANA building tables, data loads & deploying

BusinessObjects • Demo: Deploying Analyics with HANA and BOBJ• Questions and Answers

Page 28: Session2: Implementing SAP HANA Dr. Bjarne Berg Associate professor Computer Science Lenoir-Rhyne University

The Layered Scalable Architecture (LSA)

The LSA consists logically of: Acquisition layer Harmonization/quality layer Propagation layer Business transformation layer Reporting layer Virtualization layer

Page 29: Session2: Implementing SAP HANA Dr. Bjarne Berg Associate professor Computer Science Lenoir-Rhyne University

Data Design - Real Example of LSA Implementation

This company implemented a full LSA Architecture and also partitioned the Infoproviders for faster data loads and faster query performance.

While this architecture has benefits, there are significant issues around data volumes and the Total cost of Ownership when changes are made to the data model(s)

Real Example

Page 30: Session2: Implementing SAP HANA Dr. Bjarne Berg Associate professor Computer Science Lenoir-Rhyne University

Data Design Example of LSA Simplification In HANA

Since many of the benefits sought by the LSA architecture are inherent in HANA, significant simplifications can be made to the data design and data flows

This design has a dramatically smaller cost of ownership (fewer objects to maintain, design and change) than the traditional LSAs

Real Example

Page 31: Session2: Implementing SAP HANA Dr. Bjarne Berg Associate professor Computer Science Lenoir-Rhyne University

Session Agenda

31

• Introduction• The Components of SAP HANA• Sizing, planning and installing HANA • New EDW Data Design In HANA• Demo: Performance dashboard from from real implementation• Demo: HANA building tables, data loads & deploying

BusinessObjects • Demo: Deploying Analyics with HANA and BOBJ• Questions and Answers

Page 32: Session2: Implementing SAP HANA Dr. Bjarne Berg Associate professor Computer Science Lenoir-Rhyne University

HANA Performance DemoReal Example

Page 33: Session2: Implementing SAP HANA Dr. Bjarne Berg Associate professor Computer Science Lenoir-Rhyne University

Session Agenda

33

• Introduction• The Components of SAP HANA• Sizing, planning and installing HANA • New EDW Data Design In HANA• Demo: Performance dashboard from from real implementation• Demo: HANA building tables, data loads & deploying

BusinessObjects • Demo: Deploying Analyics with HANA and BOBJ• Questions and Answers

Page 34: Session2: Implementing SAP HANA Dr. Bjarne Berg Associate professor Computer Science Lenoir-Rhyne University

HANA Design, Build and Deploy Demo

Page 35: Session2: Implementing SAP HANA Dr. Bjarne Berg Associate professor Computer Science Lenoir-Rhyne University

Session Agenda

35

• Introduction• The Components of SAP HANA• Sizing, planning and installing HANA • New EDW Data Design In HANA• Demo: Performance dashboard from from real implementation• Demo: HANA building tables, data loads & deploying

BusinessObjects • Demo: Deploying Analyics with HANA and BOBJ• Questions and Answers

Page 36: Session2: Implementing SAP HANA Dr. Bjarne Berg Associate professor Computer Science Lenoir-Rhyne University

Demo: Deploying Analyics with HANA and BOBJ

Page 37: Session2: Implementing SAP HANA Dr. Bjarne Berg Associate professor Computer Science Lenoir-Rhyne University

Session Agenda

37

• Introduction• The Components of SAP HANA• Sizing, planning and installing HANA • New EDW Data Design In HANA• Demo: Performance dashboard from from real implementation• Demo: HANA building tables, data loads & deploying

BusinessObjects • Demo: Deploying Analyics with HANA and BOBJ• Questions and Answers

Page 38: Session2: Implementing SAP HANA Dr. Bjarne Berg Associate professor Computer Science Lenoir-Rhyne University

Questions and Answers

Dr. [email protected]

Page 39: Session2: Implementing SAP HANA Dr. Bjarne Berg Associate professor Computer Science Lenoir-Rhyne University

39

SAP has a checklist tool for SAP NetWeaver BW powered by HANA (thanks Marc Bernard).

In this tool, SAP provided automatic check programs for both the 3.5 version and the 7.x version of BW. These are found in SAP Note: 1729988.

In version 2.x of this tool, hundreds of checks are done automatically in the BW system. This includes platform checks on database and application and system information.

Moving EDW to SAP HANA – Automated tool 1

There are even basis checks for support packs, ABAP/JAVA stacks, Unicode, BW releases, and add-ons to your system.

Page 40: Session2: Implementing SAP HANA Dr. Bjarne Berg Associate professor Computer Science Lenoir-Rhyne University

40

The idea of the checklist tool from SAP is that you run it several times throughout the project.

Once before you start, then periodically as you resolve issues and upgrade requirements, and then finally when the system has been migrated to HANA.

The checklist tool also has specific checks for the HANA system that can help you identify any issues before turning over the system to end users..

Moving EDW to SAP HANA – Automated tool 1

Page 41: Session2: Implementing SAP HANA Dr. Bjarne Berg Associate professor Computer Science Lenoir-Rhyne University

41

Moving EDW to SAP HANA – Automated tool 2

SAP has released a new ABAP based tool that generates a report significantly better sizing fro SAP BW than using just the QuickSizer above. This program takes into consideration existing database compression, different table types and also include the effects of non-active data on the HANA system.

The higher precision you run the estimate at the longer the program is going to run.

To increase speed, you can also suppress analysis tables with less than 1 MB size.

With 14 parallel processors and 8Tb data warehouse, it is not unusual to see 45-75 minutes run time.

Page 42: Session2: Implementing SAP HANA Dr. Bjarne Berg Associate professor Computer Science Lenoir-Rhyne University

42

Since timeouts are common when running the sizing program, you can temporarily change the parameter in rdisp/max_wprun_time to 0 in BW transaction RZ11. Finally, you estimate the growth for the system as a percentage, or as absolute growth.

This program is attached to SAP Note: 1736976 on SAP Marketplace

After you have downloaded and installed the program, and-selected the parameters above, you can go to SE38 and run SDF/HANA_BW_SIZING as a background job.

The output is stored in the file you specified and the file can now be email emailed to hardware vendors for sizing input and hardware selection.

Moving EDW to SAP HANA – Automated tool 2

Page 43: Session2: Implementing SAP HANA Dr. Bjarne Berg Associate professor Computer Science Lenoir-Rhyne University

43

Many experienced developers are not aware that moving BW to HANA can in some cases result in slower transformations during data loads.

a. Select for all entries (SFAE) statements without HANA DB hints --> add hints

b. Select * --> specify fields to selectc. Database access in the field routines

--> move to start routined. Loops which do not assign field

symbols --> use field symbolse. Selects from master data tables -->

Use the read master data rulef. Selects from DSOs --> Use the read

DSO ruleg. Direct updates to BW object tables --

> Do not update tables directly

Moving EDW to SAP HANA – Automated tool 3