sentiment analysis with sap hana - business analytics · pdf filesentiment analysis with sap...

Post on 06-Mar-2018

261 Views

Category:

Documents

5 Downloads

Preview:

Click to see full reader

TRANSCRIPT

Sentiment Analysis with SAP HANA

Hochschule Ludwigshafen am RheinProf. Dr. Klaus Freyburger

3/7/2013

2

Prof. Dr. Klaus Freyburger

1982 - 1989 studying mathematics and business administration at the University of Mannheim and University of Massachusetts at Amherst

1991 PhD in mathematics

1991 - 2002 working at SAP AG in WalldorfMain activities: Development of software for business planning

since 2002 professor of Business Information Technology at the University of Ludwigshafen

Teaching and research:

● Programming

● Business Intelligence with SAP, Microsoft and Open Source

personal

3

4.200 students

72 full professors

140 part time lecturers

Wide selection of courses with business administration and social work

Bachelor Business Information Technology

Master Business Information Technology (Information Management & Consulting)

Dualer Studies Bachelor International Business Administration and Information Technology (IBAIT)

Application and practice-oriented, innovative and internationally

Some figures…

4

Beginning 1994 R/2 as one of the first universities

First, own R/3 System

HCC Passau

UCC Magdeburg

Products used:

ERP

SCM

BI

SAP in classroom teaching…

5

Students as student workers at SAP

Thesiswork (Bachelor/Master)

At SAP or SAP customers

Many graduates at SAP and SAP customers

Former SAP employees as professors

… and beyond

6

Leading in the development of the SAP BI curriculum…

… and assist in the rollout

UA Community

The ProjectSentiment Analysis

Goal

Data harvesting from social media

Using SAP BOE Data Services text data processing capabilities for

making sense out of unstructured data

SAP HANA data persistence for data mining, rapid retrieval and

analysis

Topic: „US Presidential Elections 2012“

8

9

Overview

1.) Extract 2.) Sentiment Analysis

3.) Modeling 4.) Analysis

Facebook

TwitterIn-

Memory

BusinessObjects Dashboards

Advanced Analysis for Office

SAPBusinessObjectsData Services

SAPHANA

The Project1.) Data Extraction

11

Extraction

Extraction

1.500.000 posts per day average

> 18 posts per second

4122

1

4121

9

4121

7

4121

5

4121

3

4121

1

4120

9

4120

7

4120

5

4120

3

4120

1

4119

9

4119

7

4119

5

4119

30

1.000.000

2.000.000

3.000.000

4.000.000

5.000.000

6.000.000

7.000.000

12

Number of posts

3rd debate2nd debate

election day

The Project2.) Sentiment Analysis

Sentiment Analysis

Extraction

HANA HANA

SAP BusinessObjects Data Services

Sentiment analysis

14

time to HANA< 2 min

time to extraction< 1sec (Twitter)

Sentiment Analysis

15

SAP BusinessObjects Data Services Workflow

HANA HANA

Sentiment Analysis

16

Incoming posts

find opinions about a person / topic

Extraction of entities

Linguistic analysis

Homogenization

17

Obama is great. Barack is great.

Candidates● Barack Obama● Mr. President● Barack● Obama● …● Mitt● Mr. President● Barack● Obama● …● …● Mr. President● Barack● Obama● …

18

Homogenization

Obama is great.Barack is great.

Dictionary

Sentiment Analysis

19

Incoming posts

find opinions about a person / topic

Extraction of entities

Linguistic analysis

sentiment●strong positive ● weak positive● neutral● weak negative●strong negative

20

Linguistic analysis

negativepositive

Obama is great Obama is terrible

Obama is ok

Example:

I really like Obama. He seems like somebody that I can trust.

21

Linguistic analysis

like

Positive Sentiment

Barack Obama

Candidate

The Project3.) Modeling

23

Modeling

Facebook

TwitterIn-

Memory

BusinessObjects Dashboards

Advanced Analysis for Office

SAPBusinessObjectsData Services

SAPHANA

1.) Extract 2.) Sentiment Analysis

3.) Modeling 4.) Analysis

24

Modeling in SAP HANA Studio

Attribute Views

– give master data tables context

– subset of columns and rows from a data table

Analytic Views

– build data foundation based on transactional tables

– contain normal attributes and measures

– measures are aggregated key figures

Calculation Views

– provide composites of other views

– join or union two or more analytic views or tables

25

Modeling Tasks

HANA

Create different SAP HANA Views for different purposes– provide fast data access for frontend tools

– aggregate information

– filter ambiguous sentiments within post content

26

Modeling Example

500ms

~45.mio

The Project4.) Analysis

28

Interfaces frontend tools – SAP HANA

BusinessObjects Dashboards

Advanced Analysis for Office

universe

BICS

JDBC

ODBC

In-Memory

SAPHANA

Dashboards: Start View

BusinessObjects Dashboards 29

Dashboards: overview

BusinessObjects Dashboards 30

Dashboards: Time Line

BusinessObjects Dashboards 31

Dashboards: Live Posts

BusinessObjects Dashboards 32

Dashboards: TV Debate

BusinessObjects Dashboards 33

Dashboards: topics

BusinessObjects Dashboards 34

Dashboards: geographical distribution

BusinessObjects Dashboards 35

Dashboards: devices

BusinessObjects Dashboards 36

37

Project resumé

great experience in a interesting real life example

exceptional opportunity to work with the innovative In-Memory DB SAP HANA and SAP BusinessObjects tools

result and performance of developed set-up is above all expectations

top related