text analysis hana from sap

Upload: raj

Post on 07-Jul-2018

TRANSCRIPT

  • 8/19/2019 Text Analysis HANA From SAP

    1/12

    Text Search and Text Analysis with SAP HANA

    created by jagadeesh NUNE on Oct 13, 2014 12:20 PM, last modified by jagadeesh NUNE on Oct 13, 2014 12:20 PM

    Version 1

    Text Search and Text Analysis with SAP HANA:- 

    Text Analysis:- Text Analysis is the process of analyzing unstructured text, extracting relevant information and then transforming that information into structured information that can be queried and leveraged in different ways.
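    The idea of turning unstructured text into structured, queryable records can be illustrated with a toy sketch. It is not how SAP HANA's engine works; the three regular expressions, the `PATTERNS` table and the `extract_entities` helper are hypothetical and exist only for this illustration.

```python
import re

# Toy patterns standing in for a real text analysis engine
# (hypothetical, for illustration only).
PATTERNS = {
    "EMAIL": re.compile(r"[\w.+-]+@[\w-]+(?:\.[\w-]+)+"),
    "DATE": re.compile(r"\b\d{4}-\d{2}-\d{2}\b"),
    "AMOUNT": re.compile(r"\$\d+(?:\.\d{2})?"),
}


def extract_entities(text):
    """Return one structured record per match: type, matched text, offset."""
    records = []
    for entity_type, pattern in PATTERNS.items():
        for match in pattern.finditer(text):
            records.append(
                {
                    "type": entity_type,
                    "token": match.group(),
                    "offset": match.start(),
                }
            )
    # Sort by position so the records follow the order of the text.
    return sorted(records, key=lambda record: record["offset"])


note = "On 2014-10-13 anna@example.com asked for a refund of $49.99."
for record in extract_entities(note):
    print(record)
```

    Each record is a small row with fixed fields, so it could be stored in a table and filtered or counted like any other structured data.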

    Hidden facts in Text:- 80% of enterprise information originates in unstructured data, making this a huge source of information. Unstructured data provides insights into customers' perceptions of brands, products, marketing campaigns, and the like. Text analysis also enables request extraction, a method used to extract wishes or requests for improvement from customers.

    SEO (Search Engine Optimization) Analytics:

    Text Data Processing

    SAP HANA supports in-database Text Analysis (SPS0…). The main goal of this feature is to extract meaningful information from texts. In other words, companies can now process big volumes of data sources and extract meaningful information without having to read every single sentence.

    Text analysis with SAP HANA requires that the unstructured data is of a supported file type and gets loaded into a HANA table. Text being loaded into HANA tables is saved in individual rows. These rows are called documents.

    Configuration:- A configuration tells SAP HANA which type of analysis the user wants to do. Configurations are saved in XML format and contain all the important text analysis options. Users can access configurations through the HANA repository. There are five predefined configurations.
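    As a rough sketch of where a configuration name ends up in practice: text analysis is typically switched on by creating a full-text index that names the configuration. The source does not show this statement or list the five names, so both are assumptions taken from SAP's public documentation and should be checked against the HANA release in use. The helper names, the index name and the schema, table and column names are hypothetical.

```python
# Predefined configuration names as listed in SAP's documentation.
# Assumption: the source only says there are five; verify these names
# against your HANA release.
PREDEFINED_CONFIGURATIONS = (
    "LINGANALYSIS_BASIC",
    "LINGANALYSIS_STEMS",
    "LINGANALYSIS_FULL",
    "EXTRACTION_CORE",
    "EXTRACTION_CORE_VOICEOFCUSTOMER",
)


def quote_identifier(name):
    """Wrap an SQL identifier in double quotes, doubling embedded quotes."""
    return '"' + name.replace('"', '""') + '"'


def build_fulltext_index_sql(index_name, schema, table, column, configuration):
    """Build a CREATE FULLTEXT INDEX statement with text analysis enabled.

    This sketch only accepts the predefined names above; custom
    configurations are deliberately out of scope.
    """
    if configuration not in PREDEFINED_CONFIGURATIONS:
        raise ValueError(f"unknown configuration: {configuration!r}")
    return (
        f"CREATE FULLTEXT INDEX {quote_identifier(index_name)} "
        f"ON {quote_identifier(schema)}.{quote_identifier(table)} "
        f"({quote_identifier(column)}) "
        f"CONFIGURATION '{configuration}' TEXT ANALYSIS ON"
    )


print(
    build_fulltext_index_sql(
        "DOC_IDX", "DEMO", "DOCUMENTS", "CONTENT", "EXTRACTION_CORE"
    )
)
```

    Building the statement as a string keeps the sketch runnable without a HANA system; executing it would require a database connection and the appropriate privileges.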

    Loading the PDF documents to SAP HANA

    The easiest and quickest way to load binary documents into a HANA table is by using a Python script. The user can use the same script for multiple documents. The only parameters that have to be adjusted are:

    HANA server connection information
    Path of the binary document
    Schema/table name

    Additional information on data provisioning of binary files into HANA can be retrieved at academy.saphana.com
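    The original script is not reproduced in the source, so the following is only a minimal sketch of what such a loader might look like. It takes the three parameters named above: a connection, the path of the binary document and the table name. The function name `load_binary_document` and the column names `FILENAME` and `CONTENT` are hypothetical. To keep the example runnable without a HANA system, Python's built-in `sqlite3` module stands in for the database; against HANA the connection would presumably come from SAP's Python driver instead (for example `hdbcli`'s `dbapi.connect(...)`, which is an assumption here).

```python
import sqlite3
import tempfile
from pathlib import Path


def load_binary_document(conn, qualified_table, path):
    """Insert one file as a single row (one "document") via a DB-API connection.

    qualified_table is interpolated into the SQL text, so it must come from
    a trusted source, never from user input.
    """
    file_path = Path(path)
    cursor = conn.cursor()
    try:
        cursor.execute(
            f"INSERT INTO {qualified_table} (FILENAME, CONTENT) VALUES (?, ?)",
            (file_path.name, file_path.read_bytes()),
        )
        conn.commit()
    finally:
        cursor.close()


# Demonstration with sqlite3 as a stand-in for the HANA connection.
demo_conn = sqlite3.connect(":memory:")
demo_conn.execute("CREATE TABLE DOCUMENTS (FILENAME TEXT, CONTENT BLOB)")

with tempfile.TemporaryDirectory() as tmp_dir:
    sample = Path(tmp_dir) / "sample.pdf"
    sample.write_bytes(b"%PDF-1.4 placeholder bytes")  # not a real PDF
    load_binary_document(demo_conn, "DOCUMENTS", sample)

print(demo_conn.execute("SELECT FILENAME, LENGTH(CONTENT) FROM DOCUMENTS").fetchall())
```

    Calling the function once per file reuses the same script for multiple documents; each call adds one row, matching the one-row-per-document model described earlier.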

    The TA_TYPE column specifies the type of entity extracted. For instance, P

    SAP HANA Text Analysis

    created by Özgün EFE on Feb 23, 201* 4:12 PM, last modified by Özgün EFE on Feb 23, 201* 4:1+ PM

    Version 3

    As many are aware, twenty-first century corporations are facing a crisis. Many corporations have been accurately and comprehensively storing data for years. The data comes in a variety of forms, such as social media posts, email, blogs, news, feedback, tweets, and business documents.

    It is very important to extract meaningful information without having to read every single sentence. But what is meaningful information? The extraction process should identify the "who", "what", "where", "when" and "how much" (among other things) from this data.

    For example, use social media data to find out:

    What are people saying about my brand or products?

    How many people recommend my brand vs. advocate against it?

    Text Analysis is the solution to all of these problems.

    In this article we will explain:

    What is Text Analysis?

    Why is Text Analysis so important for business?

    How does SAP HANA support text analysis? 

    Before understanding Text Analysis, you first have to understand structured data and unstructured data.

    Structured and Unstructured Data:

    Structured Data: Data that resides in a fixed field within a record or file is called structured data. This includes data contained in relational databases and spreadsheets. For example, data stored in database tables is structured data.

    Structured data has the advantage of being easily entered, stored, queried and analyzed.

    Unstructured Data: The phrase "unstructured data" usually refers to information that doesn't reside in a traditional row-column database.

    Unstructured data files often include text and multimedia content. Examples include e-mail messages, word processing documents, videos, photos, audio files, presentations, webpages and many other kinds of business documents.

    Digging through unstructured data can be cumbersome and costly. Email is a good example of unstructured data. It's indexed by date, time, sender, recipient, and subject, but the body of an email remains unstructured. Other examples of unstructured data include books, documents, medical records, and social media posts.
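    The email example can be made concrete with Python's standard `email` module. This is a small sketch with an invented message: the header fields parse into named values that can be filtered and sorted, while the body comes back as a single free-text string. The sample message and the `split_email` helper are hypothetical.

```python
from email import message_from_string

# Invented sample message, used only for this illustration.
RAW_EMAIL = """\
From: customer@example.com
To: support@example.com
Subject: Delivery problem
Date: Mon, 13 Oct 2014 12:20:00 +0000

Hi, my order arrived late and the box was damaged. Please send a replacement.
"""


def split_email(raw):
    """Separate the structured header fields from the unstructured body."""
    message = message_from_string(raw)
    structured = {
        field: message[field] for field in ("From", "To", "Subject", "Date")
    }
    body = message.get_payload()  # plain string for a non-multipart message
    return structured, body


structured, body = split_email(RAW_EMAIL)
print(structured)  # fixed fields: easy to index and query
print(body)        # free text: needs text analysis to interpret
```

    Finding all mail from one sender is a simple lookup on the structured part; finding all mail that reports a damaged delivery requires analyzing the body text.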

    Why is unstructured data so important for business?

    Experts estimate that 80 to 90 percent of the data in any organization is unstructured. And the amount of unstructured data in enterprises is growing significantly, often many times faster than structured databases are growing.

    The only problem is extracting meaningful information from unstructured data.