informatica - presentation at hortonworks booth - strata 2014

13
Hortonworks and Informatica Informatica + Hortonworks to take Hadoop into Enterprise Production

Upload: hortonworks

Post on 05-Dec-2014

250 views

Category:

Software


3 download

DESCRIPTION

Learn more about the Informatica Big Data Edition as we demonstrate how easy it is to ingest, parse, integrate, and cleanse data on Hadoop. To get started you can download a free Big Data Edition Trial Sandbox for Hortonworks at http://marketplace.informatica.com/bdehortonworks

TRANSCRIPT

Page 1: Informatica - Presentation at Hortonworks Booth - Strata 2014

Hortonworks and Informatica

Informatica + Hortonworks to take Hadoop into Enterprise Production

Page 2: Informatica - Presentation at Hortonworks Booth - Strata 2014

Informatica + Hortonworks to Unleash the Power of Big Data

Archive

Profile Parse Cleanse ETL Match

Stream

Load

Load

Events

Replicate

Topics

Machine Device, Cloud

Documents and Emails

Relational, Mainframe

Social Media, Web Logs

Data Warehouse

Mobile Apps

Analytics & Op

Dashboards

Alerts

Analytics Teams

Page 3: Informatica - Presentation at Hortonworks Booth - Strata 2014

EDI–X12

EDI-Fact

RosettaNet

HL7

HIPAA

XML

LegalXML

IFX

cXML

Salesforce CRM

Force.com

RightNow

NetSuite

Access All Types of Data 200+ High Performance Connectors, Pre-built Parsers for Specialized Data Formats

3

WebSphere MQ JMS MSMQ SAP NetWeaver XI

JD Edwards Lotus Notes Oracle E-Business PeopleSoft

Oracle DB2 UDB DB2/400 SQL Server Sybase

ADABAS Datacom DB2 IDMS IMS

Word, Excel PDF StarOffice WordPerfect Email (POP, IMPA) HTTP

Informix Teradata Netezza ODBC

VSAM C-ISAM Binary Flat Files Tape Formats…

Web Services TIBCO webMethods

SAP NetWeaver SAP NetWeaver BI SAS Siebel

Messaging, and

Web Services

Relational

and Flat

Files

Mainframe

and Midrange

Unstructured

Data and Files

Flat files ASCII reports HTML RPG ANSI LDAP

ebXML

HL7 v3.0

ACORD (AL3, XML)

AST

FIX

SWIFT

Cargo IMP

MVR

ADP Hewitt SAP By Design Oracle OnDemand

Packaged

Application

s

Industry Standards

XML

Standards

SaaS/BPO

Social

Media Facebook Twitter LinkedIn

Kapow Datasift

EMC/Greenplum Vertica

Teradata AsterData

MPP Appliances

Hive HBase MongoDB

Page 4: Informatica - Presentation at Hortonworks Booth - Strata 2014

No-code visual

development

environment Preview results at any

point in the data flow

Data Integration & Quality on Hortonworks Integrate, Cleanse, and Profile Data on Hadoop

Page 5: Informatica - Presentation at Hortonworks Booth - Strata 2014

Informatica, Part of the Modern Data Architecture

• Optimize ETL workloads

for the appropriate

infrastructure

• Lower cost of data

storage and processing

• Curate data for analysis

and operational systems

• Minimizes risks of big

data projects

• Easy migration for

Informatica customers

Page 6: Informatica - Presentation at Hortonworks Booth - Strata 2014

Why Informatica + Hortonworks

Features Benefits

Visual development environment Increase productivity 5x,

operational efficiency, reuse

200+ high-performance

connectors (legacy & new)

Move all types of customer data

into HDP faster

100+ pre-built transforms for

ETL & data quality

Provide broadest out-of-box

transformations on Hortonworks

100+ pre-built parsers for

complex data formats

Analyze and integrate all types of

data faster

Joint MDA reference

architecture

Complementary capabilities to

accelerate customer success

100K+ trained Informatica

developers

Use existing & readily available

skills for big data

Page 7: Informatica - Presentation at Hortonworks Booth - Strata 2014

Top Use Cases

Solution patterns

7

Page 8: Informatica - Presentation at Hortonworks Booth - Strata 2014

The Big Data Journey

The Big Data Journey

Optimize infrastructure for

performance, cost, &

scalability

A single place to

manage the supply and

demand of data

Real-time proactive 360

customer engagement

Data Warehouse

Optimization

Real-Time

Operational

Intelligence

Managed Data

Lake

IT driven Business driven

Common Project Use-Case Patterns

Page 9: Informatica - Presentation at Hortonworks Booth - Strata 2014

Data Warehouse Optimization - Data Flow

Reports

Data Warehouse

1. Identify inactive &

infrequently used data

4. Store & prepare (e.g.

ETL) data on Hadoop

5. Move high value

results data into DW

2. Offload data &

processing to Hadoop

3. Ingest raw data,

replicate changes &

schemas

Machine Device,

Cloud

Documents and

Emails

Relational, Mainframe

Social Media, Web

Logs

Optimize performance /

cost to store & process

data at scale

Page 10: Informatica - Presentation at Hortonworks Booth - Strata 2014

Data Lake Architecture

Visual Development Environment

Enterprise

Repositories

EDW

MDM

DATA REFINEMENT

Profile Profile

Parse

ETL

Cleanse

Match

LOAD

SOURCE

DATA

Batch

Replicate

Stream

Archive

JMS Queue’s

Servers &

Mainframe

Files

Databases

Sensor data

Social

Apache YARN

Apache MapReduce

1 ° ° °

° ° ° °

° ° ° °

°

°

N

HDFS (Hadoop Distributed File System)

Apache Tez

Apache Hive

SQL

DELIVER

Batch

Services

Events

Topics

Page 11: Informatica - Presentation at Hortonworks Booth - Strata 2014

Real-Time Operational Intelligence

Big Data Integration / Analytics

Streaming

Master

Data

Mgmt Financial Advisors

Integration

& Quality

Web Logs

Clickstream Data

Customer / Product

Master

Customer

Customer

Smartphone

Real-Time

Event

Processing

Visualization

Social Data / Signals

Social Data

Connector

FIX, SWIFT,

Market Data

Customer Portal

Customer

Transactions

Mainframe

Connector

Proactive Customer Engagement

Page 12: Informatica - Presentation at Hortonworks Booth - Strata 2014

1

2

Next Steps TRIAL DOWNLOADS

marketplace.informatica.com/bdehortonworks

community.informatica.com/solutions/vibe_data_

stream_for_machine_data

More about Informatica & Hortonworks http://hortonworks.com/partner/informatica/

Page 13: Informatica - Presentation at Hortonworks Booth - Strata 2014

Demo