next-generation data integration series - informatica · 2012-10-03 · using data integration...

13
10/3/2012 1 1 Using Data Integration Testing for Reconciling Production Data Assets Best Practices for Data Integration / ETL Testing Series David Loshin, Industry Analyst Robert Myers, HealthNow NY Ash Parikh, Informatica Next-Generation Data Integration Series 30 Minutes with Industry Experts

Upload: others

Post on 22-May-2020

12 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Next-Generation Data Integration Series - Informatica · 2012-10-03 · Using Data Integration Testing for ... Provide automation for unit and regression testing of integration logic

10/3/2012

1

1

Using Data Integration Testing for Reconciling Production Data Assets

Best Practices for Data Integration / ETL Testing Series

David Loshin, Industry Analyst

Robert Myers, HealthNow NY

Ash Parikh, Informatica

Next-Generation Data Integration Series

30 Minutes with Industry Experts

Page 2: Next-Generation Data Integration Series - Informatica · 2012-10-03 · Using Data Integration Testing for ... Provide automation for unit and regression testing of integration logic

10/3/2012

2

2

Best Practices for Data Integration / ETL Testing – Expert Insight Series

With Industry Expert David Loshin

Page 3: Next-Generation Data Integration Series - Informatica · 2012-10-03 · Using Data Integration Testing for ... Provide automation for unit and regression testing of integration logic

10/3/2012

3

3

Page 4: Next-Generation Data Integration Series - Informatica · 2012-10-03 · Using Data Integration Testing for ... Provide automation for unit and regression testing of integration logic

10/3/2012

4

4

How is the World Dealing with Production Data?

„ “We have logic in our mappings to ensure the data is correct”

„ “Our tool generates SQL to ensure no missing records”

„ “We write mappings to balance staging and production tables”

„ “Thorough testing is a critical part of our SDLC”

Page 5: Next-Generation Data Integration Series - Informatica · 2012-10-03 · Using Data Integration Testing for ... Provide automation for unit and regression testing of integration logic

10/3/2012

5

5

Yet Failures Happen…

• “Our business users were telling us that

there was data missing from their reports”

• “Our auditors told us there was no way

to verify that the data was correct”

• “When we migrated systems we found

a lot discrepancies across all the data”

• “We were finding discrepancies between

our Master Data hub and source systems”

• “The aggregate values we get are often

outside the expected ranges we need…”

• “We get more and more data from outside

our organization – an audit trail is crucial…”

Page 6: Next-Generation Data Integration Series - Informatica · 2012-10-03 · Using Data Integration Testing for ... Provide automation for unit and regression testing of integration logic

10/3/2012

6

6

1. Bad upstream data in the

environment

2. Failed jobs that were not

rerun properly

3. Logic errors in integration

system

4. Logic errors in related

processing systems

5. “Rogue” updates bypass

change control process

6. Data and environments are

dynamic

Why Do these Problems Occur?

Page 7: Next-Generation Data Integration Series - Informatica · 2012-10-03 · Using Data Integration Testing for ... Provide automation for unit and regression testing of integration logic

10/3/2012

7

7

What is the Business Impact?

• Incorrect data flowing inside /

outside your company

• Poor or incorrect decisions

affecting the business

• Lost business opportunities

in a competitive market

• Money wasted on

inappropriate initiatives and

projects

• Customer dissatisfaction and

degrading experience

• Exposure to regulatory

non-compliance risks

Page 8: Next-Generation Data Integration Series - Informatica · 2012-10-03 · Using Data Integration Testing for ... Provide automation for unit and regression testing of integration logic

10/3/2012

8

8

A way to identify ERRORS in data sets that have

been MOVED or TRANSFORMED to ensure

they are COMPLETE and ACCURATE and

meet EXPECTATIONS or REQUIREMENTS.

Page 9: Next-Generation Data Integration Series - Informatica · 2012-10-03 · Using Data Integration Testing for ... Provide automation for unit and regression testing of integration logic

10/3/2012

9

9

What is the Ideal Solution?

AUTOMATION

REPEATABILITY

AUDITABILITY

Page 10: Next-Generation Data Integration Series - Informatica · 2012-10-03 · Using Data Integration Testing for ... Provide automation for unit and regression testing of integration logic

10/3/2012

10

10

Production Reconciliation

Protect the integrity of data

that is loaded into

production systems.

Erroneous data due to failed

loads, faulty logic or operational

issues is caught in a proactive

automated manner and can be

addressed as needed

Development & Test

Provide automation for unit and

regression testing

of integration logic.

Ensure that data produced by DI

code meets requirements and

expectations

Informatica’s Data Validation Solution (DVO)

Ensures the integrity of data as it moves through the IT environment...

How Can Informatica Help?

Page 11: Next-Generation Data Integration Series - Informatica · 2012-10-03 · Using Data Integration Testing for ... Provide automation for unit and regression testing of integration logic

10/3/2012

11

11

Benefits of Informatica’s Solution…

• High project success/low project risk

• More cost savings/faster time to market

– 50% source-to-target testing

– 80-90% regression / upgrade testing

• Test all data, not just a small sample

• Test in heterogeneous environments

• No need to write SQL

• Complete audit trail of all testing

• No additional server technology

Page 12: Next-Generation Data Integration Series - Informatica · 2012-10-03 · Using Data Integration Testing for ... Provide automation for unit and regression testing of integration logic

10/3/2012

12

12

Benefits

REQUIREMENTS MANUAL

TESTING INFORMATICA

“We used Informatica to compare 14 tables & approx. 30 million rows in less than 5

hours. The largest of the tables was 94 columns. When I asked our QA people how long it

would take them to run the scripts and test this amount of data, they mentioned months…”

- Customer

Summary

Page 13: Next-Generation Data Integration Series - Informatica · 2012-10-03 · Using Data Integration Testing for ... Provide automation for unit and regression testing of integration logic

10/3/2012

13

13

Informatica.com > Products > Enterprise Data

Integration > PowerCenter > Options

[email protected]

PowerCenter

Productivity Pack*

PowerCenter Data Validation

Option (DVO)

Proactive Monitoring

For PC Operations

Proactive Monitoring

for PC Governance

NEW

*Single Package at an Attractive Price

NEXT UP IN THE SERIES:

ETL Verification: Do Your ETL Processes Do What You Think They Do?

PART 3

On-Demand

Product Desk Webinar:

Best Practices for DI/ETL

Testing Presentation & Demo

Click on link in browser