u01 pc8 lid presentation 20061204

Post on 07-Apr-2015



PowerCenter 8 Level I Developer

Education Services

Version PC8LID-20060428

Informatica Corporation, 2006. All rights reserved.


Course Objectives

After completing this course, you should be able to:

• Use PowerCenter developer tools to:
  • Create and debug mappings
  • Create, run, monitor and troubleshoot workflows

• Design basic mappings and workflows


Unit 1. Data Integration Concepts

Unit 1


Unit Objectives

After completing this unit, you should be able to:

• Describe Informatica Corporation and its place in the data integration marketplace

• Define basic data integration terms and concepts


Informatica – The Data Integration Company

Informatica provides data integration tools for both batch and real-time applications:

• Data Migration
• Data Synchronization
• Data Warehousing
• Data Hubs
• Business Activity Monitoring


Informatica – Company Information

• Founded in 1993

• Leader in enterprise solution products

• Headquarters in Redwood City, CA

• Public company since April 1999 (INFA)

• 2000+ customers, including over 80% of Fortune 100

• Strategic partnerships with IBM Global Services, HP, Accenture, SAP, and many others

• Technology partnership with Composite Software for Enterprise Information Integration (EII) – real-time federated views and reporting across multiple data sources

• Worldwide distribution


Informatica Affiliations


Informatica Resources

www.informatica.com – provides information (under Services) on:
• Professional Services
• Education Services

my.informatica.com – customers and contractual partners can sign up to access:
• Technical Support
• Product documentation (under Tools – online documentation)
• Velocity Methodology (under Services)
• Knowledgebase
• Mapping templates

devnet.informatica.com – sign up for the Informatica Developers Network:
• Discussion forums
• Web seminars
• Technical papers


Data Integration – Example

Operational Systems (Mainframe, RDBMS, Other)
• Transaction data
• Optimized for transaction response time
• Current
• Normalized or de-normalized data

ETL (Extract → Transform → Load)
• Aggregate data
• Cleanse data
• Consolidate data
• Apply business rules
• De-normalize data

Decision Support (Data Warehouse)
• Aggregated data
• Historical data
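The extract, transform and load steps in this example can be sketched in plain Python. This is only an illustration of the idea, not PowerCenter code: the in-memory rows, the field names (`customer`, `amount`) and the function names are all assumptions made for the example.

```python
from collections import defaultdict

# Hypothetical operational rows: current, transaction-level data.
OPERATIONAL_ROWS = [
    {"customer": " alice ", "amount": "10.50"},
    {"customer": "BOB", "amount": "4.00"},
    {"customer": "Alice", "amount": "2.25"},
    {"customer": "bob", "amount": None},  # incomplete row, dropped when cleansing
]


def extract(rows):
    """Extract: read rows from the (here in-memory) operational source."""
    return list(rows)


def transform(rows):
    """Transform: cleanse the rows, then aggregate amounts per customer."""
    totals = defaultdict(float)
    for row in rows:
        if row["amount"] is None:  # cleanse: skip rows with no amount
            continue
        name = row["customer"].strip().lower()  # cleanse: normalize the key
        totals[name] += float(row["amount"])  # aggregate per customer
    return [{"customer": n, "total": t} for n, t in sorted(totals.items())]


def load(rows, warehouse):
    """Load: append the transformed rows to the (here in-memory) target."""
    warehouse.extend(rows)
    return warehouse


warehouse = load(transform(extract(OPERATIONAL_ROWS)), [])
for row in warehouse:
    print(row)
```

The target ends up holding one aggregated row per customer, which mirrors the move from current transaction data to aggregated decision-support data.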


Assembly Line Analogy

Assembly Line                 Data Integration

Product design                Target data model

High-level instructions       Mapping

Detailed instructions         Transformations

Operational assembly line     Workflow

Operational steps             Task

Documented procedures         Metadata


Mapping

Logically defines the data integration process:

• Reads data from sources

• Applies transformation logic to data

• Writes transformed data to targets

Source → Transformations → Target

Note: Sources and targets can be flat files, relational tables, XML files, application systems, message queues, etc.
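As a rough analogy only, a mapping can be pictured as a source, an ordered chain of transformations, and a target. The sketch below uses plain Python generators; `run_mapping`, `only_active` and `uppercase_name` are hypothetical names invented for the illustration and have no counterpart in the PowerCenter tools.

```python
def run_mapping(source_rows, transformations, target):
    """Read rows from the source, apply each transformation in order,
    and write the transformed rows to the target."""
    rows = iter(source_rows)
    for transformation in transformations:
        rows = transformation(rows)  # chain the transformations lazily
    for row in rows:
        target.append(row)
    return target


def only_active(rows):
    """Pass through only the rows flagged as active."""
    for row in rows:
        if row["active"]:
            yield row


def uppercase_name(rows):
    """Row-level change: upper-case the name field."""
    for row in rows:
        yield {**row, "name": row["name"].upper()}


source = [
    {"name": "ann", "active": True},
    {"name": "raj", "active": False},
]
target = run_mapping(source, [only_active, uppercase_name], [])
print(target)
```

The mapping itself only describes what should happen to the data; nothing runs until something executes it, which is the role the Session task plays later in this unit.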


Transformations

• Generate, modify, or pass data

• Data passes into and out of transformations through ports that you link in a mapping

Input Ports → Transformation → Output Ports

PowerCenter Transformations (partial list)

Source Qualifier: reads data from flat file and relational sources

Expression: performs row-level calculations

Filter: uses a condition statement to select rows for inclusion

Sorter: sorts data

Aggregator: performs aggregate calculations

Joiner: joins heterogeneous sources

Lookup: looks up values and passes them to other objects

Update Strategy: tags rows for insert, update, delete, reject

Router: routes rows conditionally

Transaction Control: allows data-driven commits and rollbacks
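To make a few of these roles concrete, here is a minimal Python sketch of what a Lookup, a Sorter, an Aggregator and a Router do to a set of rows. It is an analogy under stated assumptions, not PowerCenter syntax: the sample `orders` and `products` data, the function names, and the routing rule (a row goes to every group whose condition it meets, otherwise to a default group) are choices made for this illustration.

```python
from itertools import groupby
from operator import itemgetter

# Hypothetical sample data.
orders = [
    {"region": "west", "product_id": 2, "qty": 5},
    {"region": "east", "product_id": 1, "qty": 3},
    {"region": "west", "product_id": 1, "qty": 1},
]
products = {1: "widget", 2: "gadget"}  # hypothetical lookup table


def lookup(rows, table):
    """Lookup: add the product name found for each product_id."""
    return [{**row, "product": table.get(row["product_id"])} for row in rows]


def sorter(rows, key):
    """Sorter: order the rows by one field."""
    return sorted(rows, key=itemgetter(key))


def aggregator(rows, group_key, value_key):
    """Aggregator: sum value_key per group_key (rows must be sorted by group_key)."""
    return {
        group: sum(row[value_key] for row in members)
        for group, members in groupby(rows, key=itemgetter(group_key))
    }


def router(rows, conditions):
    """Router: place each row in every group whose condition it meets,
    or in the default group when it meets none."""
    groups = {name: [] for name in conditions}
    groups["default"] = []
    for row in rows:
        matched = [name for name, test in conditions.items() if test(row)]
        for name in matched or ["default"]:
            groups[name].append(row)
    return groups


named = lookup(orders, products)
qty_by_region = aggregator(sorter(named, "region"), "region", "qty")
routed = router(named, {"bulk": lambda row: row["qty"] >= 5})
```

Sorting before aggregating matters here because `itertools.groupby` only groups adjacent rows.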


Advanced PowerCenter Transformations

Union: performs a union-all merge of two or more data streams

Java: allows Java syntax to be used within PowerCenter

Midstream XML Parser: reads XML from anywhere in mapping

Midstream XML Generator: writes XML to anywhere

More Source Qualifiers: read from XML, message queues and applications


Task

• An executable set of actions, functions or commands

• Examples:
  • Session task runs a mapping
  • Command task runs a shell script
  • Email task sends an email
  • Decision task branches workflow conditionally
  • Timer task waits for a specified period


Workflow

• A collection of ordered tasks

• Tasks can be linked sequentially, concurrently or both

• Links can depend on the successful completion of previous tasks
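The idea of ordered tasks joined by links that depend on success can be sketched as follows. This is a simplified model, not how the PowerCenter workflow engine is implemented: `run_workflow`, the task names, and the three status strings are hypothetical, and tasks run strictly one after another here even though a real workflow can also run them concurrently.

```python
def run_workflow(tasks, links):
    """Run tasks in listed order; a task runs only if every predecessor succeeded.

    tasks: dict of task name -> callable returning True on success
           (dict order is the run order)
    links: dict of task name -> list of predecessor task names
    """
    status = {}
    for name, action in tasks.items():
        predecessors = links.get(name, [])
        if all(status.get(p) == "succeeded" for p in predecessors):
            status[name] = "succeeded" if action() else "failed"
        else:
            status[name] = "skipped"  # a predecessor failed or was skipped
    return status


tasks = {
    "load_customers": lambda: True,   # stands in for a Session task
    "load_orders": lambda: False,     # a Session task that fails
    "archive_files": lambda: True,    # stands in for a Command task
    "send_report": lambda: True,      # stands in for an Email task
}
links = {
    "archive_files": ["load_customers"],
    "send_report": ["load_customers", "load_orders"],
}
status = run_workflow(tasks, links)
print(status)
```

In this run `send_report` is skipped because one of the tasks it is linked from failed, while `archive_files` still runs because its only predecessor succeeded.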


Metadata

• Defines data and processes

• Examples:
  • Source and target definitions
    • Type (flat file, database table, XML file, etc.)
    • Datatype (character string, integer, decimal, etc.)
    • Other attributes (length, precision, etc.)
  • Mapping logic
  • Workflow logic

• Stored in a metadata repository
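As an illustration of what such metadata might look like, the sketch below models a source definition and a toy repository with Python dataclasses. The class names, field names and the `CUSTOMERS` example are assumptions made for this sketch; the actual PowerCenter repository is a set of database tables with its own schema, which is not shown here.

```python
from dataclasses import dataclass, field


@dataclass
class Column:
    name: str
    datatype: str       # e.g. "string", "integer", "decimal"
    length: int = 0     # other attributes
    precision: int = 0


@dataclass
class SourceDefinition:
    name: str
    source_type: str    # e.g. "flat file", "database table", "XML file"
    columns: list = field(default_factory=list)


# Toy stand-in for a metadata repository: definitions keyed by name.
repository = {}


def register(definition):
    """Store a definition in the toy repository."""
    repository[definition.name] = definition


register(
    SourceDefinition(
        name="CUSTOMERS",
        source_type="database table",
        columns=[
            Column("CUSTOMER_NAME", "string", length=50),
            Column("BALANCE", "decimal", precision=10),
        ],
    )
)
```

Note that the repository holds a description of the data (names, types, lengths), not the customer rows themselves.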


Unit 1 Quiz

Match the terms and explanations:

1. ETL

2. Mapping

3. Workflow

4. Metadata

5. Transformation

6. Task

a. An executable set of actions, functions or commands

b. Defines data and processes

c. Generates or manipulates data

d. Logically defines the ETL process

e. A collection of ordered tasks

f. Extract, transform and load data
