u01 pc8 lid presentation 20061204

18
1 PowerCenter 8 Level I Developer Education Services Version PC8LID-20060428 Informatica Corporation, 2006. All rights reserved.

Upload: jiethro

Post on 07-Apr-2015

131 views

Category:

Documents


2 download

TRANSCRIPT

Page 1: U01 PC8 LID Presentation 20061204

1

PowerCenter 8Level I Developer

Education Services

Version PC8LID-20060428

Informatica Corporation, 2006. All rights reserved.

Page 2: U01 PC8 LID Presentation 20061204

2

Course Objectives

After completing this course, you should be able to:

• Use PowerCenter developer tools to: • Create and debug mappings

• Create, run, monitor and troubleshoot workflows

• Design basic mappings and workflows

Page 3: U01 PC8 LID Presentation 20061204

3

Unit 1. Data Integration Concepts

Unit 1

Page 4: U01 PC8 LID Presentation 20061204

4

Unit Objectives

After completing this unit, you should be able to:

• Describe Informatica corp and its place in the data integration marketplace

• Define basic data integration terms and concepts

Unit 1

Page 5: U01 PC8 LID Presentation 20061204

5

Informatica – The Data Integration Company

Informatica provides data integration tools for both batch and real-time applications:

Data Migration Data Synchronization

Data Warehousing Data Hubs

Business Activity Monitoring

Unit 1

Page 6: U01 PC8 LID Presentation 20061204

6

• Founded in 1993

• Leader in enterprise solution products

• Headquarters in Redwood City, CA

• Public company since April 1999 (INFA)

• 2000+ customers, including over 80% of Fortune 100

• Strategic partnerships with IBM Global Services, HP, Accenture, SAP, and many others

• Technology partnership with Composite Software for Enterprise Information Integration (EII) – real-time federated views and reporting across multiple data sources

• Worldwide distribution

Informatica – Company Information

Unit 1

Page 7: U01 PC8 LID Presentation 20061204

7

Informatica Affiliations

Unit 1

Page 8: U01 PC8 LID Presentation 20061204

8

Informatica Resources

www.informatica.com – provides information (under Services) on:• Professional Services• Education Services

my.informatica.com – customers and contractual partners can sign up to access:• Technical Support• Product documentation (under Tools – online documentation)• Velocity Methodology (under Services)• Knowledgebase• Mapping templates

devnet.informatica.com – sign up for Informatica Developers Network• Discussion forums• Web seminars• Technical papers

Unit 1

Page 9: U01 PC8 LID Presentation 20061204

9

Data Integration – Example

• Transaction data• Optimized for transaction

response time• Current• Normalized or

De-normalized data

Operational Systems

Mainframe RDBMS Other

• Aggregated data

• Historical data

Decision Support

Data

Warehouse

ETL Load

Transform

Extract

• Aggregate data• Cleanse data• Consolidate data• Apply business rules• De-normalize data

Unit 1

Page 10: U01 PC8 LID Presentation 20061204

10

Assembly Line Analogy

Assembly Line Data Integration

Product design Target data model

High-level instructions Mapping

Detailed instructions Transformations

Operational assembly line Workflow

Operational steps Task

Documented procedures Metadata

Unit 1

Page 11: U01 PC8 LID Presentation 20061204

11

Mapping

Logically Defines the Data Integration Process:

• Reads data from sources

• Applies transformation logic to data

• Writes transformed data to targets

Source TargetTransformations

Note: Sources and targets can be flat files, relational tables, XML files, application systems, message queues, etc

Unit 1

Page 12: U01 PC8 LID Presentation 20061204

12

Transformations

• Generate, modify, or pass data

• Data passes into and out of transformations through ports that you link in a mapping

Unit 1

Input Ports Output Ports

Page 13: U01 PC8 LID Presentation 20061204

13

PowerCenter Transformations (partial list)

Source Qualifier: reads data from flat file and relational sources

Expression: performs row-level calculations

Filter: uses a condition statement to select rows for inclusion

Sorter: sorts data

Aggregator: performs aggregate calculations

Joiner: joins heterogeneous sources

Lookup: looks up values and passes them to other objects

Update Strategy: tags rows for insert, update, delete, reject

Router: routes rows conditionally

Transaction Control: allows data-driven commits and rollbacks

Unit 1

Page 14: U01 PC8 LID Presentation 20061204

14

Advanced PowerCenter Transformations

Union: Performs a union-all join between two data streams

Java: allows Java syntax to be used within PowerCenter

Midstream XML Parser: reads XML from anywhere in mapping

Midstream XML Generator: writes XML to anywhere

More Source Qualifiers: read from XML, message queues and applications

Unit 1

Page 15: U01 PC8 LID Presentation 20061204

15

Task

• An executable set of actions, functions or commands

• Examples:Session task runs a mapping

Command task runs a shell script

Email task sends an email

Decision task branches workflow conditionally

Timer task waits for a specified period

Unit 1

Page 16: U01 PC8 LID Presentation 20061204

16

Workflow

• A collection of ordered tasks

• Tasks can be linked sequentially, concurrently or both

• Links can depend on the successful completion of previous tasks

Unit 1

Page 17: U01 PC8 LID Presentation 20061204

17

Metadata

• Defines data and processes

• Examples:• Source and target definitions

• Type (flat file, database table, XML file, etc)

• Datatype (character string, integer, decimal, etc)

• Other attributes (length, precision, etc.)

• Mapping logic

• Workflow logic

• Stored in a metadata repositoryRepository

Unit 1

Page 18: U01 PC8 LID Presentation 20061204

18

Unit 1 Quiz

1. ETL

2. Mapping

3. Workflow

4. Metadata

5. Transformation

6. Task

a. An executable set of actions, functions or commands

b. Defines data and processes

c. Generates or manipulates data

d. Logically defines the ETL process

e. A collection of ordered tasks

f. Extract, transform and load data

Match the terms and explanations:

Unit 1