u01 pc8 lid presentation 20061204
1
PowerCenter 8 Level I Developer
Education Services
Version PC8LID-20060428
Informatica Corporation, 2006. All rights reserved.
2
Course Objectives
After completing this course, you should be able to:
• Use PowerCenter developer tools to:
  • Create and debug mappings
  • Create, run, monitor and troubleshoot workflows
• Design basic mappings and workflows
3
Unit 1. Data Integration Concepts
Unit 1
4
Unit Objectives
After completing this unit, you should be able to:
• Describe Informatica Corporation and its place in the data integration marketplace
• Define basic data integration terms and concepts
5
Informatica – The Data Integration Company
Informatica provides data integration tools for both batch and real-time applications:
• Data Migration
• Data Synchronization
• Data Warehousing
• Data Hubs
• Business Activity Monitoring
6
Informatica – Company Information
• Founded in 1993
• Leader in enterprise solution products
• Headquarters in Redwood City, CA
• Public company since April 1999 (INFA)
• 2000+ customers, including over 80% of Fortune 100
• Strategic partnerships with IBM Global Services, HP, Accenture, SAP, and many others
• Technology partnership with Composite Software for Enterprise Information Integration (EII) – real-time federated views and reporting across multiple data sources
• Worldwide distribution
7
Informatica Affiliations
8
Informatica Resources
www.informatica.com – provides information (under Services) on:
• Professional Services
• Education Services
my.informatica.com – customers and contractual partners can sign up to access:
• Technical Support
• Product documentation (under Tools – online documentation)
• Velocity Methodology (under Services)
• Knowledgebase
• Mapping templates
devnet.informatica.com – sign up for the Informatica Developers Network:
• Discussion forums
• Web seminars
• Technical papers
9
Data Integration – Example
Operational Systems (Mainframe, RDBMS, Other)
• Transaction data
• Optimized for transaction response time
• Current
• Normalized or de-normalized data
ETL (Extract, Transform, Load)
• Aggregate data
• Cleanse data
• Consolidate data
• Apply business rules
• De-normalize data
Decision Support (Data Warehouse)
• Aggregated data
• Historical data
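The extract, transform and load steps on this slide can be sketched in a few lines of plain Python. This is only an illustration of the concept, not PowerCenter code: the `orders` rows, the function names and the in-memory `warehouse` dictionary are all assumptions made for the example.

```python
from collections import defaultdict

# Hypothetical operational rows: one record per transaction.
orders = [
    {"customer": " acme ", "amount": "120.50"},
    {"customer": "ACME", "amount": "79.50"},
    {"customer": "globex", "amount": "10.00"},
]

def extract(rows):
    """Extract: read rows from the (in-memory) operational source."""
    return list(rows)

def transform(rows):
    """Transform: cleanse customer names, then aggregate amounts per customer."""
    totals = defaultdict(float)
    for row in rows:
        name = row["customer"].strip().upper()  # cleanse
        totals[name] += float(row["amount"])    # aggregate
    return dict(totals)

def load(totals, warehouse):
    """Load: write the aggregated rows to the (in-memory) warehouse table."""
    warehouse.update(totals)
    return warehouse

warehouse = load(transform(extract(orders)), {})
print(warehouse)  # {'ACME': 200.0, 'GLOBEX': 10.0}
```

The operational rows stay detailed and current, while the warehouse holds one cleansed, aggregated row per customer.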
10
Assembly Line Analogy
Assembly Line               Data Integration
Product design              Target data model
High-level instructions     Mapping
Detailed instructions       Transformations
Operational assembly line   Workflow
Operational steps           Task
Documented procedures       Metadata
11
Mapping
Logically Defines the Data Integration Process:
• Reads data from sources
• Applies transformation logic to data
• Writes transformed data to targets
Source → Transformations → Target
Note: Sources and targets can be flat files, relational tables, XML files, application systems, message queues, etc.
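As a rough analogy, a mapping can be pictured as a function that reads rows from a source, passes them through an ordered list of transformations, and writes the result to a target. The sketch below uses in-memory CSV text for both ends. `run_mapping`, `add_full_name` and the column names are hypothetical names chosen for the illustration; PowerCenter mappings are built graphically in the Designer, not written this way.

```python
import csv
import io

def run_mapping(source, transformations, target, target_fields):
    """Read rows from source, apply each transformation in order, write to target."""
    rows = csv.DictReader(source)
    for transformation in transformations:
        rows = transformation(rows)
    writer = csv.DictWriter(target, fieldnames=target_fields, lineterminator="\n")
    writer.writeheader()
    written = 0
    for row in rows:
        writer.writerow(row)
        written += 1
    return written

def add_full_name(rows):
    """Transformation logic: derive one output field from two input fields."""
    for row in rows:
        yield {"id": row["id"], "full_name": f'{row["first"]} {row["last"]}'}

source = io.StringIO("id,first,last\n1,Ada,Lovelace\n2,Alan,Turing\n")
target = io.StringIO()
written = run_mapping(source, [add_full_name], target, ["id", "full_name"])
print(written)
print(target.getvalue())
```

The mapping itself holds no data; it only describes how rows flow from source to target.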
12
Transformations
• Generate, modify, or pass data
• Data passes into and out of transformations through ports that you link in a mapping
(Diagram: Input Ports → Transformation → Output Ports)
13
PowerCenter Transformations (partial list)
Source Qualifier: reads data from flat file and relational sources
Expression: performs row-level calculations
Filter: uses a condition statement to select rows for inclusion
Sorter: sorts data
Aggregator: performs aggregate calculations
Joiner: joins heterogeneous sources
Lookup: looks up values and passes them to other objects
Update Strategy: tags rows for insert, update, delete, reject
Router: routes rows conditionally
Transaction Control: allows data-driven commits and rollbacks
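To make a few of these descriptions concrete, the sketch below imitates Filter, Expression, Sorter and Aggregator behavior with ordinary Python functions, treating dictionary keys as ports. The sample rows and function names are assumptions for illustration only and do not reflect how PowerCenter implements these transformations.

```python
from itertools import groupby
from operator import itemgetter

# Hypothetical input rows; each key plays the role of a port.
rows = [
    {"region": "west", "qty": 3, "price": 2.0},
    {"region": "east", "qty": 0, "price": 5.0},
    {"region": "west", "qty": 1, "price": 4.0},
    {"region": "east", "qty": 2, "price": 5.0},
]

def filter_rows(rows, condition):
    """Filter: keep only rows for which the condition is true."""
    return [row for row in rows if condition(row)]

def expression(rows):
    """Expression: row-level calculation that adds a 'total' output port."""
    return [{**row, "total": row["qty"] * row["price"]} for row in rows]

def sorter(rows, key):
    """Sorter: order rows by one port."""
    return sorted(rows, key=itemgetter(key))

def aggregator(rows, group_key, value_key):
    """Aggregator: sum value_key per group (input must be sorted by group_key)."""
    return {
        group: sum(row[value_key] for row in members)
        for group, members in groupby(rows, key=itemgetter(group_key))
    }

kept = filter_rows(rows, lambda row: row["qty"] > 0)
ordered = sorter(expression(kept), "region")
totals = aggregator(ordered, "region", "total")
print(totals)  # {'east': 10.0, 'west': 10.0}
```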
14
Advanced PowerCenter Transformations
Union: performs a union-all merge of two or more data streams (rows are combined, not joined)
Java: allows Java syntax to be used within PowerCenter
Midstream XML Parser: reads XML from anywhere in mapping
Midstream XML Generator: writes XML anywhere in a mapping
More Source Qualifiers: read from XML, message queues and applications
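The union-all behavior named in the Union entry differs from a join: rows from each stream are appended one after another, and duplicates are kept. A minimal Python sketch, with hypothetical streams that share the same columns:

```python
from itertools import chain

# Two hypothetical data streams with identical columns.
stream_a = [{"id": 1, "name": "Ada"}, {"id": 2, "name": "Alan"}]
stream_b = [{"id": 2, "name": "Alan"}, {"id": 3, "name": "Grace"}]

def union_all(*streams):
    """Append the rows of every stream in order; duplicates are not removed."""
    return list(chain(*streams))

merged = union_all(stream_a, stream_b)
print(len(merged))  # 4: the row with id 2 appears twice
```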
15
Task
• An executable set of actions, functions or commands
• Examples:
  • Session task runs a mapping
  • Command task runs a shell script
  • Email task sends an email
  • Decision task branches workflow conditionally
  • Timer task waits for a specified period
16
Workflow
• A collection of ordered tasks
• Tasks can be linked sequentially, concurrently or both
• Links can depend on the successful completion of previous tasks
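The idea of links that depend on successful completion can be sketched with a small sequential runner. This models only the simplest case, a straight chain in which each link requires the previous task to succeed; concurrent branches are left out. The task names and `run_workflow` are hypothetical and unrelated to the Workflow Manager's actual behavior.

```python
def run_workflow(tasks):
    """Run (name, action) pairs in order; stop following links after a failure."""
    status = {name: "not run" for name, _ in tasks}
    for name, action in tasks:
        try:
            action()
        except Exception:
            status[name] = "failed"
            break  # the link to the next task requires success
        status[name] = "succeeded"
    return status

def load_staging():
    pass  # stands in for a session task that completes normally

def load_warehouse():
    raise RuntimeError("target table is locked")  # simulated failure

def send_email():
    pass  # never reached in this run

result = run_workflow([
    ("load_staging", load_staging),
    ("load_warehouse", load_warehouse),
    ("send_email", send_email),
])
print(result)
```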
17
Metadata
• Defines data and processes
• Examples:
  • Source and target definitions
    • Type (flat file, database table, XML file, etc.)
    • Datatype (character string, integer, decimal, etc.)
    • Other attributes (length, precision, etc.)
  • Mapping logic
  • Workflow logic
• Stored in a metadata repository
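One way to picture a source definition is as a small record that describes the data without containing any of it. The classes and the `CUSTOMERS` definition below are hypothetical and only illustrate the attributes listed on this slide; they say nothing about the actual repository schema.

```python
from dataclasses import dataclass, field

@dataclass
class Column:
    name: str
    datatype: str   # e.g. "string", "integer", "decimal"
    precision: int  # maximum length or number of digits

@dataclass
class SourceDefinition:
    name: str
    source_type: str  # e.g. "flat file", "database table", "XML file"
    columns: list = field(default_factory=list)

customers = SourceDefinition(
    name="CUSTOMERS",
    source_type="flat file",
    columns=[Column("CUST_ID", "integer", 10), Column("CUST_NAME", "string", 50)],
)

def too_long(definition, row):
    """Return the columns whose value exceeds the declared precision."""
    return [
        column.name
        for column in definition.columns
        if len(str(row[column.name])) > column.precision
    ]

problems = too_long(customers, {"CUST_ID": 42, "CUST_NAME": "x" * 60})
print(problems)  # ['CUST_NAME']
```

Because the definition is data about data, the same record can drive validation, documentation, or code generation without touching the source rows.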
18
Unit 1 Quiz
Match the terms and explanations:
1. ETL
2. Mapping
3. Workflow
4. Metadata
5. Transformation
6. Task
a. An executable set of actions, functions or commands
b. Defines data and processes
c. Generates or manipulates data
d. Logically defines the ETL process
e. A collection of ordered tasks
f. Extract, transform and load data