informatica pdf a

39
Data Integration Tools ETL : Extraction, Transformation and Load Software ELT : Extraction, Load and Transformation Software Do it once at night vs. do it every time for a query The “360 degree view of the enterprise” is a commonly discussed goal that really means data integration.

Upload: munish-prashar

Post on 30-Oct-2014

115 views

Category:

Documents


3 download

TRANSCRIPT

Page 1: Informatica PDF A

Data Integration Tools

ETL : Extraction, Transformation and Load Software

ELT : Extraction, Load and Transformation Software

Do it once at night vs. do it every time for a query

The “360 degree view of the enterprise” is a commonly discussed goal that really means data

integration.

Page 2: Informatica PDF A

Software Available.

Oracle Warehouse Builder

Informatica - Powercenter

Oracle Data Integrator (Sunopsis)

SQL Server Integration Services

Endeca - Information Discovery Integrator (CloverETL)

Page 3: Informatica PDF A

Informatica - Powercenter

PowerCenter provides an environment that allows you to load data into a centralized location, such as a data warehouse or operational data store (ODS). You can extract data from multiple sources, transform the data according to business logic you build in the client application, and load the transformed data into file and relational targets.

PowerCenter also provides the ability to view and analyze business information and browse and analyze metadata from disparate metadata repositories.

Page 4: Informatica PDF A

PowerCenter - Getting Started

If you want to learn PowerCenter Informatica, the best way to follow is:

First to install it

and then to follow the Getting Started Modules.

Page 5: Informatica PDF A

PowerCenter - Architecture/Components

PowerCenter provides an environment that allows you to load data into a centralized location, such as a data warehouse or operational data store (ODS). You can extract data from multiple sources,

transform the data according to business logic you build in the client application, and load the transformed data into file and relational targets.

PowerCenter also provides the ability to view and analyze business information and browse and analyze metadata from disparate metadata repositories.

Components

Page 6: Informatica PDF A

PowerCenter - Domain. The Power Center domain is the primary unit for management and administration within PowerCenter. The PowerCenter - Service Manager runs on a PowerCenter - Domain. The Service Manager supports the domain and the application services. Application services represent server-based functionality and include the Repository Service, Integration Service, Web Services Hub, and SAP BW Service.

PowerCenter - Repository. The PowerCenter repository resides in a relational database. The repository database tables contain the instructions required to extract, transform, and load data.

PowerCenter - Administration Console. The Administration Console is a web-based administration tool you can use to administer the PowerCenter domain.

PowerCenter - Client. The PowerCenter Client is used to manage users, define sources and targets, build mapping and mapplets with the transformation logic, and create workflows to run the mapping logic. The PowerCenter Client connects to the repository through the Repository Service to modify repository metadata. It connects to the integration service to start aworkflow.

PowerCenter - Repository Service. The Repository Service accepts requests from the PowerCenter Client to create and modify repository metadata and accepts requests from the Integration Service for metadata when a workflow runs.

PowerCenter - Integration Service. The Integration Service extracts data from sources and loads data to targets.

Powercenter - Web Services Hub.

Page 7: Informatica PDF A

Web Services Hub is a gateway that exposes PowerCenter

functionality to external clients through web services.

Powercenter - Sap BW service. The SAP BW Service extracts data from and loads data to SAP BW.

PowerCenter - Data Analyzer. Data Analyzer provides a framework to perform business analytics on corporate data. With Data Analyzer, you can extract, filter, format, and analyze corporate information from data stored

in a data warehouse, operational data store, or other data storage models.

PowerCenter - Metadata Manager. Metadata Manager is a metadata management tool that you can use to browse and analyze metadata from disparate metadata repositories. Metadata Manager helps you understand and manage how information and processes are derived, the fundamental relationships between them, and how they are used.

PowerCenter - Repository Reports. PowerCenter Repository Reports are a set of prepackaged Data Analyzer reports and dashboards to help you analyze and manage PowerCenter metadata.

Page 8: Informatica PDF A

PowerCenter - Installation and Configuration (9.0.1 for OEL Linux

SOFTWARE USED.

Oracle Entreprise Linux 64 bit version 5 Update 5

Oracle Database 11gR2

Power Center – Informatica 9.1

After you complete the installation and configuration process, you have to log in to the Administrator tool to create and configure the following services:

PowerCenter Repository Service

PowerCenter Integration Service

STEP 1-

Oracle Database parameter

Set the open_cursors parameter to 1000

SQL> SHOW parameter open_cursor;

NAME TYPE VALUE

------------------------------------ ----------- ------------------------------

open_cursors integer 300

SQL> ALTER system SET open_cursors=1000;

System altered.

SQL> ALTER system SET open_cursors=1000 SCOPE=BOTH;

System altered.

SQL> SHOW parameter open_cursors;

NAME TYPE VALUE

------------------------------------ ----------- ------------------------------

open_cursors integer 1000

Page 9: Informatica PDF A

Repository Account

At least two database user accounts are required to host this repository :

POW_REP for the repository Database

POW_DOM fro the domain Configuration Database with the CONNECT, RESOURCE, and CREATE VIEW privileges.

For Oracle, you must perform the following instruction :

CREATE USER POW_REP IDENTIFIED BY POW_REP

DEFAULT tablespace users

TEMPORARY tablespace temp

quota unlimited ON users;

GRANT CONNECT, resource, CREATE VIEW TO POW_REP;

CREATE USER POW_DOM IDENTIFIED BY POW_DOM

DEFAULT tablespace users

TEMPORARY tablespace temp

quota unlimited ON users;

GRANT CONNECT, resource, CREATE VIEW, SELECT ANY dictionary TO POW_DOM;

The grant “select dictionary privilege” to the user POW_DOM is only needed to verify the parameter Open_Cursor during the prerequisite check.

Repository Tablespace

Set the storage size for the tablespace to a small number to prevent the repository from using an excessive amount of space.

Also verify that the default tablespace for the user that owns the repository tables is set to a small size. The following example shows how to set the recommended storage parameter for a tablespace named POWERCENTER_DAC.

ALTER TABLESPACE "POWERCENTER_DAC" DEFAULT STORAGE ( INITIAL 10K NEXT 10K MAXEXTENTS UNLIMITED

PCTINCREASE 50 );

Operating System Login as root

Page 10: Informatica PDF A

Processor

Verify that the process match the software (x86_64)

[root@di-obia-01 ~]# uname -p

x86_64

[root@di-obia-01 ~]# uname -orv

2.6.18-194.el5 #1 SMP Mon Mar 29 22:10:29 EDT 2010 GNU/Linux

Temporary Disk Space Requirements

The installer writes temporary files to the hard disk. When the installation completes, the installer deletes the temporary files and releases the disk space.

Installer UNIX 1 GB

Informatica Services UNIX 2 GB

Verification of the temporary space with the df commando: Linux - disk free command (df)

[root@di-obia-01 ~]# df -h /tmp

Filesystem Size Used Avail Use% Mounted on

/dev/mapper/VolGroup00-LogVol00

77G 16G 58G 22% /

User Account

On UNIX, create a user account specifically to run the Informatica daemon.

[root@di-obia-01 ~]# /usr/sbin/useradd powercenter

[root@di-obia-01 ~]# id powercenter

Page 11: Informatica PDF A

uid=502(powercenter) gid=506(powercenter) groups=506(powercenter)

[root@di-obia-01 ~]# passwd powercenter

Changing password for user powercenter.

New UNIX password:

BAD PASSWORD: it is WAY too short

Retype new UNIX password:

passwd: all authentication tokens updated successfully.

Port

Verify that the port numbers are available on the machines where you install Informatica Services.

[root@di-obia-01 ~]# netstat -tulpn|grep 6005

The installer validates the port numbers that you specify to ensure that there will be no port conflicts in the domain.

Environment Variables

Use LANG and LC_ALL to set the UNIX code page for the installer.

Page 12: Informatica PDF A

File Descriptor

Informatica service processes can use a large number of files. Set the Linux - File Descriptor limit per process to 3000 or higher. In the file /etc/security/limits.conf, add this line:

# PowerCenter

powercenter hard nofile 3001

powercenter soft nofile 2500

And verify with the ulimit function logged as the powercenter system user account:

[powercenter@di-obia-01 ~]$ ulimit -Hn

3001

The limit is set to 3001 to prevent an error with the value 300 during the prerequisites check

Installation directory and Permission

As root, create the installation base directory:

mkdir -p /powercenter/9.0.1

chown -R powercenter:powercenter /powercenter

chown -R powercenter:powercenter /powercenter/9.0.1

chmod -R 775 /powercenter

chmod -R 775 /powercenter/9.0.1

As the installer (powercenter), copy the file to a tmp directory, unzip and untar it:

[powercenter@di-obia-01 powercenter901]$ unzip V26109-01.zip

Archive: V26109-01.zip

inflating: 901HF2_Server_Installer_linux-x64.tar

[powercenter@di-obia-01 powercenter901]$ tar -xf 901HF2_Server_Installer_linux-x64.tar

Page 13: Informatica PDF A

Change the permission of the file install.sh to execute

chmod -R 744 /tmp/powercenter901/install.sh

Server

Pre-Installation System Check To verify whether a machine meets the system requirements for an Informatica installation, run the Pre-Installation System Check Tool (i9Pi) before you start the installation process.

You can find the i9Pi tool in the following directory: <InstallerExtractDirectory>/i9Pi

[powercenter@di-obia-01 i9Pi]$ ./i9Pi.bin

Preparing to install...

Extracting the JRE from the installer archive...

Unpacking the JRE...

Extracting the installation resources from the installer archive...

Configuring the installer for this system's environment...

Launching installer...

Page 14: Informatica PDF A

Enter the port 6004 to prevent a problem during the ckeck on the port 2010. The tool normally must verify until the port 6009 but check also the port 6010 which is the standard port for X Forwarding

A pdf is available in the tool directory if you want more information. The report files are stored in the directory where you extract the Pre-Installation System Check tool

files:

HTML format: <i9Pi Directory>/i9Pi/i9Pi_summary.html

Text format: <i9Pi Directory>/i9Pi_summary.log i9Pi_summary.log

Output - Informatica9 Installation Location:/

Informatica starting port number:6004

Database Type:ORACLE

Database Hostname:di-rep-db

Database Port number:1521

Database Service name:direp.hotitem.local

Database User ID:POW_DOM

PASSED: OS Version Check: RHL Version Identified is 2.6.18-194.el5

PASSED: Diskspace: Available DiskSpace is 55105 MB

Page 15: Informatica PDF A

PASSED: Processors: Available Processors is 2

PASSED: Physical Memory: Available RAM Size is 4242 MB

PASSED: Temporary Space: Available temporary diskspace: 55105 MB. Sufficient for installation

PASSED: Socket Port: Socket port Range 6004-6009

PASSED: LANG Environment Variable: LANG Set to en_US.UTF-8 and LC_ALL Set to null

PASSED: JRE_HOME Environment Variable: JRE_HOME is not set

PASSED: File Descriptors : File descriptors limit is 3001

PASSED: Oracle Open Cursors: Open cursors in Oracle database is set to 1000Sufficient for install

PASSED : Resource privilege : The user account has the RESOURCE privilege.

PASSED: Create Table check : Successfully created a database table.

PASSED: Create View : Successfully created a database view.

Initial Server Software Installation This sequence of installation don't say that you have to hit the next button and does not take care of review and informative windows.

[powercenter@di-obia-01 /]$ unset JRE_HOME

[powercenter@di-obia-01 /]$ /tmp/powercenter901/install.sh

OS detected is Linux

unjar task is in progress.............

You can wait a long time on this step ….

unjar of ESD completed.....

Do you want to continue installation (y/n) ?

y

Starting installation ...

\***************************************************************************

\* Welcome to the Informatica 9.0.1 HotFix 2 Server Installer. *

\***************************************************************************

Page 16: Informatica PDF A

To verify whether a machine meets the system requirements for an Informatica installation, run

the

Pre-Installation System Check Tool (i9Pi) before you start the installation process. You can

find the i9Pi

tool in the following directory: <InstallerExtractDirectory>/i9Pi

Before you continue, read the 9.0.1 HotFix 2 Installation Guide and Release Notes.

You can find the 9.0.1 HotFix 2 documentation in the Documentation Center at

http://my.informatica.com

Configure the LANG and LC_ALL variables to generate appropriate code pages and

create and connect to repositories and Repository Services.

Do you want to continue? (Y/N)Y

Installer requires operating system Linux version 2.6 and later.

Current operating system Linux version 2.6.

Current operating system meets minimum requirements.

Select a choice :

1. Install Informatica 9.0.1 with Hot Fix 2 or Upgrade from a previous version of Informatica

to

Informatica 9.0.1 with Hot Fix 2

2. Apply Informatica 9.0.1 Hot Fix 2 to existing 9.0.1 install.

Enter the choice(1 or 2):1

-----------------------------------------------------------

Checking for existing 9.0.1 HotFix 2 product installation.

Select (G)UI mode (needs X Window support) or (C)onsole mode (G/C):G

Launching installer in GUI mode ...

Preparing to install...

Extracting the JRE from the installer archive...

Unpacking the JRE...

Extracting the installation resources from the installer archive...

Configuring the installer for this system's environment...

Launching installer...

Preparing SILENT Mode Installation...

===============================================================================

Informatica 9.0.1 Services HotFix2 (created with InstallAnywhere)

-------------------------------------------------------------------------------

Informatica 9.0.1 HotFix 2 Steps:

Installation Type: Install (and not upgrade)

Pre-requisites: Next

License and Directory. (Transfer the license file from the client zip file)

Page 17: Informatica PDF A

Pre-Installation Summary: Install (Required Disk Space 2815 MB)

Domain Creation: Create a domain

Configuration Database

Page 18: Informatica PDF A

with as JDBC parameters:

MaxPooledStatements=20;CatalogOptions=0;BatchPerformanceWorkaround=true

Domain Configuration

Port Configuration

Page 19: Informatica PDF A

Post-Installation Summary

Installation Status:: SUCCESS

The Informatica 9.0.1 HotFix 2 installation is complete.

For more information, see the debug log file:

/powercenter/Informatica/9.0.1/Informatica_9.0.1_Services_HotFix2.log

Informatica Administrator Home Page http://di-obia-01.hotitem.local:6007

Product Name : Informatica 9.0.1 HotFix 2

Installation Type : Fresh Installation

Installation Directory : /powercenter/Informatica/9.0.1

Login to the Administration console with the Administrator user (Administration/Administrator)

Page 20: Informatica PDF A

Repository Configuration

Start Informatica services. Start > All Programs > Informatica PowerCenter 8.6.1 > Services >

Start Informatica Services.

Launch the Administration console. Start > All Programs > Informatica PowerCenter 8.6.1 >

Services > PowerCenter Administration Console

Login with the account given in the configuration (admin/admin for this article).

Select configuration assistant (if you don't see it, you can also configure a repository with the

configuration tool on the left side)

Page 21: Informatica PDF A

Welcome > Next

Enter a name for the repository, click on the configure button and fill in the fields. You must enter

a connect identifier and not a connect string. The connect string for Oracle contains the

credentials.

Page 22: Informatica PDF A

If the creation of the repository hang, try to give the DBA privileges to the oracle database account

POW_REP

Integration service: fill a name in : INT_OBIA7961 and click next.

Character Data for Integration service: (Note: Character Encoding give the choice

between ASCII and Unicode)

Finish ! A global administrator (for the domain and the repository) is automatically created during the installation with the credentials Administrator/Administrator. You may change the password for security purpose

Page 23: Informatica PDF A

Client Installation Steps:

Double click on the file install.bat located in this directory : File1\pc861_client_win32_x86\

Choose Language. Next

Choose Install Option. Next

Prerequisites. Next

Installation directory For the path, you can leave the default value because it doesn't contain

space C:\Informatica\PowerCenter8.6.1

Pre-installation summary. Next

Post-installation summary. Next

Installation Complete. Done Hot Fix 10 installation:

file2>pc861_HotFix10_Client\install.bat

Select Language. Ok

Welcome. Next. Stop PowerCenter. Ok

Installation Prerequisites. Next

Installation Type: Apply. Next

Installation Directory. Next

Pre-installation summary. Next

Post-installation summary. Next

Installation Complete. Done It will install :

PowerCenter - Client

PowerCenter - Designer

PowerCenter - Repository Manager

PowerCenter - Workflow Manager

PowerCenter - Workflow Monitor

PowerCenter Mapping Architect for Visio (Data Stencil?)

Custom Metadata Configurator

Page 24: Informatica PDF A

Components

1. DOMAINS The Power Center domain is the primary logical unit for management and administration within PowerCenter. The service manager runs on a PowerCenter domain. The Service Manager supports :

the domain

and the application services.

PowerCenter has a service-oriented architecture that provides the ability to scale services and share resources across multiple machines. PowerCenter provides the PowerCenter domain to support the administration of the PowerCenter services.

where:

gateway host and gateway port are the basis for the administration console url A domain can contain multiple repositories:

Page 25: Informatica PDF A

What is a domain ? A domain is the primary unit for management and administration of services in PowerCenter.

A domain contains the following components:

One or more nodes. A node is the logical representation of a machine in a domain. A domain may

contain more than one node. The node that hosts the domain is the master gateway for the

domain. You can add other machines as nodes in the domain and configure the nodes to run

application services, such as the Integration Service or Repository Service. All service requests

from other nodes in the domain go through the master gateway. A nodes runs service processes, which is the runtime representation of an application service running

on a node.

Service Manager. The Service Manager is built in to the domain to support the domain and

the application services. The Service Manager runs on each node in the domain. The Service

Manager starts and runs the application services on a machine.

Application services. A group of services that represent PowerCenter server-based functionality.

The application services that run on each node in the domain depend on the way you configure the

node and the application service. You can use the PowerCenter - Administration Console to manage the domain. If you have the high availability option, you can scale services and eliminate single points of failure for services. The Service Manager and application services can continue running despite temporary network or hardware failures. High availability includes resilience, failover, and recovery for services

and tasks in a domain.

Page 26: Informatica PDF A

This Figure shows a sample domain with three nodes:

This domain has a master gateway on Node 1.

Node 2 runs an Integration Service

and Node 3 runs the Repository Service.

INFA_POWERCENTER_DOMAIN environment variable Before using the command line programs, you can configure the environment

variable INFA_DEFAULT_DOMAIN to set the default domain.

Page 27: Informatica PDF A

PowerCenter - Repository

About The PowerCenter repository resides in a relational database. The repository database tables contain the instructions required to extract, transform, and load data and store administrative information such as :

user names,

passwords,

permissions,

and privileges. For Oracle, when you create the database user, you create it with the following instruction :

CREATE USER POW_REP IDENTIFIED BY POW_REP

DEFAULT tablespace users

TEMPORARY tablespace temp

quota unlimited ON users;

GRANT CONNECT, resource, CREATE VIEW TO POW_REP;

Administration PowerCenter applications access the repository through the PowerCenter - Repository Service. You administer the repository using :

the PowerCenter - Repository Manager client tool,

the PowerCenter - Administration Console,

and command line programs.

Metadata You can view repository metadata in the PowerCenter - Repository Manager. The Informatica

Metadata Exchange (MX) provides a set of relational views that allow easy SQL access to the PowerCenter metadata repository. The views are created during the installation of the upgrade of the repository with the help of mxbld file query (for instance for oracle oramxbld.sql). You find then this view in the PowerCenter Repository schema and they begin with REP.

Page 28: Informatica PDF A

Global and local repositories You can develop global and local repositories to share metadata:

Global repository. The global repository is the hub of the repository domain. Use the global

repository to store common objects that multiple developers can use through shortcuts. These

objects may include operational or Application source definitions, reusable transformations,

mapplets, and mappings.

Local repositories. A local repository is any repository within the domain that is not the global

repository. Use local repositories for development. From a local repository, you can create

shortcuts to objects in shared folders in the global repository. These objects include source

definitions, common dimensions and lookups, and enterprise standard transformations. You can

also create copies of objects in non-shared folders.

PowerCenter version control PowerCenter supports versioned repositories. A versioned repository can store multiple versions of an object. PowerCenter version control allows you to efficiently develop, test, and deploy metadata into production.

Page 29: Informatica PDF A

PowerCenter - Administration Console

About The Administration Console is a web application that you use to manage a domain and security (user, …). If you have a user login to the domain, you can access the Administration Console. Domain objects include:

services,

nodes,

and licenses.

Use the Administration Console to perform the following tasks in the domain:

Manage application services. Manage all application services in the domain, such as the

Integration Service and Repository Service.

Configure nodes. Configure node properties, such as the backup directory and resources. You can

also shut down and restart nodes.

Manage domain objects. Create and manage objects such as services, nodes, licenses, and

folders. Folders allow you to organize domain objects and to manage security by setting

permissions for domain objects.

View and edit domain object properties. You can view and edit properties for all objects in the

domain, including the domain object.

View log events. Use the Log Viewer to view domain, Integration Service, SAP BW Service, Web

Services Hub, and Repository Service log events. Other domain management tasks include applying licenses, managing grids and resources, and configuring security.

Default URL: http://localhost:6001/adminconsole

PowerCenter - Client

Page 30: Informatica PDF A

About The PowerCenter Client consists of the following applications that you use to manage the repository, design mappings, mapplets, and createsessions to load the data:

PowerCenter - Designer. Use the Designer to create mappings that contain transformation

instructions for the Integration Service. For more information about the Designer, see

“PowerCenter Designer” on page 10.

PowerCenter - Data Stencil. Use the Data Stencil to create mapping template that can be used to

generate multiple mappings.

PowerCenter - Repository Manager. Use the Repository Manager to create repository users and

groups, assign privileges and permissions, and manage folders and locks.

PowerCenter - Workflow Manager. Use the Workflow Manager to create, schedule, and run

workflows. A workflow is a set of instructions that describes how and when to run tasks related to

extracting, transforming, and loading data.

PowerCenter - Workflow Monitor. Use the Workflow Monitor to monitor scheduled and running

workflows for each Integration Service. Install the client tools on a Microsoft Windows machine.

Page 31: Informatica PDF A

PowerCenter - Repository Service

About The Repository Service manages connections to the PowerCenter - Repository from PowerCenter - Client applications. The Repository Service is a separate, multi-threaded process that retrieves, inserts, and updates metadata in the repository database tables. The Repository Service ensures the consistency of metadata in the repository. The Repository Service accepts connection requests from the following PowerCenter applications:

PowerCenter - Client. Use the Designer and Workflow Manager to create and store mapping

metadata and connection object information in the repository. Use the Workflow Monitor to

retrieve workflow run status information and session logs written by the Integration Service. Use

the Repository Manager to organize and secure metadata by creating folders, users, and groups.

command_line programs. Use command line programs to perform repository metadata

administration tasks and service-related functions.

PowerCenter - Integration Service. When you start the Integration Service, it connects to the

repository to schedule workflows. When you run a workflow, the Integration Service retrieves

workflow task and mapping metadata from the repository. The Integration Service writes workflow

status to the repository.

Powercenter - Web Services Hub. When you start the Web Services Hub, it connects to the

repository to access web-enabled workflows. The Web Services Hub retrieves workflow task and

mapping metadata from the repository and writes workflow status to the repository.

Powercenter - Sap BW service. Listens for RFC requests from SAP NetWeaver BW and initiates

workflows to extract from or load to SAP BW. You install the Repository Service when you install Powercenter - Services. After you install the

PowerCenter Services, you can use thePowerCenter - Administration Console to manage the Repository Service.

Page 32: Informatica PDF A

PowerCenter - Integration Service

The Integration Service reads workflow information from the repository. The Integration Service

connects to the repository through the repository service to fetch metadata from the repository. A workflow is a set of instructions that describes how and when to run tasks related to extracting, transforming, and loading data. The Integration Service runs workflow tasks. A session is a type of workflow task. A session is a set of instructions that describes how to move data from sources to targets using a mapping.

It extracts data from the mapping sources and stores the data in memory while it applies the transformation rules that you configure in the mapping.

The Integration Service loads the transformed data into the mapping targets.

The Integration Service can combine data from different platforms and source types. For example, you can join data from a flat file and an Oracle source. The Integration Service can also load data to different platforms and target types. You install the Integration Service when you install PowerCenter Services. After you install the PowerCenter Services, you can use theadministration console to manage the Integration Service.

Page 33: Informatica PDF A

Powercenter - Web Services Hub

About The Web Services Hub is available in the PC domain. The Web Service Hub is a web service gateway that allows a client application to use web service standards and protocols to access PowerCenter functionality.

The Web Services Hub enables you to turn PowerCenter workflows into web services. You can manage data integration processes within the PowerCenter framework through requests to PowerCenter web services.

It processes SOAP requests from web service clients that want to access PowerCenter functionality

through web services. Web service clients access the Integration Service and Repository Service through the Web Services Hub. The Web Services Hub hosts the following web services:

Batch web services. Run and monitor web-enabled workflows.

Realtime web services. Create service workflows that allow you to read and write messages to a

web service client through the Web Services Hub. When you install Powercenter - Services, the PowerCenter installer installs the Web Services Hub. You can use the PowerCenter - Administration Console to configure and manage the Web Services Hub. The Web Services Hub also provides web service operations that allow you to:

monitor and control PowerCenter processes

get repository information. The PowerCenter installation includes the Web Services Hub. After you install PowerCenter, use the Administration Console to create, configure and manage the Web Services Hub. Configure workflows to run as web services on the Web Services Hub

Page 34: Informatica PDF A

PowerCenter - Data Analyzer

About PowerCenter Data Analyzer is a data access tool and provides a framework to perform business analytics on corporate data. With Data Analyzer, you can :

extract,

filter,

format,

and analyze corporate information from data stored in :

a data warehouse,

operational data store,

or other data storage models. Data Analyzer uses a web browser interface to view and analyze business information at any level.

Data Analyzer extracts, filters, and presents information in easy-to-understand reports. You can use

Data Analyzer to design, develop, and deploy reports and set up dashboards and alerts to provide the latest information to users at the time and in the manner most useful to them.

Data Analyzer has a repository that stores metadata to track information about enterprise metrics, reports, and report delivery. Once an administrator installs Data Analyzer, users can connect to it from any computer that has a web browser and access to the Data Analyzer host.

Data Analyzer can access information from databases, web services, or XML documents. You can set up reports to analyze information from multiple data sources. You can also set up reports to analyze

real-time data from message streams. If you have a PowerCenter data warehouse, Data Analyzer can read and import information regarding the PowerCenter data warehouse directly from the PowerCenter repository.

Data Analyzer provides a PowerCenter Integration utility that notifies Data Analyzer when a PowerCenter session completes.

You can set up reports in Data Analyzer to run when a PowerCenter session completes.

Data Analyzer reports display enterprise data from relational or XML sources as metrics and attributes.

Dashboards provide access to enterprise data.

Data Analyzer Components In Data Analyzer, you can read data from a data source, create reports, and view the results on a web

browser. Data Analyzer contains the following components:

Data Analyzer repository. The Data Analyzer repository stores the metadata about objects and

processes that it requires to handle user requests. The metadata includes information about

schemas, user profiles, personalization, reports and report delivery, and other objects and

Page 35: Informatica PDF A

processes. You can use the metadata in the repository to create reports based on schemas without

accessing the data warehouse directly. Data Analyzer connects to the repository through Java

Database Connectivity (JDBC) drivers. The Data Analyzer repository is separate from the

PowerCenter repository.

Application server. Data Analyzer uses a third-party Java Application - Application Server to

manage processes. The Java application server provides services such as database access and

server load balancing to Data Analyzer. The Java application server also provides an environment

that uses Java technology to manage application, network, and system resources.

Web server. Data Analyzer uses an HTTP server to fetch and transmit Data Analyzer pages to web

browsers.

Data source. For analytic and operational schemas, Data Analyzer reads data from a relational

database. It connects to the database through JDBC drivers. For hierarchical schemas, Data

Analyzer reads data from an XML document. The XML document may reside on a web server or be

generated by a web service operation. Data Analyzer connects to the XML document or web

service through an HTTP connection.

PowerCenter - Metadata Manager

Page 36: Informatica PDF A

About PowerCenter Metadata Manager is a metadata management tool that you can use to browse and analyze metadata from disparate metadata repositories. Metadata Manager helps you understand and manage how information and processes are derived, the fundamental relationships between them, and how they are used.

Metadata Manager uses PowerCenter - Data Analyzer functionality. You can use the embedded Data Analyzer features to design, develop, and deploy metadata reports and dashboards.

Metadata Manager uses PowerCenter - Workflows to extract metadata from source repositories and

load it into a centralized metadata warehouse called the Metadata Manager Warehouse.

Metadata Manager tools Metadata Manager provides the following tools:

Metadata Manager Console. Set up, configure, and run XConnects, which load source repository

metadata into the Metadata Manager Warehouse. Each XConnect consists of a preliminary

transformation process and PowerCenter workflows that load metadata from a particular source

repository into the Metadata Manager Warehouse. You can also use the Metadata Manager

Console to set up connections to source repositories and other Metadata Manager components.

Metadata Manager Custom Metadata Configurator. Create XConnects to load metadata from

source repositories for which Metadata Manager does not package XConnects.

Metadata Manager Interface. Browse source repository metadata and run reports to analyze the

metadata. Also, use it to configure metamodels, set up source repositories, configure the reporting

schema, and set up access and privileges for users and groups.

Metadata Manager Components Metadata Manager works within a web-based framework that requires the interaction of the following

components:

application server. Helps the Metadata Manager Server manage its processes efficiently. The Java

application server provides services such as database access and server load balancing for the

Metadata Manager Server. The Java application server also provides an environment that uses

Java technology to manage application, network, and system resources.

Metadata Manager Server. Manages the source repository metadata stored in the Metadata

Manager Warehouse. You can use Metadata Manager, which uses the Metadata Manager Server, to

search, view, and configure source repository metadata and run reports.

Page 37: Informatica PDF A

Metadata Manager Warehouse. Stores the Metadata Manager metadata, such as the Metadata

Manager reporting schema, user profiles, and reports. It also stores source repository metadata

and metamodels.

PowerCenter - Repository. Stores the workflows, which are XConnect components that extract

source metadata and load it into the Metadata Manager Warehouse.

Web server. Fetches and transmits Metadata Manager pages to web browsers. Each supported

application server contains an integrated web server.

Page 38: Informatica PDF A

PowerCenter - Repository Reports

About PowerCenter Repository Reports are a set of prepackaged PowerCenter - Data Analyzer reports and

dashboards to help you analyze and manage PowerCenter metadata. Use PowerCenter Repository Reports to browse and analyze PowerCenter metadata. PowerCenter Repository Reports provide the following types of reports to help you administer your PowerCenter environment:

Configuration Management. With Configuration Management reports, you can analyze deployment

groups and PowerCenter repository object labels.

Operations. With Operations reports, you can analyze operational statistics for workflows,

worklets, and sessions. Operational reports provide information such as connection usage, service

load by period, and workflow and session load times, completion status, and errors.

PowerCenter - Repository Objects. With PowerCenter Object reports, you can identify PowerCenter

objects, their properties, and their interdependencies with other repository objects.

Security. With the Security report, you can analyze users, groups, and their association within the

repository. You can access PowerCenter Repository Reports from the following areas in Data Analyzer:

View tab. Provides access to PowerCenter Repository Reports dashboards, which contain links to

reports.

Find tab. Provides access to the primary reports associated with an analytic workflow and to

standalone reports. To access PowerCenter - Workflows reports, run the associated primary

report, click the Workflow tab, and then navigate through the analytic workflow until you reach the

workflow report. Before you can set up PowerCenter Repository Reports, you must first install and configure PowerCenter and Data Analyzer. PowerCenter provides the source metadata that you analyze. Create

reports, analytic workflows, dashboards, schedules, and personalized alerts to analyze PowerCenter metadata in PowerCenter - Data Analyzer. PowerCenter Repository Reports use PowerCenter MX Views to access metadata.

Page 39: Informatica PDF A

PowerCenter - Service Manager

About The Service Manager is built in to the domain and supports the domain and the application services. The Service Manager performs the following functions:

Alerts. Provides notifications about domain and service events.

Authentication. Authenticates user requests from the Administration Console and from infacmd.

Authorization. Authorizes user requests for services. Requests can come from the Administration

Console or from infacmd.

Domain configuration. Manages domain configuration metadata.

Node configuration. Manages node configuration metadata.

Licensing. Registers license information and verifies license information when you run application

services.

Logging. Provides accumulated log events from each service in the domain. You can view logs in

the Administration Console and Workflow Monitor.