informatica pdf a
TRANSCRIPT
Data Integration Tools
ETL : Extraction, Transformation and Load Software
ELT : Extraction, Load and Transformation Software
Do it once at night vs. do it every time for a query
The “360 degree view of the enterprise” is a commonly discussed goal that really means data
integration.
Software Available.
Oracle Warehouse Builder
Informatica - Powercenter
Oracle Data Integrator (Sunopsis)
SQL Server Integration Services
Endeca - Information Discovery Integrator (CloverETL)
Informatica - Powercenter
PowerCenter provides an environment that allows you to load data into a centralized location, such as a data warehouse or operational data store (ODS). You can extract data from multiple sources, transform the data according to business logic you build in the client application, and load the transformed data into file and relational targets.
PowerCenter also provides the ability to view and analyze business information and browse and analyze metadata from disparate metadata repositories.
PowerCenter - Getting Started
If you want to learn PowerCenter Informatica, the best way to follow is:
First to install it
and then to follow the Getting Started Modules.
PowerCenter - Architecture/Components
PowerCenter provides an environment that allows you to load data into a centralized location, such as a data warehouse or operational data store (ODS). You can extract data from multiple sources,
transform the data according to business logic you build in the client application, and load the transformed data into file and relational targets.
PowerCenter also provides the ability to view and analyze business information and browse and analyze metadata from disparate metadata repositories.
Components
PowerCenter - Domain. The Power Center domain is the primary unit for management and administration within PowerCenter. The PowerCenter - Service Manager runs on a PowerCenter - Domain. The Service Manager supports the domain and the application services. Application services represent server-based functionality and include the Repository Service, Integration Service, Web Services Hub, and SAP BW Service.
PowerCenter - Repository. The PowerCenter repository resides in a relational database. The repository database tables contain the instructions required to extract, transform, and load data.
PowerCenter - Administration Console. The Administration Console is a web-based administration tool you can use to administer the PowerCenter domain.
PowerCenter - Client. The PowerCenter Client is used to manage users, define sources and targets, build mapping and mapplets with the transformation logic, and create workflows to run the mapping logic. The PowerCenter Client connects to the repository through the Repository Service to modify repository metadata. It connects to the integration service to start aworkflow.
PowerCenter - Repository Service. The Repository Service accepts requests from the PowerCenter Client to create and modify repository metadata and accepts requests from the Integration Service for metadata when a workflow runs.
PowerCenter - Integration Service. The Integration Service extracts data from sources and loads data to targets.
Powercenter - Web Services Hub.
Web Services Hub is a gateway that exposes PowerCenter
functionality to external clients through web services.
Powercenter - Sap BW service. The SAP BW Service extracts data from and loads data to SAP BW.
PowerCenter - Data Analyzer. Data Analyzer provides a framework to perform business analytics on corporate data. With Data Analyzer, you can extract, filter, format, and analyze corporate information from data stored
in a data warehouse, operational data store, or other data storage models.
PowerCenter - Metadata Manager. Metadata Manager is a metadata management tool that you can use to browse and analyze metadata from disparate metadata repositories. Metadata Manager helps you understand and manage how information and processes are derived, the fundamental relationships between them, and how they are used.
PowerCenter - Repository Reports. PowerCenter Repository Reports are a set of prepackaged Data Analyzer reports and dashboards to help you analyze and manage PowerCenter metadata.
PowerCenter - Installation and Configuration (9.0.1 for OEL Linux
SOFTWARE USED.
Oracle Entreprise Linux 64 bit version 5 Update 5
Oracle Database 11gR2
Power Center – Informatica 9.1
After you complete the installation and configuration process, you have to log in to the Administrator tool to create and configure the following services:
PowerCenter Repository Service
PowerCenter Integration Service
STEP 1-
Oracle Database parameter
Set the open_cursors parameter to 1000
SQL> SHOW parameter open_cursor;
NAME TYPE VALUE
------------------------------------ ----------- ------------------------------
open_cursors integer 300
SQL> ALTER system SET open_cursors=1000;
System altered.
SQL> ALTER system SET open_cursors=1000 SCOPE=BOTH;
System altered.
SQL> SHOW parameter open_cursors;
NAME TYPE VALUE
------------------------------------ ----------- ------------------------------
open_cursors integer 1000
Repository Account
At least two database user accounts are required to host this repository :
POW_REP for the repository Database
POW_DOM fro the domain Configuration Database with the CONNECT, RESOURCE, and CREATE VIEW privileges.
For Oracle, you must perform the following instruction :
CREATE USER POW_REP IDENTIFIED BY POW_REP
DEFAULT tablespace users
TEMPORARY tablespace temp
quota unlimited ON users;
GRANT CONNECT, resource, CREATE VIEW TO POW_REP;
CREATE USER POW_DOM IDENTIFIED BY POW_DOM
DEFAULT tablespace users
TEMPORARY tablespace temp
quota unlimited ON users;
GRANT CONNECT, resource, CREATE VIEW, SELECT ANY dictionary TO POW_DOM;
The grant “select dictionary privilege” to the user POW_DOM is only needed to verify the parameter Open_Cursor during the prerequisite check.
Repository Tablespace
Set the storage size for the tablespace to a small number to prevent the repository from using an excessive amount of space.
Also verify that the default tablespace for the user that owns the repository tables is set to a small size. The following example shows how to set the recommended storage parameter for a tablespace named POWERCENTER_DAC.
ALTER TABLESPACE "POWERCENTER_DAC" DEFAULT STORAGE ( INITIAL 10K NEXT 10K MAXEXTENTS UNLIMITED
PCTINCREASE 50 );
Operating System Login as root
Processor
Verify that the process match the software (x86_64)
[root@di-obia-01 ~]# uname -p
x86_64
[root@di-obia-01 ~]# uname -orv
2.6.18-194.el5 #1 SMP Mon Mar 29 22:10:29 EDT 2010 GNU/Linux
Temporary Disk Space Requirements
The installer writes temporary files to the hard disk. When the installation completes, the installer deletes the temporary files and releases the disk space.
Installer UNIX 1 GB
Informatica Services UNIX 2 GB
Verification of the temporary space with the df commando: Linux - disk free command (df)
[root@di-obia-01 ~]# df -h /tmp
Filesystem Size Used Avail Use% Mounted on
/dev/mapper/VolGroup00-LogVol00
77G 16G 58G 22% /
User Account
On UNIX, create a user account specifically to run the Informatica daemon.
[root@di-obia-01 ~]# /usr/sbin/useradd powercenter
[root@di-obia-01 ~]# id powercenter
uid=502(powercenter) gid=506(powercenter) groups=506(powercenter)
[root@di-obia-01 ~]# passwd powercenter
Changing password for user powercenter.
New UNIX password:
BAD PASSWORD: it is WAY too short
Retype new UNIX password:
passwd: all authentication tokens updated successfully.
Port
Verify that the port numbers are available on the machines where you install Informatica Services.
[root@di-obia-01 ~]# netstat -tulpn|grep 6005
The installer validates the port numbers that you specify to ensure that there will be no port conflicts in the domain.
Environment Variables
Use LANG and LC_ALL to set the UNIX code page for the installer.
File Descriptor
Informatica service processes can use a large number of files. Set the Linux - File Descriptor limit per process to 3000 or higher. In the file /etc/security/limits.conf, add this line:
# PowerCenter
powercenter hard nofile 3001
powercenter soft nofile 2500
And verify with the ulimit function logged as the powercenter system user account:
[powercenter@di-obia-01 ~]$ ulimit -Hn
3001
The limit is set to 3001 to prevent an error with the value 300 during the prerequisites check
Installation directory and Permission
As root, create the installation base directory:
mkdir -p /powercenter/9.0.1
chown -R powercenter:powercenter /powercenter
chown -R powercenter:powercenter /powercenter/9.0.1
chmod -R 775 /powercenter
chmod -R 775 /powercenter/9.0.1
As the installer (powercenter), copy the file to a tmp directory, unzip and untar it:
[powercenter@di-obia-01 powercenter901]$ unzip V26109-01.zip
Archive: V26109-01.zip
inflating: 901HF2_Server_Installer_linux-x64.tar
[powercenter@di-obia-01 powercenter901]$ tar -xf 901HF2_Server_Installer_linux-x64.tar
Change the permission of the file install.sh to execute
chmod -R 744 /tmp/powercenter901/install.sh
Server
Pre-Installation System Check To verify whether a machine meets the system requirements for an Informatica installation, run the Pre-Installation System Check Tool (i9Pi) before you start the installation process.
You can find the i9Pi tool in the following directory: <InstallerExtractDirectory>/i9Pi
[powercenter@di-obia-01 i9Pi]$ ./i9Pi.bin
Preparing to install...
Extracting the JRE from the installer archive...
Unpacking the JRE...
Extracting the installation resources from the installer archive...
Configuring the installer for this system's environment...
Launching installer...
Enter the port 6004 to prevent a problem during the ckeck on the port 2010. The tool normally must verify until the port 6009 but check also the port 6010 which is the standard port for X Forwarding
A pdf is available in the tool directory if you want more information. The report files are stored in the directory where you extract the Pre-Installation System Check tool
files:
HTML format: <i9Pi Directory>/i9Pi/i9Pi_summary.html
Text format: <i9Pi Directory>/i9Pi_summary.log i9Pi_summary.log
Output - Informatica9 Installation Location:/
Informatica starting port number:6004
Database Type:ORACLE
Database Hostname:di-rep-db
Database Port number:1521
Database Service name:direp.hotitem.local
Database User ID:POW_DOM
PASSED: OS Version Check: RHL Version Identified is 2.6.18-194.el5
PASSED: Diskspace: Available DiskSpace is 55105 MB
PASSED: Processors: Available Processors is 2
PASSED: Physical Memory: Available RAM Size is 4242 MB
PASSED: Temporary Space: Available temporary diskspace: 55105 MB. Sufficient for installation
PASSED: Socket Port: Socket port Range 6004-6009
PASSED: LANG Environment Variable: LANG Set to en_US.UTF-8 and LC_ALL Set to null
PASSED: JRE_HOME Environment Variable: JRE_HOME is not set
PASSED: File Descriptors : File descriptors limit is 3001
PASSED: Oracle Open Cursors: Open cursors in Oracle database is set to 1000Sufficient for install
PASSED : Resource privilege : The user account has the RESOURCE privilege.
PASSED: Create Table check : Successfully created a database table.
PASSED: Create View : Successfully created a database view.
Initial Server Software Installation This sequence of installation don't say that you have to hit the next button and does not take care of review and informative windows.
[powercenter@di-obia-01 /]$ unset JRE_HOME
[powercenter@di-obia-01 /]$ /tmp/powercenter901/install.sh
OS detected is Linux
unjar task is in progress.............
You can wait a long time on this step ….
unjar of ESD completed.....
Do you want to continue installation (y/n) ?
y
Starting installation ...
\***************************************************************************
\* Welcome to the Informatica 9.0.1 HotFix 2 Server Installer. *
\***************************************************************************
To verify whether a machine meets the system requirements for an Informatica installation, run
the
Pre-Installation System Check Tool (i9Pi) before you start the installation process. You can
find the i9Pi
tool in the following directory: <InstallerExtractDirectory>/i9Pi
Before you continue, read the 9.0.1 HotFix 2 Installation Guide and Release Notes.
You can find the 9.0.1 HotFix 2 documentation in the Documentation Center at
http://my.informatica.com
Configure the LANG and LC_ALL variables to generate appropriate code pages and
create and connect to repositories and Repository Services.
Do you want to continue? (Y/N)Y
Installer requires operating system Linux version 2.6 and later.
Current operating system Linux version 2.6.
Current operating system meets minimum requirements.
Select a choice :
1. Install Informatica 9.0.1 with Hot Fix 2 or Upgrade from a previous version of Informatica
to
Informatica 9.0.1 with Hot Fix 2
2. Apply Informatica 9.0.1 Hot Fix 2 to existing 9.0.1 install.
Enter the choice(1 or 2):1
-----------------------------------------------------------
Checking for existing 9.0.1 HotFix 2 product installation.
Select (G)UI mode (needs X Window support) or (C)onsole mode (G/C):G
Launching installer in GUI mode ...
Preparing to install...
Extracting the JRE from the installer archive...
Unpacking the JRE...
Extracting the installation resources from the installer archive...
Configuring the installer for this system's environment...
Launching installer...
Preparing SILENT Mode Installation...
===============================================================================
Informatica 9.0.1 Services HotFix2 (created with InstallAnywhere)
-------------------------------------------------------------------------------
Informatica 9.0.1 HotFix 2 Steps:
Installation Type: Install (and not upgrade)
Pre-requisites: Next
License and Directory. (Transfer the license file from the client zip file)
Pre-Installation Summary: Install (Required Disk Space 2815 MB)
Domain Creation: Create a domain
Configuration Database
with as JDBC parameters:
MaxPooledStatements=20;CatalogOptions=0;BatchPerformanceWorkaround=true
Domain Configuration
Port Configuration
Post-Installation Summary
Installation Status:: SUCCESS
The Informatica 9.0.1 HotFix 2 installation is complete.
For more information, see the debug log file:
/powercenter/Informatica/9.0.1/Informatica_9.0.1_Services_HotFix2.log
Informatica Administrator Home Page http://di-obia-01.hotitem.local:6007
Product Name : Informatica 9.0.1 HotFix 2
Installation Type : Fresh Installation
Installation Directory : /powercenter/Informatica/9.0.1
Login to the Administration console with the Administrator user (Administration/Administrator)
Repository Configuration
Start Informatica services. Start > All Programs > Informatica PowerCenter 8.6.1 > Services >
Start Informatica Services.
Launch the Administration console. Start > All Programs > Informatica PowerCenter 8.6.1 >
Services > PowerCenter Administration Console
Login with the account given in the configuration (admin/admin for this article).
Select configuration assistant (if you don't see it, you can also configure a repository with the
configuration tool on the left side)
Welcome > Next
Enter a name for the repository, click on the configure button and fill in the fields. You must enter
a connect identifier and not a connect string. The connect string for Oracle contains the
credentials.
If the creation of the repository hang, try to give the DBA privileges to the oracle database account
POW_REP
Integration service: fill a name in : INT_OBIA7961 and click next.
Character Data for Integration service: (Note: Character Encoding give the choice
between ASCII and Unicode)
Finish ! A global administrator (for the domain and the repository) is automatically created during the installation with the credentials Administrator/Administrator. You may change the password for security purpose
Client Installation Steps:
Double click on the file install.bat located in this directory : File1\pc861_client_win32_x86\
Choose Language. Next
Choose Install Option. Next
Prerequisites. Next
Installation directory For the path, you can leave the default value because it doesn't contain
space C:\Informatica\PowerCenter8.6.1
Pre-installation summary. Next
Post-installation summary. Next
Installation Complete. Done Hot Fix 10 installation:
file2>pc861_HotFix10_Client\install.bat
Select Language. Ok
Welcome. Next. Stop PowerCenter. Ok
Installation Prerequisites. Next
Installation Type: Apply. Next
Installation Directory. Next
Pre-installation summary. Next
Post-installation summary. Next
Installation Complete. Done It will install :
PowerCenter - Client
PowerCenter - Designer
PowerCenter - Repository Manager
PowerCenter - Workflow Manager
PowerCenter - Workflow Monitor
PowerCenter Mapping Architect for Visio (Data Stencil?)
Custom Metadata Configurator
Components
1. DOMAINS The Power Center domain is the primary logical unit for management and administration within PowerCenter. The service manager runs on a PowerCenter domain. The Service Manager supports :
the domain
and the application services.
PowerCenter has a service-oriented architecture that provides the ability to scale services and share resources across multiple machines. PowerCenter provides the PowerCenter domain to support the administration of the PowerCenter services.
where:
gateway host and gateway port are the basis for the administration console url A domain can contain multiple repositories:
What is a domain ? A domain is the primary unit for management and administration of services in PowerCenter.
A domain contains the following components:
One or more nodes. A node is the logical representation of a machine in a domain. A domain may
contain more than one node. The node that hosts the domain is the master gateway for the
domain. You can add other machines as nodes in the domain and configure the nodes to run
application services, such as the Integration Service or Repository Service. All service requests
from other nodes in the domain go through the master gateway. A nodes runs service processes, which is the runtime representation of an application service running
on a node.
Service Manager. The Service Manager is built in to the domain to support the domain and
the application services. The Service Manager runs on each node in the domain. The Service
Manager starts and runs the application services on a machine.
Application services. A group of services that represent PowerCenter server-based functionality.
The application services that run on each node in the domain depend on the way you configure the
node and the application service. You can use the PowerCenter - Administration Console to manage the domain. If you have the high availability option, you can scale services and eliminate single points of failure for services. The Service Manager and application services can continue running despite temporary network or hardware failures. High availability includes resilience, failover, and recovery for services
and tasks in a domain.
This Figure shows a sample domain with three nodes:
This domain has a master gateway on Node 1.
Node 2 runs an Integration Service
and Node 3 runs the Repository Service.
INFA_POWERCENTER_DOMAIN environment variable Before using the command line programs, you can configure the environment
variable INFA_DEFAULT_DOMAIN to set the default domain.
PowerCenter - Repository
About The PowerCenter repository resides in a relational database. The repository database tables contain the instructions required to extract, transform, and load data and store administrative information such as :
user names,
passwords,
permissions,
and privileges. For Oracle, when you create the database user, you create it with the following instruction :
CREATE USER POW_REP IDENTIFIED BY POW_REP
DEFAULT tablespace users
TEMPORARY tablespace temp
quota unlimited ON users;
GRANT CONNECT, resource, CREATE VIEW TO POW_REP;
Administration PowerCenter applications access the repository through the PowerCenter - Repository Service. You administer the repository using :
the PowerCenter - Repository Manager client tool,
the PowerCenter - Administration Console,
and command line programs.
Metadata You can view repository metadata in the PowerCenter - Repository Manager. The Informatica
Metadata Exchange (MX) provides a set of relational views that allow easy SQL access to the PowerCenter metadata repository. The views are created during the installation of the upgrade of the repository with the help of mxbld file query (for instance for oracle oramxbld.sql). You find then this view in the PowerCenter Repository schema and they begin with REP.
Global and local repositories You can develop global and local repositories to share metadata:
Global repository. The global repository is the hub of the repository domain. Use the global
repository to store common objects that multiple developers can use through shortcuts. These
objects may include operational or Application source definitions, reusable transformations,
mapplets, and mappings.
Local repositories. A local repository is any repository within the domain that is not the global
repository. Use local repositories for development. From a local repository, you can create
shortcuts to objects in shared folders in the global repository. These objects include source
definitions, common dimensions and lookups, and enterprise standard transformations. You can
also create copies of objects in non-shared folders.
PowerCenter version control PowerCenter supports versioned repositories. A versioned repository can store multiple versions of an object. PowerCenter version control allows you to efficiently develop, test, and deploy metadata into production.
PowerCenter - Administration Console
About The Administration Console is a web application that you use to manage a domain and security (user, …). If you have a user login to the domain, you can access the Administration Console. Domain objects include:
services,
nodes,
and licenses.
Use the Administration Console to perform the following tasks in the domain:
Manage application services. Manage all application services in the domain, such as the
Integration Service and Repository Service.
Configure nodes. Configure node properties, such as the backup directory and resources. You can
also shut down and restart nodes.
Manage domain objects. Create and manage objects such as services, nodes, licenses, and
folders. Folders allow you to organize domain objects and to manage security by setting
permissions for domain objects.
View and edit domain object properties. You can view and edit properties for all objects in the
domain, including the domain object.
View log events. Use the Log Viewer to view domain, Integration Service, SAP BW Service, Web
Services Hub, and Repository Service log events. Other domain management tasks include applying licenses, managing grids and resources, and configuring security.
Default URL: http://localhost:6001/adminconsole
PowerCenter - Client
About The PowerCenter Client consists of the following applications that you use to manage the repository, design mappings, mapplets, and createsessions to load the data:
PowerCenter - Designer. Use the Designer to create mappings that contain transformation
instructions for the Integration Service. For more information about the Designer, see
“PowerCenter Designer” on page 10.
PowerCenter - Data Stencil. Use the Data Stencil to create mapping template that can be used to
generate multiple mappings.
PowerCenter - Repository Manager. Use the Repository Manager to create repository users and
groups, assign privileges and permissions, and manage folders and locks.
PowerCenter - Workflow Manager. Use the Workflow Manager to create, schedule, and run
workflows. A workflow is a set of instructions that describes how and when to run tasks related to
extracting, transforming, and loading data.
PowerCenter - Workflow Monitor. Use the Workflow Monitor to monitor scheduled and running
workflows for each Integration Service. Install the client tools on a Microsoft Windows machine.
PowerCenter - Repository Service
About The Repository Service manages connections to the PowerCenter - Repository from PowerCenter - Client applications. The Repository Service is a separate, multi-threaded process that retrieves, inserts, and updates metadata in the repository database tables. The Repository Service ensures the consistency of metadata in the repository. The Repository Service accepts connection requests from the following PowerCenter applications:
PowerCenter - Client. Use the Designer and Workflow Manager to create and store mapping
metadata and connection object information in the repository. Use the Workflow Monitor to
retrieve workflow run status information and session logs written by the Integration Service. Use
the Repository Manager to organize and secure metadata by creating folders, users, and groups.
command_line programs. Use command line programs to perform repository metadata
administration tasks and service-related functions.
PowerCenter - Integration Service. When you start the Integration Service, it connects to the
repository to schedule workflows. When you run a workflow, the Integration Service retrieves
workflow task and mapping metadata from the repository. The Integration Service writes workflow
status to the repository.
Powercenter - Web Services Hub. When you start the Web Services Hub, it connects to the
repository to access web-enabled workflows. The Web Services Hub retrieves workflow task and
mapping metadata from the repository and writes workflow status to the repository.
Powercenter - Sap BW service. Listens for RFC requests from SAP NetWeaver BW and initiates
workflows to extract from or load to SAP BW. You install the Repository Service when you install Powercenter - Services. After you install the
PowerCenter Services, you can use thePowerCenter - Administration Console to manage the Repository Service.
PowerCenter - Integration Service
The Integration Service reads workflow information from the repository. The Integration Service
connects to the repository through the repository service to fetch metadata from the repository. A workflow is a set of instructions that describes how and when to run tasks related to extracting, transforming, and loading data. The Integration Service runs workflow tasks. A session is a type of workflow task. A session is a set of instructions that describes how to move data from sources to targets using a mapping.
It extracts data from the mapping sources and stores the data in memory while it applies the transformation rules that you configure in the mapping.
The Integration Service loads the transformed data into the mapping targets.
The Integration Service can combine data from different platforms and source types. For example, you can join data from a flat file and an Oracle source. The Integration Service can also load data to different platforms and target types. You install the Integration Service when you install PowerCenter Services. After you install the PowerCenter Services, you can use theadministration console to manage the Integration Service.
Powercenter - Web Services Hub
About The Web Services Hub is available in the PC domain. The Web Service Hub is a web service gateway that allows a client application to use web service standards and protocols to access PowerCenter functionality.
The Web Services Hub enables you to turn PowerCenter workflows into web services. You can manage data integration processes within the PowerCenter framework through requests to PowerCenter web services.
It processes SOAP requests from web service clients that want to access PowerCenter functionality
through web services. Web service clients access the Integration Service and Repository Service through the Web Services Hub. The Web Services Hub hosts the following web services:
Batch web services. Run and monitor web-enabled workflows.
Realtime web services. Create service workflows that allow you to read and write messages to a
web service client through the Web Services Hub. When you install Powercenter - Services, the PowerCenter installer installs the Web Services Hub. You can use the PowerCenter - Administration Console to configure and manage the Web Services Hub. The Web Services Hub also provides web service operations that allow you to:
monitor and control PowerCenter processes
get repository information. The PowerCenter installation includes the Web Services Hub. After you install PowerCenter, use the Administration Console to create, configure and manage the Web Services Hub. Configure workflows to run as web services on the Web Services Hub
PowerCenter - Data Analyzer
About PowerCenter Data Analyzer is a data access tool and provides a framework to perform business analytics on corporate data. With Data Analyzer, you can :
extract,
filter,
format,
and analyze corporate information from data stored in :
a data warehouse,
operational data store,
or other data storage models. Data Analyzer uses a web browser interface to view and analyze business information at any level.
Data Analyzer extracts, filters, and presents information in easy-to-understand reports. You can use
Data Analyzer to design, develop, and deploy reports and set up dashboards and alerts to provide the latest information to users at the time and in the manner most useful to them.
Data Analyzer has a repository that stores metadata to track information about enterprise metrics, reports, and report delivery. Once an administrator installs Data Analyzer, users can connect to it from any computer that has a web browser and access to the Data Analyzer host.
Data Analyzer can access information from databases, web services, or XML documents. You can set up reports to analyze information from multiple data sources. You can also set up reports to analyze
real-time data from message streams. If you have a PowerCenter data warehouse, Data Analyzer can read and import information regarding the PowerCenter data warehouse directly from the PowerCenter repository.
Data Analyzer provides a PowerCenter Integration utility that notifies Data Analyzer when a PowerCenter session completes.
You can set up reports in Data Analyzer to run when a PowerCenter session completes.
Data Analyzer reports display enterprise data from relational or XML sources as metrics and attributes.
Dashboards provide access to enterprise data.
Data Analyzer Components In Data Analyzer, you can read data from a data source, create reports, and view the results on a web
browser. Data Analyzer contains the following components:
Data Analyzer repository. The Data Analyzer repository stores the metadata about objects and
processes that it requires to handle user requests. The metadata includes information about
schemas, user profiles, personalization, reports and report delivery, and other objects and
processes. You can use the metadata in the repository to create reports based on schemas without
accessing the data warehouse directly. Data Analyzer connects to the repository through Java
Database Connectivity (JDBC) drivers. The Data Analyzer repository is separate from the
PowerCenter repository.
Application server. Data Analyzer uses a third-party Java Application - Application Server to
manage processes. The Java application server provides services such as database access and
server load balancing to Data Analyzer. The Java application server also provides an environment
that uses Java technology to manage application, network, and system resources.
Web server. Data Analyzer uses an HTTP server to fetch and transmit Data Analyzer pages to web
browsers.
Data source. For analytic and operational schemas, Data Analyzer reads data from a relational
database. It connects to the database through JDBC drivers. For hierarchical schemas, Data
Analyzer reads data from an XML document. The XML document may reside on a web server or be
generated by a web service operation. Data Analyzer connects to the XML document or web
service through an HTTP connection.
PowerCenter - Metadata Manager
About PowerCenter Metadata Manager is a metadata management tool that you can use to browse and analyze metadata from disparate metadata repositories. Metadata Manager helps you understand and manage how information and processes are derived, the fundamental relationships between them, and how they are used.
Metadata Manager uses PowerCenter - Data Analyzer functionality. You can use the embedded Data Analyzer features to design, develop, and deploy metadata reports and dashboards.
Metadata Manager uses PowerCenter - Workflows to extract metadata from source repositories and
load it into a centralized metadata warehouse called the Metadata Manager Warehouse.
Metadata Manager tools Metadata Manager provides the following tools:
Metadata Manager Console. Set up, configure, and run XConnects, which load source repository
metadata into the Metadata Manager Warehouse. Each XConnect consists of a preliminary
transformation process and PowerCenter workflows that load metadata from a particular source
repository into the Metadata Manager Warehouse. You can also use the Metadata Manager
Console to set up connections to source repositories and other Metadata Manager components.
Metadata Manager Custom Metadata Configurator. Create XConnects to load metadata from
source repositories for which Metadata Manager does not package XConnects.
Metadata Manager Interface. Browse source repository metadata and run reports to analyze the
metadata. Also, use it to configure metamodels, set up source repositories, configure the reporting
schema, and set up access and privileges for users and groups.
Metadata Manager Components Metadata Manager works within a web-based framework that requires the interaction of the following
components:
application server. Helps the Metadata Manager Server manage its processes efficiently. The Java
application server provides services such as database access and server load balancing for the
Metadata Manager Server. The Java application server also provides an environment that uses
Java technology to manage application, network, and system resources.
Metadata Manager Server. Manages the source repository metadata stored in the Metadata
Manager Warehouse. You can use Metadata Manager, which uses the Metadata Manager Server, to
search, view, and configure source repository metadata and run reports.
Metadata Manager Warehouse. Stores the Metadata Manager metadata, such as the Metadata
Manager reporting schema, user profiles, and reports. It also stores source repository metadata
and metamodels.
PowerCenter - Repository. Stores the workflows, which are XConnect components that extract
source metadata and load it into the Metadata Manager Warehouse.
Web server. Fetches and transmits Metadata Manager pages to web browsers. Each supported
application server contains an integrated web server.
PowerCenter - Repository Reports
About PowerCenter Repository Reports are a set of prepackaged PowerCenter - Data Analyzer reports and
dashboards to help you analyze and manage PowerCenter metadata. Use PowerCenter Repository Reports to browse and analyze PowerCenter metadata. PowerCenter Repository Reports provide the following types of reports to help you administer your PowerCenter environment:
Configuration Management. With Configuration Management reports, you can analyze deployment
groups and PowerCenter repository object labels.
Operations. With Operations reports, you can analyze operational statistics for workflows,
worklets, and sessions. Operational reports provide information such as connection usage, service
load by period, and workflow and session load times, completion status, and errors.
PowerCenter - Repository Objects. With PowerCenter Object reports, you can identify PowerCenter
objects, their properties, and their interdependencies with other repository objects.
Security. With the Security report, you can analyze users, groups, and their association within the
repository. You can access PowerCenter Repository Reports from the following areas in Data Analyzer:
View tab. Provides access to PowerCenter Repository Reports dashboards, which contain links to
reports.
Find tab. Provides access to the primary reports associated with an analytic workflow and to
standalone reports. To access PowerCenter - Workflows reports, run the associated primary
report, click the Workflow tab, and then navigate through the analytic workflow until you reach the
workflow report. Before you can set up PowerCenter Repository Reports, you must first install and configure PowerCenter and Data Analyzer. PowerCenter provides the source metadata that you analyze. Create
reports, analytic workflows, dashboards, schedules, and personalized alerts to analyze PowerCenter metadata in PowerCenter - Data Analyzer. PowerCenter Repository Reports use PowerCenter MX Views to access metadata.
PowerCenter - Service Manager
About The Service Manager is built in to the domain and supports the domain and the application services. The Service Manager performs the following functions:
Alerts. Provides notifications about domain and service events.
Authentication. Authenticates user requests from the Administration Console and from infacmd.
Authorization. Authorizes user requests for services. Requests can come from the Administration
Console or from infacmd.
Domain configuration. Manages domain configuration metadata.
Node configuration. Manages node configuration metadata.
Licensing. Registers license information and verifies license information when you run application
services.
Logging. Provides accumulated log events from each service in the domain. You can view logs in
the Administration Console and Workflow Monitor.