introduction to websphere federation · pdf fileintroduction to websphere federation server...
TRANSCRIPT
IBM DATA MANAGEMENT SOLUTIONS
2005 IBM Corporation
Introduction to WebSphere Federation Server
Sathish [email protected]
IBM DATA MANAGEMENT SOLUTIONS
2003 IBM Corporation2
WebSphere Federation Server
Federation overview
Tooling support
Performance
Use Cases
Some recent changes in V9.1
Homogeneous federation
Demo
Summary and references
IBM DATA MANAGEMENT SOLUTIONS
2003 IBM Corporation3
What is Federation?
Federation is an integration pattern that allows a collection of resources to be viewed and manipulated as if they were a single resource while retaining their autonomy and integrity. It is the technology on which EII is based.
IBM DATA MANAGEMENT SOLUTIONS
2003 IBM Corporation4
How does Federation compare to
EAI
Application
Application
Application
InterpretTransform
Route
DataSource
DataSource
extract
transform
load
Target /Data
Warehouse
ETL
Database
Database
Database
capture,apply
Replication
IBM DATA MANAGEMENT SOLUTIONS
2003 IBM Corporation5
History of data federation technologyDataJoinerHeterogeneous
RelationalData Access (R/W)
Distributed JoinsHeterogeneous
ReplicationDistributed TransactionSpatial data support
DB2 UDB V7Heterogeneous Data Access
(Read/Only)Limited Relational Data Sources
Relational ConnectNonrelational Data Sources
Life Sciences Data Connect (V7.2)
Distributed JoinsDiscoveryLink (2000)Spatial extender
DB2 UDB & II V8Heterogeneous Data Access
(Read and Write)Heterogeneous ReplicationMost Data Sources
RelationalNon-relationalLife Sciences
Custom C++ Wrapper SDKGarlic planning technologyCaching over Relational
Nicknames Enhanced Admin ToolsWeb services, MQ, etc.
DJ V11995
DJ V21997
DB2 V7.1RC
2000
DB2 V7.2LSDC2001
DB2 V8.12002
DB2 IIV8.12003
IBM DATA MANAGEMENT SOLUTIONS
2003 IBM Corporation6
IBM Information ServerDelivering information you can trust
Parallel Processing
Rich Connectivity to Applications, Data, and Content
IBM Information Server
Understand Cleanse Transform Deliver
QualityStageInformation Analyzer Federation ServerDataStage
Information Services Director
Metadata Server
IBM DATA MANAGEMENT SOLUTIONS
2003 IBM Corporation7
Data Federation Transparent Appears to be one source Independent of how and where data is
stored Applications continue to work despite of
any change in how data is storedHeterogeneous Accesses data from diverse sources Relational, Structured, XML, messages,
Web, Extensible Bring together almost any data source. Wrapper Development ToolkitHigh Function Full query support against all data Capabilities of sources as wellAutonomous Non-disruptive to data sources, existing
applications, systems.High Performance Optimization of distributed queries
IBM DATA MANAGEMENT SOLUTIONS
2003 IBM Corporation8
Relationaldatabases
Web,Non-Relational
Sources
CollaborationSystems
XMLWeb services
Packagedapplications
SQL, SQL/XML
Content Repositories
and Imaging Systems
Workflow systems
Content
Mainframefiles
Mainframedatabases
SQL
Virtualized Information AccessAccess diverse and distributed information as if it were in one system
Single sign on Unified views Common language Web services or Java API Query and update Optimized access
Classic Federation Server for z/OS
Federation Server
II Content Edition
IBM DATA MANAGEMENT SOLUTIONS
2003 IBM Corporation9
VSAMSequential
IMS AdabasCA-DatacomCA-IDMS
Federated Sources
DB2InformixOracleSybase Teradata Microsoft SQL Server ODBC
OLE DBExcelFlat filesLife sciencesCustom-built
DB2 CMFamilyDomino.doc DocumentumFileNetOpen TextStellentInterwovenHummingbird
WebSphereFileNet
Lotus NotesMicrosoft Index ServerIBM Lotus Extended SearchSametimeQuickPlaceMicrosoft Exchange
WebSphere BI Adaptors SAPPeopleSoftSiebel
Plus partner tools and custom-built connectors extend access to more sources
Content& Imaging
Workflow systems
Relationaldatabases
WebOther
XMLWeb services
Packagedapplications
Mainframefiles
Mainframedatabases
SQL
ContentSQLFederation Server
Classic Federation Server for z/OS
II Content Edition
CollaborationSystems
IBM DATA MANAGEMENT SOLUTIONS
2003 IBM Corporation10
Incorporate data sources using wrappers
Access to a particular class of data sources or protocols
Contains information about data source characteristics
High-function relational wrappers from IBM
Read/Write access
Clean, simple interface for nonrelational wrappers
Written by IBM, third parties, customers
Read Only
Data Federation Approach
Server2Server1
Nicknam
e
Nicknam
e
Nicknam
e
(OptionalLocal Data)
(remote data source 1)
(remote data source 2)
WrapperB
Server3
Nicknam
e
Wrapper A
Table
Federated views
(remote data source 3)
IBM DATA MANAGEMENT SOLUTIONS
2003 IBM Corporation11
Powerful query processing engine in federated server
Decomposes, rewrites and distributes queries
Cost-based optimizer chooses query plan with pushdown as appropriate
Query execution engine drives wrappers, combines results
Compensates for missing function in data source
Invokes functions at remote sources as needed
Data Federation ApproachN
ickname
Nicknam
e
Table
Rel. Wrapper
Client libraryNR. WrapperClient library
Local + Remote Execution Plans
Federated Server
Non-SQL
DB2 cost-based optimizer
IBM DATA MANAGEMENT SOLUTIONS
2003 IBM Corporation12
An Example of using Federated Database
Federated Database
Server
Local DataGlobal
Catalog
SQL API(JDBC/ODBC)
Wrappers
00001|SONY|Television|... 00002|RCA|VideoPlayer|.. 00004|SONY|DVDPlayer 00003|SONY|VideoRecorder.......
Database
Application
SELECT I.man, count(*)FROM transactions T,
items IWHERE I.id=T.item_id
AND I.category='Television'AND YEAR(T.tran_date)=2001
GROUP BY I.man;
SELECT tran_date, item_idFROM transactions
WHERE YEAR(tran_date)=2001
ITEMS
TRANSACTIONS
List the number of TV salesper manufacturer in 2001
IBM DATA MANAGEMENT SOLUTIONS
2003 IBM Corporation13
Other Federation Concepts Index Specification: an index catalog entry for a nickname
Type Mapping: a mapping between a data source type and a DB2 UDB data type. Defaults are stored in the wrapper libraries.
Function Mapping: a mapping between a data source function and a DB2 UDB function. Defaults are stored in the wrapper libraries.
Option: an additional attribute specific to each source to customize afederated object, e.g. wrapper, server, nickname, user, function mapping
Function Template: a virtual function definition for a data source function that cannot be executed on DB2 UDB
Passthru Session: a special mode that allows users to submit SQL statements directly to a relational data source
User Mapping plugin: user-provided logic to retrieve user mappings from an external repository, e.g. LDAP
IBM DATA MANAGEMENT SOLUTIONS
2003 IBM Corporation14
WebSphere Federation Server
Federation overview
Tooling support
Performance
Use Cases
Some recent changes in V9.1
Homogeneous federation
Demo
Summary and references
IBM DATA MANAGEMENT SOLUTIONS
2003 IBM Corporation15
Control CenterTools to configure and administer standard wrappers
Plug-in architecture allows custom wrappers to be administered
Administration Tools
IBM DATA MANAGEMENT SOLUTIONS
2003 IBM Corporation16
Tools help manage the complexity
Configuration wizard
Guides you through federation configuration process
Discovery
Server discovery: Automatically discovers and configures external servers
Nickname discovery
Deploy
Capture configuration to a script, save and deploy
Facilitates cloning system configuration for horizontal scaling...
IBM DATA MANAGEMENT SOLUTIONS
2003 IBM Corporation17
WFS tools help manage the complexity
Health Monitoring
Monitor health of servers, nicknames that affect configuration
Statistics refresh
Refresh nickname statistics on demand or by scheduled task
Snapshot Monitoring
Snapshots are useful for determining the status of a database system.
Event Monitoring
Collect information about the database and any connected applications when specified events occur.
IBM DATA MANAGEMENT SOLUTIONS
2003 IBM Corporation18
WebSphere Federation Server
Federation overview
Tooling support
Performance
Use Cases
Some recent changes in V9.1
Homogeneous federation
Demo
Summary and references
IBM DATA MANAGEMENT SOLUTIONS
2003 IBM Corporation19
What factors affect federated performance? Processing power, network bandwidth
Traffic between remote s