02 datastage overview
TRANSCRIPT
Slide 1
Session 2:Datastage overview
<Enter Project Name> Slide 2Slide 2
Enterprise Data IntegrationDescribe DataStageHistory of DatastageIdentify the server and client components of
DataStageDescribe DataStage projectsDescribe DataStage jobsIdentify the steps for designing a DataStage job
Objectives
<Enter Project Name> Slide 3Slide 3
Enterprise Data-Integration
<Enter Project Name> Slide 4Slide 4
With DataStage you can:• Design jobs that extract, integrate, aggregate,
transform data and load into a target• Create, manage, and reuse metadata• Validate, Run, monitor, and schedule jobs• Manage your development environment
DataStage
<Enter Project Name> Slide 5Slide 5
History Of DataStage
DataStage was started in 1997 by company called V-Mark.
Later was taken over by Ardent , which in turn was taken over by Informix.
Informix was acquired by IBM and Ascential was spun as a different company.
Ascential acquired Torrent Systems for $46 million, a developer of parallel-processing infrastructure software for building highly scalable data warehouses.
Current release is DataStage 7.5 from Ascential Software which includes the parallel processing capabilities in addition to its erstwhile server processing.
Ascential Software is now acquired by IBM.
<Enter Project Name> Slide 6Slide 6
DataStage Application Components
M i c r o s o f t ® W i n d o w s N T o r U N I X
S e r v e r R e p o s i t o r y
D e s i g n e r D i r e c t o rR e p o s i t o r yM a n a g e r
O r a c l eS y b a s eI n f o r m i xU n i V e r s eA p p l i c a t i o n s
O r a c l eS Q L S e r v e rR e d B r i c kS y b a s eI n f o r m i xU n i V e r s e
S o u r c eD a t a
T a r g e tD a t a
M i c r o s o f t ® W i n d o w s 9 5
A d m i n i s t r a t o r
E x t r a c t C l e a n s e T r a n s f o r m I n t e g r a t eE x t r a c t C l e a n s e T r a n s f o r m I n t e g r a t e
<Enter Project Name> Slide 7Slide 7
Most DataStage configuration tasks are carried out using the DataStage Administrator, a client program provided with DataStage.
Changing License Details.DataStage Project Administration :
• Add new DataStage projects• Delete projects• Move projects
Add Environment variables
DataStage Administrator
<Enter Project Name> Slide 8Slide 8
DataStage Administrator
Setting up DataStage users
Cleaning up project files
Purging job log files
Setting the timeout interval on server computer
Tracing server activity
Adding entries to the tools menu
Setting job parameter defaultsIssuing Datastage engine commands from the administrator client
<Enter Project Name> Slide 9Slide 9
DataStage Director
Validate jobsRun jobsMonitor jobsSchedule jobsGather statistics
<Enter Project Name> Slide 10Slide 10
DataStage Designer
<Enter Project Name> Slide 11Slide 11
` DataStage Manager
Store metadataReuse metadataDefine routines
<Enter Project Name> Slide 12Slide 12
Define project properties: AdministratorOpen projectDesign jobs: Designer
• Import metadata: Manager• Define extractions, data flows, integrations• Define transformations, constraints,
aggregations• Define loads
Compile and debug jobs: DesignerRun and monitor jobs: Director
Development in DataStage
<Enter Project Name> Slide 13Slide 13
DataStage Projects
Created during installation
Associated with a directory
Attach the users to the projects and assign roles
Self-contained
Multiple users can be working at the same time