Oracle Data Integrator
Posted on 08-Dec-2015
Oracle Data Integrator: A Success Story
Jason Jones Key Performance Ideas, Inc.
About Jason Jones
• Oracle Essbase Certified Developer – experience since 2005
• Oracle Data Integrator since 2009
• Extensive experience in developing, tuning, and enhancing Essbase, Hyperion Planning, and ODI
• Programming expertise:
  ● Developing software in Java
  ● Mobile solutions for the iOS platform (Objective-C)
  ● Relational databases (SQL Server, Oracle)
Agenda
• Company Situation
• What is Oracle Data Integrator?
• Problems & Pain Points
• Solutions & Opportunities
• ODI Functionality & Benefits
• Lessons Learned
• How to Get Started
• Q&A
Company Situation
• Large health services company
• Significant Oracle database implementation
• In process of implementing Essbase
• Transitioning to new database servers
• Data movement processes have grown organically over the years
What is Oracle Data Integrator?
• History
  ● Originally developed by Sunopsis, acquired by Oracle in 2006
  ● Originated Extract-Load-Transform (E-LT) as an alternative to ETL
  ● Data movement between heterogeneous data sources
• Future
  ● Strategic product for data integration
  ● Significant development resources
ETL vs. E-LT
[Diagram: Traditional Architecture (Source → ETL Server → Target) vs. E-LT Architecture (Source → Target, with transformation performed on the target server)]
Problems & Pain Points
• SQL procedures being used as de facto ETL solution
• Time-consuming to develop
• Hard to maintain
• Little error visibility
• Difficult to integrate with other data sources
• Have to maintain in multiple environments
• Manual logging facility
• Lacking documentation/comments
• Difficult to troubleshoot
Solutions & Opportunities
• Easier development and maintenance
• Work with different relational database technologies
• Load to/from text files, Essbase/Planning
• Email alerts
• Easily switch between physical environments
• Process logging, easily pinpoint errors
Typical Data Flow
[Diagram: ODI Server manages data movement from OLTP (Oracle) to an Operational Data Store (Oracle) and on to Essbase]
Knowledge Modules
• RKM (Reverse-engineering): import existing table/data structure
• LKM (Loading): load data from tables
• IKM (Integration): insert/update data in target table
• JKM (Journalizing): process new rows of data only
• CKM (Check): validate data being moved
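The SQL-to-SQL walkthrough at the end of this deck shows the actual code these modules generate. As a rough sketch only (the schema, table, and column names below are illustrative, not taken from the walkthrough), an LKM stages source rows into a C$ work table and an IKM applies the I$ flow table to the target:

```sql
-- Illustrative sketch of the C$/I$ naming convention used by ODI KMs;
-- in practice this DDL/DML is generated by the selected Knowledge Modules.

-- LKM: stage source rows on (or near) the target server
create table ODI_STAGE.C$_0DIM_EXAMPLE as
  select SRC.EXAMPLE_ID   C1_EXAMPLE_ID,
         SRC.EXAMPLE_NAME C2_EXAMPLE_NAME
  from   SOURCE_SCHEMA.EXAMPLE SRC;

-- IKM (incremental update strategy): merge the flow table into the target
merge into TARGET_SCHEMA.DIM_EXAMPLE T
using ODI_STAGE.I$_DIM_EXAMPLE S
  on (T.EXAMPLE_ID = S.EXAMPLE_ID)
when matched then
  update set T.EXAMPLE_NAME = S.EXAMPLE_NAME
when not matched then
  insert (T.EXAMPLE_ID, T.EXAMPLE_NAME)
  values (S.EXAMPLE_ID, S.EXAMPLE_NAME);
```

The point of the KM abstraction is that this code is never written by hand; choosing a different IKM swaps the merge strategy without touching the interface definition.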
Models
• Create models for relational tables, text files, and Essbase dimensions to load data to and from
• Easily create models automatically based on existing tables with RKM
• Serve as the source and target of interfaces
Enhanced Functionality
• Enforce data integrity
• Exist as logical units irrespective of physical environment
Benefits
• Define requirements for data entering target tables
• Reuse for multiple interfaces
Health Services Organization Value Achieved
Model Datastore
Interfaces
• Moves data between models
• Source is typically multiple models (tables)
• Destination is always one and only one model (table)
• Declarative!
Enhanced Functionality
• Easy to create, modify, and maintain
• Work across different technologies
• Document business rules
• Leverage power of existing database server to transform data
• “Auto mapping” speeds development time
Benefits
• Visual replacement for many SQL procedures
• Quickly define criteria for updating tables
• Choose location of data transformation
Health Services Organization Value Achieved
Interface Screen
Interface Flow
• Validate data being processed
• Automatically recycle data
• Choose loading/integrating strategy
Enhanced Functionality
• Easily configure complex actions that would otherwise require large amounts of hand-written code
• Select Knowledge Modules (strategy used to load, integrate, validate)
Benefits
• Turn on data validation and error recycling with a mouse click
Health Services Organization Value Achieved
Interface Flow Screen
Procedures
• Useful for performing one-off actions that can’t be put into an interface
• e.g., calling a stored procedure
Enhanced Functionality
• Perform action that doesn’t fit into an interface
Benefits
• Reference existing data transformation process without having to rebuild it
Health Services Organization Value Achieved
Interface Overview
Procedure Steps
Procedure Step Definition
Packages
• Chain together multiple interfaces, procedures, and other steps
• Allow for error control flow
Enhanced Functionality
• Gracefully handle errors
• Restart process automatically, if desired
• Send an email alert!
Benefits
• Treat execution of multiple interfaces as one job
• Add email step in case of error (or success)
Health Services Organization Value Achieved
Package Flow
Scenarios
• “Freezes” an interface, procedure, or package in place
• Changes can be made to the procedure/interface/package without affecting existing functionality
• Scenarios are the unit of work that can be scheduled and called from the command line
Enhanced Functionality
• Avoid breaking existing processes when the need arises to change or augment interface functionality
Benefits
• Call ODI functionality from the command line when needed
• Deploy functionality without being afraid to make changes to it
Health Services Organization Value Achieved
Generated Scenarios
Operator
• Easily view the status of all jobs
• Scenarios, packages, interfaces, procedures, load plans
• Replaced the need for manual logging statements
Enhanced Functionality
• See exactly where and why a process failed
Benefits
• Insight into currently executing and already executed jobs
• Drill down to the exact cause of an error
Health Services Organization Value Achieved
Operator Overview
Operator Step Detail
Operator Step Generated Code
Scheduler
• More robust than the Windows Task Scheduler
• Easy to set a schedule for jobs to run
• Jobs can be called from the command line, but use the scheduler when possible
Enhanced Functionality
• Easier to use than setting up Windows Task Scheduler to run a batch file to run a scenario…
Benefits
• Directly schedule an ODI job to run without having to set up a batch file
• Run ODI jobs without needing an additional deployment step
Health Services Organization Value Achieved
Scheduler
Journalized Data
• Pattern for consuming only updated/inserted rows of data
• Easy to implement
• Single checkbox in an interface enables journalized data
Enhanced Functionality
• Avoid processing all data, or blocks of data by day
• Avoid maintaining timing variables
Benefits
• Get away from the “day of data” processing paradigm
• Move data from one system to another more often
Health Services Organization Value Achieved
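The walkthrough at the end of this deck shows the journalized pattern as ODI generates it. Stripped down to its core (using the same J$/JV$ objects and subscriber name that appear later, purely to illustrate; the real generated code adds locking, views, and flow tables), it is a subscriber-filtered consume/read/purge cycle:

```sql
-- Minimal sketch of the journal (CDC) pattern shown in the walkthrough.
-- J$PETS holds changed rows; each consumer filters on its subscriber name.

-- 1. Mark unconsumed journal rows as being consumed by this subscriber
update ZODI.J$PETS
   set JRN_CONSUMED = '1'
 where JRN_SUBSCRIBER = 'LAB_RESULTS_TRANSFER_PETS';

-- 2. Read only the changed rows through the journalizing view
select PETS.PET_ID
  from ZODI.JV$PETS PETS
 where JRN_SUBSCRIBER = 'LAB_RESULTS_TRANSFER_PETS';

-- 3. After a successful load, purge the consumed journal entries
delete from ZODI.J$PETS
 where JRN_CONSUMED = '1'
   and JRN_SUBSCRIBER = 'LAB_RESULTS_TRANSFER_PETS';
```

Because consumption is tracked per subscriber, several downstream processes can each consume the same journal at their own pace.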
Journals
Journalized Data in Interface
Topologies
• Logical “Customer” system
• Physical Customer systems in Development, QA, and Production
Enhanced Functionality
• Use same logical job for both development and production environments
Benefits
• Save significant effort by not having to copy/deploy code that differs only by environment
• Pick and choose test/production systems
Health Services Organization Value Achieved
Lessons Learned
• Critical to leverage someone who has done this before to lay the foundation and set the critical first steps up correctly
• Start with a simple ETL job and build a roadmap to larger data movement needs
• The first step/phase can be large; subsequent jobs are much easier
• Don’t re-implement existing functionality; build idiomatic ODI jobs
• Always try to use an iterative development model
• Reusability, consistency, maintainability
• Implementation duration
• What to look for
How To Get Started
• ODI is now Oracle’s standard data movement tool
• Serious thought must be given to implementing the platform – the initial step is typically an architectural discussion
• Utilize experts to help build a roadmap and identify new idiomatic ODI functionality
• Initial functionality development
  ● Software install, physical topologies, logical topologies, models reverse engineered
  ● Interfaces, procedures, packages, scenarios, solutions
WALKTHROUGH OF SQL-TO-SQL INTERFACE STEPS
Drop Work Table
drop table ZODI.C$_0DIM_PETS
Lock Journalized Table
update ZODI.J$PETS
   set JRN_CONSUMED = '1'
 where (1=1)
   and JRN_SUBSCRIBER = 'LAB_RESULTS_TRANSFER_PETS'
Create View/Table on Source
create or replace view ZODI.C$_0DIM_PETS
(
  C1_PET_ID, …
)
as select * from (
  select
    PETS.PET_ID C1_PET_ID, …
  from ZODI.JV$PETS PETS
  where (1=1)
    and JRN_SUBSCRIBER = 'LAB_RESULTS_TRANSFER_PETS'
)
Drop Synonym on Target
drop synonym ESSBASE.C$_0DIM_PETS
Create Synonym on Target
create synonym ESSBASE.C$_0DIM_PETS for ZODI.C$_0DIM_PETS@ZOASIS
Drop Flow Table
drop table ESSBASE.I$_DIM_PETS
Create Flow Table
create table ESSBASE.I$_DIM_PETS
(
  PET_ID     NUMBER(38)   NULL,
  PET_NAME   VARCHAR2(32) NULL,
  …
  JRN_FLAG   VARCHAR2(1)  NULL,
  JRN_DATE   DATE         NULL,
  DIM_PET_ID NUMBER       NULL,
  IND_UPDATE CHAR(1)
)
NOLOGGING
Insert Flow Into Table
insert /*+ APPEND */ into ESSBASE.I$_DIM_PETS
  ( PET_ID, …, IND_UPDATE )
select
  C1_PET_ID, …
from ESSBASE.C$_0DIM_PETS
where (1=1)
Analyze Integration Table
begin
  dbms_stats.gather_table_stats(
    ownname          => 'ESSBASE',
    tabname          => 'I$_DIM_PETS',
    estimate_percent => dbms_stats.auto_sample_size
  );
end;
Synchronize Deletions from Journal Table
delete from ZSTAGING.DIM_PETS
where exists (
  select 'X'
  from ESSBASE.I$_DIM_PETS I
  where ZSTAGING.DIM_PETS.PET_ID = I.PET_ID
    and IND_UPDATE = 'D'
)
Create Index on Flow Table
create index ESSBASE.I$_DIM_PETS_IDX on ESSBASE.I$_DIM_PETS (PET_ID) NOLOGGING
Merge Rows
merge into ZSTAGING.DIM_PETS T
using ESSBASE.I$_DIM_PETS S
on ( T.PET_ID = S.PET_ID )
when matched then update set
  T.PET_NAME = S.PET_NAME, …
when not matched then insert
  ( T.PET_ID, …
Commit Transaction
/*commit*/
Drop Flow Table
drop table ESSBASE.I$_DIM_PETS
Cleanup Journalized Table
delete from ZODI.J$PETS
where JRN_CONSUMED = '1'
  and JRN_SUBSCRIBER = 'LAB_RESULTS_TRANSFER_PETS'
  /* and JRN_DATE < sysdate */
Drop Synonym on Target
drop synonym ESSBASE.C$_0DIM_PETS
Drop View/Table on Source
drop view ZODI.C$_0DIM_PETS
Jason Jones
Direct: 206.427.1373
Email: jjones@KeyPerformanceIdeas.com
Thank You!