7249305-all-domains-real-project-explanation-imp.doc

Upload: bada-sainath

Post on 02-Nov-2015

6 views

Category:

Documents


0 download

TRANSCRIPT

All Domains Real Project Explanation IMP

All Domains Real Project Explanation IMP........Project Explanation:Im giving generic explanation of the project. Any project either banking or sales or insurance can use this explanation.

First u have to start with 1) U have to first explain about objective of the project and what is client expectations2) u have to start where ur involvement and responsibility of ur job and limitations of job.Add some points from post Project Architecture reply like offshore and onsite model and team structure.. etc.,

Main objective of this project is we are providing a system with all the information regarding Sales / Transactions (sales if sales domain / transactions if bank domain or insurance domain) of entire organizations all over the country US / UK ( based on the client location US/UK/..). we will get the daily transaction data from all branches at the end of the day. We have to validate the transactions and implement the business logic based on the transactions type or transaction code. We have to load all historical data into dwh and once finished historical data. We have to load Delta Loads. Delta load means last 24 hrs transactions captured from the source system. In other words u can call it as Change Data Capture (CDC). This Delta loads are scheduled daily basis. Pick some points from What is Target Staging Area Post.. Source to Staging mappings, staging to warehousing.. based on ur comfort level..

Each transaction contains Transaction code.. based on the transaction code u can identify wheather that transaction belongs to sales, purchase / car insurance, health insurance, / deposit , loan, payment ( u have to change the words based on the project..) etc., based on that code business logic will be change.. we validate and calculate the measure and load to database.

Aug 7One Mapping explanation :In Informatica mapping, we first lookup all the transaction codes with code master table to identify the transaction type to implement the correct logic and filter the unnecessary transactions.. because in an organization there are lot of transactions will be there but u have to consider only required transactions for ur project.. the transaction code exists in the code master table are only transactions u have to consider and other transactions load into one table called Wrap table and invalid records( transaction code missing, null, spaces) to error table. For each dimension table we are creating surrogate key and load into dwh tables.

SCD2 Mapping:We are implementing SCD2 mapping for customer dimension or account dimension to keep history of the accounts or customers. We are using SCD2 Date method.before telling this u should know it clearly abt this SCD2 method..careful abt it..

Responsibilities.. pick from Project architecture Post and tell according ur comfortable level.. we are responsible for only development and testing and scheduling we are using third party tools..( Control M, AutoSys, Job Tracker, Tivoli or etc..) we simply give the dependencies between each mapping and run time. Based on that Information scheduling tool team will schedule the mappings. We wont schedule in Informatica .. thats it Finished

Please Let me know if u required more explanation regarding any point reply

We have to start with .Our projects are mainly onsite and offshore model projects.In this project we have one staging area in between source to target databases. In some project they wont use staging areas. Staging area simplify the process..

Architecture Production( Testing ( Development ( Design( Requirement Gathering(Analysis

Analysis and Requirement Gathering: Output :Analysis Doc, Subject Area100% in onsite,Business Analyst, project manager. Gather the useful information for the DSS and indentifying the subject areas, identify the schema objects and all..Design: Output: Technical Design Docs, HLD, UTPETL Lead, BA and Data Architect80% onsite .( Schema design in Erwin and implement in database and preparing the technical design document for ETL. 20% offshore: HLD & UTPBased on the Technical specs.. developers has to create the HLD(high level design) it will have he Informatica flow chart. What are the transformation required for that mapping. In some companies they wont have HLD. Directly form technical specs they will create mappings. HLD will cover only 75% of requirement.UTP Unit Test Plan.. write the test cases based on the requirement. Both positive and negative test cases.

Development : output : Bugs free code, UTR, Integration Test PlanETL Team and offshore BA100% offshore Based on the HLD. U have to create the mappings. After that code review and code standard review will be done by another team member. Based on the review comments u have to updated the mapping. Unit testing based on the UTP. U have to fill the UTP and Enter the expected values and name it as UTR (Unit Test Results). 2 times code review and 2 times unit testing will be conducted in this phase. Migrating to testing repositoryIntegration test plan has to prepare by the senior people.

to be continued.......

Pavan

Aug 2(5 days ago)

Testing : Output: ITR, UAT, Deployment Doc and User GuideTesting Team, Business Analyst and Client.80% offshore Based on the integration test plan testing the application and gives the bugs list to the developer. Developers will fix the bugs in the development repository and again migrated to testing repository. Again testing starts till the bugs free code. 20% OnsiteUAT User Accept Testing. Client will do the UAT.. this is last phase of the etl project. If client satisfy with the product .. next deployment in production environment.

Production:50% offshore 50% onsiteWork will be distributed between offshore and onsite based on the run time of the application. Mapping Bugs needs to fix by Development team. Development team will support for warranty period of 90 days or based on agreement days..

In ETL projects Three Repositorys. For each repository access permissions and location will be different.Development : E1Testing : E2Prduction : E3

forgot to mentioned above

In Everyphase

i given the Out come of the phase as OUTPUT after thatRoles involved percentage of involvment..

If you have any doubts and needs more explanation let me know... kamal

Aug 3(4 days ago)

i'm impressed with the job u have done but 1 Q?u have discussed abt the development and testing of a DWH?

suppose in the middle of a project if the client needs to change some part of the design phase what kind of impact analysis is done on the source schema and the target schema?

waiting 4 detailed explanation?

suppose in the middle of a project if the client needs to change some part of the design phase.. PM , BA, ETL Lead will analyze what is the impact on existing code,Informatica mappings impact, schema changes. based on the impact analysis they will give the estimated hours to complete the task and billing amount separately..

Once client approve on estimation, again work will starts from Design schema till production....

chakravarthy

Aug 6(1 day ago)

pavan,can u tell me fact & dimension tables names of ur project or any other project.so that we can create our own facts & dimension names. ITS ME..

Aug 6(1 day ago)

Excellent work..@pavan,its great to have many useful tips from u yaar,keep up thegood work,thanks again for ur pain In my project ... we won't call as dimensions and facts.. we willl called as standard tables( Dimensions) and relationship tables( facts tables)..

dimensions are Account, Customer, Involve party, Distributor, etc.,.Facts are relation between all those tables. ACCT_X_CUST, IP_X_ACCT, IP_X_IP

Its is not mandatory that ur project had dimensions and fact tables.. because now a days informatica is using for automations of any kind of process.. simply using for some validations, converting from flat file to xml .. etc.,, because if clients are purchasing informatica software.. they are trying to utilize the tool for another projects also..

In interviews u people can tel that my project is not Datawarehousig project.. we used to automate some process of the client.. my requirement is to validate the data and populate the target with valid and useful data to the clients...

i hope it will help in interviews.... if required anything else.. let me know..