Informatica (Version 9.6.1) Exception Management Guide

  • Informatica (Version 9.6.1)

    Exception Management Guide

  • Informatica Exception Management Guide

    Version 9.6.1

    June 2014

    Copyright (c) 1998-2014 Informatica Corporation. All rights reserved.

    This software and documentation contain proprietary information of Informatica Corporation and are provided under a license agreement containing restrictions on use and disclosure and are also protected by copyright law. Reverse engineering of the software is prohibited. No part of this document may be reproduced or transmitted in any form, by any means (electronic, photocopying, recording or otherwise) without prior consent of Informatica Corporation. This Software may be protected by U.S. and/or international Patents and other Patents Pending.

    Use, duplication, or disclosure of the Software by the U.S. Government is subject to the restrictions set forth in the applicable software license agreement and as provided in DFARS 227.7202-1(a) and 227.7702-3(a) (1995), DFARS 252.227-7013(c)(1)(ii) (OCT 1988), FAR 12.212(a) (1995), FAR 52.227-19, or FAR 52.227-14 (ALT III), as applicable.

    The information in this product or documentation is subject to change without notice. If you find any problems in this product or documentation, please report them to us in writing.

    Informatica, Informatica Platform, Informatica Data Services, PowerCenter, PowerCenterRT, PowerCenter Connect, PowerCenter Data Analyzer, PowerExchange, PowerMart, Metadata Manager, Informatica Data Quality, Informatica Data Explorer, Informatica B2B Data Transformation, Informatica B2B Data Exchange, Informatica On Demand, Informatica Identity Resolution, Informatica Application Information Lifecycle Management, Informatica Complex Event Processing, Ultra Messaging and Informatica Master Data Management are trademarks or registered trademarks of Informatica Corporation in the United States and in jurisdictions throughout the world. All other company and product names may be trade names or trademarks of their respective owners.

    Portions of this software and/or documentation are subject to copyright held by third parties, including without limitation: Copyright DataDirect Technologies. All rights reserved. Copyright © Sun Microsystems. All rights reserved. Copyright © RSA Security Inc. All Rights Reserved. Copyright © Ordinal Technology Corp. All rights reserved.Copyright © Aandacht c.v. All rights reserved. Copyright Genivia, Inc. All rights reserved. Copyright Isomorphic Software. All rights reserved. Copyright © Meta Integration Technology, Inc. All rights reserved. Copyright © Intalio. All rights reserved. Copyright © Oracle. All rights reserved. Copyright © Adobe Systems Incorporated. All rights reserved. Copyright © DataArt, Inc. All rights reserved. Copyright © ComponentSource. All rights reserved. Copyright © Microsoft Corporation. All rights reserved. Copyright © Rogue Wave Software, Inc. All rights reserved. Copyright © Teradata Corporation. All rights reserved. Copyright © Yahoo! Inc. All rights reserved. Copyright © Glyph & Cog, LLC. All rights reserved. Copyright © Thinkmap, Inc. All rights reserved. Copyright © Clearpace Software Limited. All rights reserved. Copyright © Information Builders, Inc. All rights reserved. Copyright © OSS Nokalva, Inc. All rights reserved. Copyright Edifecs, Inc. All rights reserved. Copyright Cleo Communications, Inc. All rights reserved. Copyright © International Organization for Standardization 1986. All rights reserved. Copyright © ej-technologies GmbH. All rights reserved. Copyright © Jaspersoft Corporation. All rights reserved. Copyright © is International Business Machines Corporation. All rights reserved. Copyright © yWorks GmbH. All rights reserved. Copyright © Lucent Technologies. All rights reserved. Copyright (c) University of Toronto. All rights reserved. Copyright © Daniel Veillard. All rights reserved. Copyright © Unicode, Inc. Copyright IBM Corp. All rights reserved. Copyright © MicroQuill Software Publishing, Inc. 
All rights reserved. Copyright © PassMark Software Pty Ltd. All rights reserved. Copyright © LogiXML, Inc. All rights reserved. Copyright © 2003-2010 Lorenzi Davide, All rights reserved. Copyright © Red Hat, Inc. All rights reserved. Copyright © The Board of Trustees of the Leland Stanford Junior University. All rights reserved. Copyright © EMC Corporation. All rights reserved. Copyright © Flexera Software. All rights reserved. Copyright © Jinfonet Software. All rights reserved. Copyright © Apple Inc. All rights reserved. Copyright © Telerik Inc. All rights reserved. Copyright © BEA Systems. All rights reserved. Copyright © PDFlib GmbH. All rights reserved. Copyright © Orientation in Objects GmbH. All rights reserved. Copyright © Tanuki Software, Ltd. All rights reserved. Copyright © Ricebridge. All rights reserved. Copyright © Sencha, Inc. All rights reserved.

    This product includes software developed by the Apache Software Foundation (http://www.apache.org/), and/or other software which is licensed under various versions of the Apache License (the "License"). You may obtain a copy of these Licenses at http://www.apache.org/licenses/. Unless required by applicable law or agreed to in writing, software distributed under these Licenses is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the Licenses for the specific language governing permissions and limitations under the Licenses.

    This product includes software which was developed by Mozilla (http://www.mozilla.org/), software copyright The JBoss Group, LLC, all rights reserved; software copyright © 1999-2006 by Bruno Lowagie and Paulo Soares and other software which is licensed under various versions of the GNU Lesser General Public License Agreement, which may be found at http://www.gnu.org/licenses/lgpl.html. The materials are provided free of charge by Informatica, "as-is", without warranty of any kind, either express or implied, including but not limited to the implied warranties of merchantability and fitness for a particular purpose.

    The product includes ACE(TM) and TAO(TM) software copyrighted by Douglas C. Schmidt and his research group at Washington University, University of California, Irvine, and Vanderbilt University, Copyright (©) 1993-2006, all rights reserved.

    This product includes software developed by the OpenSSL Project for use in the OpenSSL Toolkit (copyright The OpenSSL Project. All Rights Reserved) and redistribution of this software is subject to terms available at http://www.openssl.org and http://www.openssl.org/source/license.html.

    This product includes Curl software which is Copyright 1996-2013, Daniel Stenberg, . All Rights Reserved. Permissions and limitations regarding this software are subject to terms available at http://curl.haxx.se/docs/copyright.html. Permission to use, copy, modify, and distribute this software for any purpose with or without fee is hereby granted, provided that the above copyright notice and this permission notice appear in all copies.

    The product includes software copyright 2001-2005 (©) MetaStuff, Ltd. All Rights Reserved. Permissions and limitations regarding this software are subject to terms available at http://www.dom4j.org/license.html.

    The product includes software copyright © 2004-2007, The Dojo Foundation. All Rights Reserved. Permissions and limitations regarding this software are subject to terms available at http://dojotoolkit.org/license.

    This product includes ICU software which is copyright International Business Machines Corporation and others. All rights reserved. Permissions and limitations regarding this software are subject to terms available at http://source.icu-project.org/repos/icu/icu/trunk/license.html.

    This product includes software copyright © 1996-2006 Per Bothner. All rights reserved. Your right to use such materials is set forth in the license which may be found at http://www.gnu.org/software/kawa/Software-License.html.

    This product includes OSSP UUID software which is Copyright © 2002 Ralf S. Engelschall, Copyright © 2002 The OSSP Project Copyright © 2002 Cable & Wireless Deutschland. Permissions and limitations regarding this software are subject to terms available at http://www.opensource.org/licenses/mit-license.php.

    This product includes software developed by Boost (http://www.boost.org/) or under the Boost software license. Permissions and limitations regarding this software are subject to terms available at http://www.boost.org/LICENSE_1_0.txt.

    This product includes software copyright © 1997-2007 University of Cambridge. Permissions and limitations regarding this software are subject to terms available at http://www.pcre.org/license.txt.

    This product includes software copyright © 2007 The Eclipse Foundation. All Rights Reserved. Permissions and limitations regarding this software are subject to terms available at http://www.eclipse.org/org/documents/epl-v10.php and at http://www.eclipse.org/org/documents/edl-v10.php.

    This product includes software licensed under the terms at http://www.tcl.tk/software/tcltk/license.html, http://www.bosrup.com/web/overlib/?License, http://www.stlport.org/doc/ license.html, http:// asm.ow2.org/license.html, http://www.cryptix.org/LICENSE.TXT, http://hsqldb.org/web/hsqlLicense.html, http://httpunit.sourceforge.net/doc/ license.html, http://jung.sourceforge.net/license.txt , http://www.gzip.org/zlib/zlib_license.html, http://www.openldap.org/software/release/

  • license.html, http://www.libssh2.org, http://slf4j.org/license.html, http://www.sente.ch/software/OpenSourceLicense.html, http://fusesource.com/downloads/license-agreements/fuse-message-broker-v-5-3- license-agreement; http://antlr.org/license.html; http://aopalliance.sourceforge.net/; http://www.bouncycastle.org/licence.html; http://www.jgraph.com/jgraphdownload.html; http://www.jcraft.com/jsch/LICENSE.txt; http://jotm.objectweb.org/bsd_license.html; . http://www.w3.org/Consortium/Legal/2002/copyright-software-20021231; http://www.slf4j.org/license.html; http://nanoxml.sourceforge.net/orig/copyright.html; http://www.json.org/license.html; http://forge.ow2.org/projects/javaservice/, http://www.postgresql.org/about/licence.html, http://www.sqlite.org/copyright.html, http://www.tcl.tk/software/tcltk/license.html, http://www.jaxen.org/faq.html, http://www.jdom.org/docs/faq.html, http://www.slf4j.org/license.html; http://www.iodbc.org/dataspace/iodbc/wiki/iODBC/License; http://www.keplerproject.org/md5/license.html; http://www.toedter.com/en/jcalendar/license.html; http://www.edankert.com/bounce/index.html; http://www.net-snmp.org/about/license.html; http://www.openmdx.org/#FAQ; http://www.php.net/license/3_01.txt; http://srp.stanford.edu/license.txt; http://www.schneier.com/blowfish.html; http://www.jmock.org/license.html; http://xsom.java.net; http://benalman.com/about/license/; https://github.com/CreateJS/EaselJS/blob/master/src/easeljs/display/Bitmap.js; http://www.h2database.com/html/license.html#summary; http://jsoncpp.sourceforge.net/LICENSE; http://jdbc.postgresql.org/license.html; http://protobuf.googlecode.com/svn/trunk/src/google/protobuf/descriptor.proto; https://github.com/rantav/hector/blob/master/LICENSE; http://web.mit.edu/Kerberos/krb5-current/doc/mitK5license.html; http://jibx.sourceforge.net/jibx-license.html; and https://github.com/lyokato/libgeohash/blob/master/LICENSE.

    This product includes software licensed under the Academic Free License (http://www.opensource.org/licenses/afl-3.0.php), the Common Development and Distribution License (http://www.opensource.org/licenses/cddl1.php), the Common Public License (http://www.opensource.org/licenses/cpl1.0.php), the Sun Binary Code License Agreement Supplemental License Terms, the BSD License (http://www.opensource.org/licenses/bsd-license.php), the new BSD License (http://opensource.org/licenses/BSD-3-Clause), the MIT License (http://www.opensource.org/licenses/mit-license.php), the Artistic License (http://www.opensource.org/licenses/artistic-license-1.0) and the Initial Developer’s Public License Version 1.0 (http://www.firebirdsql.org/en/initial-developer-s-public-license-version-1-0/).

    This product includes software copyright © 2003-2006 Joe Walnes, 2006-2007 XStream Committers. All rights reserved. Permissions and limitations regarding this software are subject to terms available at http://xstream.codehaus.org/license.html. This product includes software developed by the Indiana University Extreme! Lab. For further information please visit http://www.extreme.indiana.edu/.

    This product includes software Copyright (c) 2013 Frank Balluffi and Markus Moeller. All rights reserved. Permissions and limitations regarding this software are subject to terms of the MIT license.

    This Software is protected by U.S. Patent Numbers 5,794,246; 6,014,670; 6,016,501; 6,029,178; 6,032,158; 6,035,307; 6,044,374; 6,092,086; 6,208,990; 6,339,775; 6,640,226; 6,789,096; 6,823,373; 6,850,947; 6,895,471; 7,117,215; 7,162,643; 7,243,110; 7,254,590; 7,281,001; 7,421,458; 7,496,588; 7,523,121; 7,584,422; 7,676,516; 7,720,842; 7,721,270; 7,774,791; 8,065,266; 8,150,803; 8,166,048; 8,166,071; 8,200,622; 8,224,873; 8,271,477; 8,327,419; 8,386,435; 8,392,460; 8,453,159; 8,458,230; and RE44,478, International Patents and other Patents Pending.

    DISCLAIMER: Informatica Corporation provides this documentation "as is" without warranty of any kind, either express or implied, including, but not limited to, the implied warranties of noninfringement, merchantability, or use for a particular purpose. Informatica Corporation does not warrant that this software or documentation is error free. The information provided in this software or documentation may include technical inaccuracies or typographical errors. The information in this software and documentation is subject to change at any time without notice.

    NOTICES

    This Informatica product (the "Software") includes certain drivers (the "DataDirect Drivers") from DataDirect Technologies, an operating company of Progress Software Corporation ("DataDirect") which are subject to the following terms and conditions:

    1. THE DATADIRECT DRIVERS ARE PROVIDED "AS IS" WITHOUT WARRANTY OF ANY KIND, EITHER EXPRESSED OR IMPLIED, INCLUDING BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NON-INFRINGEMENT.

    2. IN NO EVENT WILL DATADIRECT OR ITS THIRD PARTY SUPPLIERS BE LIABLE TO THE END-USER CUSTOMER FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, CONSEQUENTIAL OR OTHER DAMAGES ARISING OUT OF THE USE OF THE ODBC DRIVERS, WHETHER OR NOT INFORMED OF THE POSSIBILITIES OF DAMAGES IN ADVANCE. THESE LIMITATIONS APPLY TO ALL CAUSES OF ACTION, INCLUDING, WITHOUT LIMITATION, BREACH OF CONTRACT, BREACH OF WARRANTY, NEGLIGENCE, STRICT LIABILITY, MISREPRESENTATION AND OTHER TORTS.

    Part Number: DQ-EXC-USG-96000-0001

  • Table of Contents

    Preface
        Informatica Resources
            Informatica My Support Portal
            Informatica Documentation
            Informatica Web Site
            Informatica How-To Library
            Informatica Knowledge Base
            Informatica Support YouTube Channel
            Informatica Marketplace
            Informatica Velocity
            Informatica Global Customer Support

    Chapter 1: Introduction to Exception Management
        Exception Management Overview
        Tasks and Workflows
        Human Task Example
        Exception Management Workspaces
            Start Workspace
            Exceptions Workspace
        Logging In to the Analyst Tool

    Chapter 2: Tasks
        Tasks Overview
        Task Types
        Users and Roles
        My Tasks Panel
            Task Columns
            Task Ownership
        Task Maintenance
            Viewing Tasks
            Opening a Task
            Release a Task
            Complete a Task
        Task Administration
            Assigning a Task to a User
            Viewing All the Task Instances in a Human Task
            Multiple Task Completion
        Task Export
            Export Data from a Correct Exceptions Task
            Export Data from a Correct Duplicates Task
            Exporting Task Data

    Chapter 3: Exception Records
        Exception Records Overview
        Data Editing Panel for Exception Records
            Configuring the Columns to Display
            Filter Exceptions
        Correct Exceptions
            Edit Exception Records
            Exception Record Status
            Task Actions
        Review Exceptions
            Reviewing Exception Records

    Chapter 4: Duplicate Records
        Duplicate Records Overview
        Data Editing Panel for Duplicate Records
        Correct Duplicates
            Editing a Cluster
            Creating a Cluster
            Finding Duplicate Records in Multiple Clusters
            Setting the Status of a Cluster
            Adding Comments to a Cluster
        Duplicate Record Task Filters
            Filtering Clusters in the Data Editing Panel
        Review Duplicates Task
            Reviewing Duplicate Records

    Chapter 5: Audit Trail Operations
        Audit Trail Operations Overview
        Audit Trail Data
        Data Audit Panel Filters
            Status Options
            Filtering Records in the Data Audit Panel

    Index

  • Preface

    The Informatica Exception Management Guide describes how to use Exception Management in the Analyst tool.

    Exception Management is an Analyst tool feature that you can use to view and update data quality exception records in a Human task. Exceptions are records that might contain bad data or duplicate data. Use Exception Management to resolve data errors and to consolidate clusters of duplicate records into a single record.

    Informatica Resources

    Informatica My Support Portal

    As an Informatica customer, you can access the Informatica My Support Portal at http://mysupport.informatica.com.

    The site contains product information, user group information, newsletters, access to the Informatica customer support case management system (ATLAS), the Informatica How-To Library, the Informatica Knowledge Base, Informatica Product Documentation, and access to the Informatica user community.

    Informatica Documentation

    The Informatica Documentation team makes every effort to create accurate, usable documentation. If you have questions, comments, or ideas about this documentation, contact the Informatica Documentation team through email at [email protected]. We will use your feedback to improve our documentation. Let us know if we can contact you regarding your comments.

    The Documentation team updates documentation as needed. To get the latest documentation for your product, navigate to Product Documentation from http://mysupport.informatica.com.

    Informatica Web Site

    You can access the Informatica corporate web site at http://www.informatica.com. The site contains information about Informatica, its background, upcoming events, and sales offices. You will also find product and partner information. The services area of the site includes important information about technical support, training and education, and implementation services.

  • Informatica How-To Library

    As an Informatica customer, you can access the Informatica How-To Library at http://mysupport.informatica.com. The How-To Library is a collection of resources to help you learn more about Informatica products and features. It includes articles and interactive demonstrations that provide solutions to common problems, compare features and behaviors, and guide you through performing specific real-world tasks.

    Informatica Knowledge Base

    As an Informatica customer, you can access the Informatica Knowledge Base at http://mysupport.informatica.com. Use the Knowledge Base to search for documented solutions to known technical issues about Informatica products. You can also find answers to frequently asked questions, technical white papers, and technical tips. If you have questions, comments, or ideas about the Knowledge Base, contact the Informatica Knowledge Base team through email at [email protected].

    Informatica Support YouTube Channel

    You can access the Informatica Support YouTube channel at http://www.youtube.com/user/INFASupport. The Informatica Support YouTube channel includes videos about solutions that guide you through performing specific tasks. If you have questions, comments, or ideas about the Informatica Support YouTube channel, contact the Support YouTube team through email at [email protected] or send a tweet to @INFASupport.

    Informatica Marketplace

    The Informatica Marketplace is a forum where developers and partners can share solutions that augment, extend, or enhance data integration implementations. By leveraging any of the hundreds of solutions available on the Marketplace, you can improve your productivity and speed up time to implementation on your projects. You can access Informatica Marketplace at http://www.informaticamarketplace.com.

    Informatica Velocity

    You can access Informatica Velocity at http://mysupport.informatica.com. Developed from the real-world experience of hundreds of data management projects, Informatica Velocity represents the collective knowledge of our consultants who have worked with organizations from around the world to plan, develop, deploy, and maintain successful data management solutions. If you have questions, comments, or ideas about Informatica Velocity, contact Informatica Professional Services at [email protected].

    Informatica Global Customer Support

    You can contact a Customer Support Center by telephone or through Online Support.

    Online Support requires a user name and password. You can request a user name and password at http://mysupport.informatica.com.

    The telephone numbers for Informatica Global Customer Support are available from the Informatica web site at http://www.informatica.com/us/services-and-training/support-services/global-support-centers/.

  • CHAPTER 1

    Introduction to Exception Management

    This chapter includes the following topics:

    • Exception Management Overview

    • Tasks and Workflows

    • Human Task Example

    • Exception Management Workspaces

    • Logging In to the Analyst Tool

    Exception Management Overview

    You can review exceptions and then correct the exceptions in the Analyst tool. Exceptions are records in a data set that contain unresolved data quality issues. The records might contain errors, or the records might contain duplicate information.

    A task identifies a set of records and the operations that you can perform to resolve the data quality issues in the exception records. A task step represents the type of work that a user performs on the records in the task instance.

    When you log in to the Analyst tool, you can view a list of your tasks in the My Tasks panel. The My Tasks panel appears in the Start workspace. The My Tasks panel is your Inbox for tasks that you receive.

    Select a task, and work on the records that it contains. After you view or edit a record in a task, you can update the status of the record to indicate whether to store the record in the database. When you complete work on a task, the Analyst tool removes the task from your Inbox.

    Tasks and Workflows

    A workflow is a business process that consists of a set of events, tasks, and decisions. A task is a workflow object that runs a single unit of work in the workflow, such as running a mapping, sending an email, or running a shell command.

    Developers create workflows in Informatica Developer. Informatica stores a workflow as an object in a database called the Model repository. When a workflow runs, it creates the task instances that you resolve in the Analyst tool.

    A workflow can contain different types of tasks. At the workflow level, the type of task that you perform in the Analyst tool is a Human task, because the task needs human interaction to complete.

    A workflow that contains a Human task must also contain a Mapping task. A Mapping task runs an Informatica data process called a mapping. A mapping applies data quality algorithms to records and corrects errors where possible. When a workflow runs a Mapping task, the mapping writes the corrected records to a database table. The mapping writes records that contain unresolved data quality issues to another table. The Human task reads the table of unresolved records and assigns the records to users for manual review in the Analyst tool.
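    The split between corrected and unresolved records can be pictured with a small Python sketch. This is an illustration only and is not Informatica code: the names `run_mapping`, `MappingResult`, and `is_valid_email`, and the email rule itself, are assumptions made for the example. Two lists stand in for the two database tables.

```python
from dataclasses import dataclass, field


@dataclass
class MappingResult:
    """Stands in for the two tables that the mapping writes to."""
    corrected: list = field(default_factory=list)
    unresolved: list = field(default_factory=list)


def is_valid_email(value):
    """Hypothetical data quality rule: one '@' and a dot in the domain part."""
    return (
        isinstance(value, str)
        and value.count("@") == 1
        and "." in value.split("@")[1]
    )


def run_mapping(records):
    """Correct what can be corrected, then route each record to one list."""
    result = MappingResult()
    for record in records:
        email = record.get("email")
        # Automatic correction where possible: trim spaces and lowercase.
        cleaned = email.strip().lower() if isinstance(email, str) else email
        fixed = dict(record, email=cleaned)
        if is_valid_email(cleaned):
            result.corrected.append(fixed)
        else:
            # These records would be passed to the Human task for manual review.
            result.unresolved.append(fixed)
    return result
```

    In this sketch, a record with the email " Ann@Example.com " is corrected and lands in `corrected`, while a record with a missing or malformed email lands in `unresolved`.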

    Human Task Example

    You are part of a team of data stewards at a retail organization. Your role is to maintain the data quality of a set of customer account records. You are concerned that the records in the data set contain errors.

    A workflow developer on your team creates mappings in the Developer tool to evaluate the accuracy of the customer account data . The developer creates a Mapping task in a workflow to validate and correct customer data. The mapping cannot verify about 10% of the customer records. A user must examine the records and correct them manually. The mapping passes the unresolved records to a Human task.

    The data stewards group and the Accounting department need to review and correct the data. The Human task has two steps to correct exceptions: DS-Correct Exceptions, and Acctg-Correct Exceptions. The data stewards receive the DS-Correct Exceptions task first.

    When you open the Analyst tool, you see a task to correct record exceptions for records 1-50. Another data steward in your department receives a task to review records 51-100. The Analyst tool displays red icons next to the fields that contain problem data. You can edit the records and change the fields. You determine that you cannot fix some records. You decide that an Accounting department user can review the records that you cannot fix. You add comments to these records. The comments appear in the audit trail.

    After you review a record, you set the record status. The record status determines how the workflow processes the records after the Human task completes. When you determine that a record is correct, you choose to accept the record. When you cannot determine how to edit the record, you choose to reprocess the record. If the record does not belong in the customer database, you reject the record.

    When you finish viewing and editing the records, you complete the task. When you complete the task, you pass the records to the DS-Review Exceptions step. Your manager owns the DS-Review Exceptions step. Your manager reviews your work before passing the exception records to the Accounting group.

    Exception Management Workspaces

    The Analyst tool has workspaces that you use to perform tasks in the tool. The workspaces that you can access depend on the licenses that your organization purchases.

    You can access workspaces from the workspace tabs that appear across the Analyst tool header. You can also access workspaces from access panels on the Start workspace.

    The Analyst tool has temporary workspaces that you open from links in the Analyst tool. By default, temporary workspaces do not appear as tabs in the Analyst tool header. When you open a temporary workspace, the temporary workspace tab appears in the Analyst tool header. When you close a temporary workspace, its tab disappears from the Analyst tool header.

    When you perform Exception Management, your tasks appear in the My Tasks panel of the Start workspace. When you open a task, the Analyst tool opens a temporary workspace called the Exceptions workspace. When you complete work on the task, you can close the Exceptions workspace.

    Start Workspace

    The Start workspace appears by default when you open the Analyst tool. The Start workspace contains multiple panels that provide links to other Analyst tool workspaces.

    The My Tasks panel appears in the Start workspace when your organization has a license for Human tasks. The My Tasks panel shows your tasks. You can open a task, view a task, or release a task from the My Tasks panel.

    Exceptions Workspace

    Review and edit exception records for a task instance in the Exceptions workspace. You can also view an audit trail of changes that you make to records in a task.

    The Exceptions workspace is a temporary workspace that appears when you open a task in the My Tasks panel. The Exceptions workspace contains a Data Editing panel and a Data Audit panel.

    You can perform correct exceptions tasks and correct duplicate records tasks in the Data Editing panel. The Data Editing panel displays different columns based on the task type. Review each record in a task, change each record that you can fix, and then update the record status to indicate if the changes are complete.

    You can view an audit trail of the changes that you made to records in a task. Open the Data Audit panel to view the audit trail. You can view the fields that changed, the user that changed the record, and the date of the change.

    Logging In to the Analyst Tool

    Use the Analyst tool URL to log in to the Analyst tool interface. When you log in to the Analyst tool, you must specify the user name, password, and the native domain or the LDAP security domain.

    1. Start a Microsoft Internet Explorer or Google Chrome browser.

    2. In the Address field, enter the URL for the Analyst tool:

    http[s]://:/analyst

    3. If the domain uses LDAP or native authentication, enter your user name and password on the login page.

    4. Select Native or the name of a specific security domain.

    The Security Domain field appears when the Informatica domain uses LDAP or Kerberos authentication. If you do not know the security domain that your user account belongs to, contact the Informatica domain administrator.

    5. Click Log In.

    The Analyst tool opens on the Start workspace.

    CHAPTER 2

    Tasks

    This chapter includes the following topics:

    • Tasks Overview

    • Task Types

    • Users and Roles

    • My Tasks Panel

    • Task Maintenance

    • Task Administration

    • Task Export

    Tasks Overview

    A workflow creates one or more instances of a Human task. Each task instance contains a different set of records that a user needs to review for data quality. Multiple users can collaborate in the Human task to review a large data set.
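    As a rough illustration of how a large record set can be divided among task instances, the following Python sketch splits row numbers into fixed-size ranges, similar to the records 1-50 and 51-100 split in the Human task example. The `split_into_ranges` function and the fixed batch size are assumptions. This guide does not describe how a Human task actually distributes records.

```python
def split_into_ranges(total_rows: int, batch_size: int) -> list[tuple[int, int]]:
    """Return inclusive (first_row, last_row) ranges that cover rows 1..total_rows."""
    if total_rows < 0 or batch_size <= 0:
        raise ValueError("total_rows must be >= 0 and batch_size must be > 0")
    return [
        (start, min(start + batch_size - 1, total_rows))
        for start in range(1, total_rows + 1, batch_size)
    ]


# Hypothetical data set of 120 exception records, 50 records per task instance.
ranges = split_into_ranges(total_rows=120, batch_size=50)
```

    Each range in the result would correspond to one task instance that a different user can work on.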

    A Human task can contain multiple steps. Records can pass from one step to another within the framework of the Human task. When you work on a task in the Analyst tool, you work on the data issues for a step in the Human task. When you complete work on a task, the records can pass to another step for additional reviews.

    When you work on a task, you review records to determine if the records are ready to pass to the next stage in a workflow. The steps you take to complete a task depend on the type of task and the quality of the records in the task. You can correct fields that contain data that is not valid. You can consolidate duplicate records into one record that has the correct data. You can review changes that other users completed.

    If you are a business administrator, you can manage the tasks that the Human task assigned to you for administration. You can assign tasks to other users. You can view tasks that other users own, and you can complete all task instances for a Human task at the same time.

    Task Types

    A Human task contains a sequence of steps. Each step is a type of task. The task defines the type of work that a user must perform in a step.

    You can perform the following types of tasks in the Analyst tool:

    Correct exceptions

    Review and fix errors in exception records. You can update the data in an exception record. When a record does not contain an error or an empty field, you can change the record status to indicate that the record is valid.

    Review exceptions

    Review the work done by another user to correct exceptions. You can change the actions that the other users performed.

    Correct duplicates

    Create a preferred record from a group of duplicate records in a correct duplicates task. The group of duplicate records is a cluster. The Analyst tool displays the clusters one at a time. If a record is not a duplicate of another record in the cluster, you can move the record to another cluster. You can create a cluster that contains a single record.

    A Human task identifies the preferred record in the cluster for storage in the target database table. Any other record in the cluster is a duplicate of the preferred record and you can delete it from the table.

    Review duplicates

    Review the work done by another user to consolidate clusters of duplicate records into preferred records. You can change the cluster actions that other users performed.

    A Human task can contain multiple steps. For example, a workflow developer might create steps to correct employee exceptions. The developer creates a correct exceptions task for the Human Resources department and another correct exceptions task for the Accounting department. Each department works on the same data set. The developer labels the tasks HR Correct Exceptions and Accounting Correct Exceptions. A review exceptions step can follow each correct exceptions step. For example, the HR Review Exceptions step follows the HR Correct Exceptions step. When you complete a step, you select which step is next.

    Note: Records cannot pass from a task that corrects or reviews exceptions to a task that corrects or reviews duplicates. The database tables that contain exception records and cluster records have different structures.

    Users and Roles

    A workflow developer assigns users and user groups to roles in a Human task.

    The workflow developer assigns the following roles to the Human task users and the user groups:

    Task performer

    A user or group that works on task instances in the Analyst tool.

    Business administrator

    A user or group that manages the status of the task instances associated with a Human task or with a step in the task. A business administrator can reassign a task instance to another task performer.

    A workflow developer configures a business administrator for a Human task. The developer can optionally identify business administrators for specific steps in the Human task. Any business administrator for a Human task is also a business administrator for each step in the Human task.

    A task performer at the Human task level performs work on the same data for the duration of the Human task. When the task performer has a task step assignment, the task performer might not work on other steps.

    My Tasks Panel

    The My Tasks panel shows the tasks that are assigned to you and provides metadata about each task.

    The Start workspace displays the My Tasks panel. The My Tasks panel is your Inbox for tasks that a workflow assigned to you. You can open a task, view a task, or release a task from the My Tasks panel.

    The following figure shows the My Tasks panel:

    1. My Tasks and Task Administration panel switch

    2. Screen refresh

    3. Filter the tasks in the list

    4. Actions menu

    Task Columns

    The My Tasks panel lists the tasks assigned to you and provides metadata about each task.

    The following table describes the columns on the My Tasks panel:

    Task ID

    Unique identifier for the task in the workflow. The Task ID identifies the task instance in the Human task. The workflow stores the task ID value so that the ID is unique across multiple runs of the workflow.

    Task Title

    The task name. The task name consists of the Human task name and an identifier that indicates a section of the Human task. The identifier has the following format: {row # - row#}.

    Task Type

    The type of task to perform in a step. A task can be one of the following types:

    - Correct exceptions. Examine and fix errors in records.

    - Correct duplicates. Examine a duplicate record cluster and create a preferred record from the values in the duplicate records.

    - Review exceptions. Review the work done in the correct exceptions step.

    - Review duplicates. Review the work done in the correct duplicates step.

    Due Date

    The deadline for the task. A Human task developer can configure due dates for correct exceptions and correct duplicates tasks. The Analyst tool calculates a review task due date when it creates the task. Each review task might have a different due date.

    Status

    Due date status. If the task has not reached the due date, the task is on time. If the task is incomplete on the due date, the task is overdue.

    Owner

    Name of the user assigned to the task.

    Created

    Date the task was created.
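    The rule behind the Status column can be expressed as a short Python sketch. The `due_date_status` function and the label strings are assumptions made for illustration. The sketch compares whole dates only, because this guide does not state whether the Analyst tool also considers the time of day.

```python
from datetime import date


def due_date_status(due: date, today: date, completed: bool = False) -> str:
    """Classify a task as "On time" or "Overdue".

    Assumption: a task becomes overdue when it is still incomplete
    on its due date, as the Status column description states.
    """
    if not completed and today >= due:
        return "Overdue"
    return "On time"


# Hypothetical due date, checked the day before and on the day itself.
status_before = due_date_status(due=date(2014, 6, 30), today=date(2014, 6, 29))
status_on_day = due_date_status(due=date(2014, 6, 30), today=date(2014, 6, 30))
```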

    Task Ownership

    A workflow developer might assign a task step to more than one user. The workflow developer might assign a user group to work on a task step.

    When you open a task, you become the owner of the task. If the workflow assigned the task to a user group, the Analyst tool removes the task from the Inbox of the other users when you take ownership of the task.

    Task Maintenance

    You can view tasks, open tasks, and change task status in the Start workspace.

    You can perform the following operations on the My Tasks panel:

    View records in a task.

    View the records in the task without opening the task. When you view the records, the records appear in the Exceptions workspace. You can view records when another user has the task open.

    Open a task.

    Open a task so that you can review and edit the records. When you open the task, you become the owner.

    Release a task.

    Release the ownership of a task that you own. The task has no owner until the business administrator assigns another owner or another user opens the task.

    Complete a task.

    Complete a task step when you finish reviewing all the records. When you complete the step, the Analyst tool passes the task to the next step in the Human task.
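    The ownership rules for opening and releasing a task can be modeled in a few lines of Python. This is a sketch under stated assumptions: the `TaskInstance` class, its method names, and the rule that only an unowned task can be opened by another user are hypothetical and are not part of the Analyst tool.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class TaskInstance:
    task_id: str
    candidates: set[str]          # users that the workflow assigned to the step
    owner: Optional[str] = None   # None means that the task has no owner

    def open_task(self, user: str) -> None:
        """Opening a task makes the user the owner."""
        if user not in self.candidates:
            raise PermissionError(f"{user} is not assigned to task {self.task_id}")
        if self.owner not in (None, user):
            # Assumption: another user must release the task first.
            raise RuntimeError(f"task {self.task_id} is owned by {self.owner}")
        self.owner = user

    def release(self, user: str) -> None:
        """Releasing a task leaves it without an owner."""
        if self.owner != user:
            raise RuntimeError(f"{user} does not own task {self.task_id}")
        self.owner = None

    def visible_in_inbox(self, user: str) -> bool:
        """An owned task appears only in the Inbox of its owner."""
        return user in self.candidates and self.owner in (None, user)


task = TaskInstance(task_id="T1", candidates={"ana", "ben"})
task.open_task("ana")
hidden_from_ben = not task.visible_in_inbox("ben")
task.release("ana")
visible_to_ben_again = task.visible_in_inbox("ben")
```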

    Viewing Tasks

    You can view records in a task without opening the task. If you are an administrator, you can view tasks assigned to other users to monitor progress.

    1. In the My Tasks panel of the Analyst tool, select a task from the task list.

    2. Click Actions > View.

    The task opens in the Data Editing panel of the Exceptions workspace.

    Opening a Task

    Open a task from the Start workspace. You can open a task from the My Tasks panel. If you are an administrator, you can open tasks in the Task Administration panel.

    1. In the My Tasks panel, click a task to select it.

    The Analyst tool highlights the task. An arrow appears at the right of the task.

    2. Click the task arrow and select Open.

    The Exceptions workspace appears. The Data Editing panel displays rows of bad record exceptions or it displays clusters of duplicate records.

    Release a Task

    You can release a task that you own. When you release a task, you remove yourself as the owner of the task. The task appears in the Task Administration panel with a blank Owner field. The administrator can assign the task to another user.

    Releasing a Task

    You can release a task from your task list.

    1. From the Start workspace, switch to the My Tasks panel.

    A list of your tasks appears.

    2. Select one or more tasks to remove from your list.

    3. Click Actions > Release Task.

    Complete a Task

    When you complete a Human task step, you choose which step is next.

    When you complete a task step, you change the task status by choosing the next task step for the data. For example, the Human task might contain the following steps:

    • Cost Accounting Correct Exceptions

    • Cost Accounting Review Exceptions

    • General Accounting Correct Exceptions

    • General Accounting Review Exceptions

    The Task Actions menu lists the steps that you can choose from. If you completed the Cost Accounting Correct Exceptions step, you might pass the data to the Cost Accounting Review Exceptions task. Or, you might pass the data to the General Accounting Review Exceptions task. You cannot repeat a step in the Human task.

    Note: A task to review data cannot be the first task in a series of steps.
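    The constraints on step order can be checked with a small Python sketch. It encodes three rules stated in this guide: a review step cannot come first, a step cannot be repeated, and records cannot pass between exception steps and duplicate steps. The `validate_step_path` function and the task type strings are assumptions made for illustration.

```python
def validate_step_path(path: list[tuple[str, str]]) -> list[str]:
    """Return the rule violations for a list of (step_name, task_type) pairs.

    task_type is one of "correct exceptions", "review exceptions",
    "correct duplicates", or "review duplicates".
    """
    problems: list[str] = []
    if path and path[0][1].startswith("review"):
        problems.append("a review step cannot be first")
    names = [name for name, _ in path]
    if len(names) != len(set(names)):
        problems.append("a step cannot be repeated")
    # The last word of the task type is "exceptions" or "duplicates".
    families = {task_type.split()[-1] for _, task_type in path}
    if len(families) > 1:
        problems.append("exception and duplicate steps cannot be mixed")
    return problems


valid_path = [
    ("Cost Accounting Correct Exceptions", "correct exceptions"),
    ("Cost Accounting Review Exceptions", "review exceptions"),
]
invalid_path = [
    ("General Accounting Review Exceptions", "review exceptions"),
    ("General Accounting Review Exceptions", "review exceptions"),
    ("Hypothetical Correct Duplicates", "correct duplicates"),
]
valid_problems = validate_step_path(valid_path)
invalid_problems = validate_step_path(invalid_path)
```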

    Completing a Task

    To complete a task, change the task status by choosing the next task step.

    1. Open the task from the My Tasks panel.

    2. Click Task Actions.

    The Task Actions menu shows the task statuses that you can choose from.

    3. Select a status from the menu.

    Task Administration

    If you are a business administrator, you manage the status of some or all of the steps in a Human task. A business administrator can assign task steps to users.

    The Task Administration panel displays the tasks that you manage and the tasks that you own. Perform the following tasks on the Task Administration panel:

    • Open tasks assigned to you. If you receive a task to complete, you can use the Task Administration panel the same way as My Tasks.

    • Reassign tasks to users. You might assign a task to a different user if the current user cannot complete the task.

    • View the work that a user performed on a task. You can review the rate of user progress in a task and verify that the user is performing the task correctly.

    • View the list of task instances from the same Human task.

    • Complete multiple tasks. You can change all the task steps in a Human task to a completed state. When you complete the tasks, the records in the tasks move from the Human task to the next stage in the workflow.

    Assigning a Task to a User

    An administrator can assign tasks to users on the Task Administration panel. Assign a task when a task has no owner, or assign a task to a different user when a user cannot complete a task and you need to change the owner.

    1. In the My Tasks panel, switch to the Task Administration panel.

    2. Select a task from the task list.

    3. Click Actions > Reassign Tasks.

    4. Select a user to perform the task.

    Viewing All the Task Instances in a Human Task

    You can view a list of the task instances that a workflow generates from a Human task.

    1. In the My Tasks panel, switch to the Task Administration panel.

    2. Select a task from the task list.

    3. Click Actions > Complete Linked Tasks.

    4. Review the information for each task.

    The task list displays the following information for each task:

    • Task ID

    • Name of the task

    • Task type

    • Task owner

    • Due date

    • Status

    Do not click OK. If you click OK, you advance all tasks to the next stage in the workflow and remove all tasks from the Inbox of each owner.

    Multiple Task Completion

    An administrator can complete multiple tasks at one time. When an administrator completes all task instances for a Human task, the Human task releases the records to the next stage in the workflow. The Analyst tool removes the tasks from the Inbox of the users that worked on the tasks.

    You might complete multiple tasks in the following situations:

    • The workflow failed, and you want to run the workflow again. You need to clear the task instances from each user's Inbox.

    • Users did not complete the task instances, and you want to delete the tasks from each user's Inbox.

    Completing Multiple Tasks

    When you complete a set of tasks, you end all work on the Human task and you advance the task records to the next stage in the workflow.

    Note: The action to complete the tasks does not update any record or status data.

    1. In the My Tasks panel, switch to the Task Administration panel.

    2. Select a task from the task list.

    3. Click Actions > Complete Linked Tasks. The list of tasks opens.

    4. Verify that the list contains the tasks you want to complete.

    5. Click OK.

    If you open the Inbox after you complete the tasks, the Inbox might not display any change to the task list. To view the current list of tasks in the Inbox, refresh the Inbox.

    Task Export

    You can export task data to a delimited file. Export data when you want to share the current state of the data with other users.

    When you export a task, you export all data associated with the task instance, including the record or cluster data and the status metadata for the task.

    Export Data from a Correct Exceptions Task

    When you export data from a correct exceptions task, you export all record data and status data from the database.

    The following table describes the metadata columns that you export with the task data:

    ROW_IDENTIFIER

    The record row in the database table.

    REVIEW_STATUS

    The status assigned to the record in the Analyst tool. The status can be one of the following values:

    - REVIEWED. You marked the record as reviewed.

    - NULL. You did not mark the record as reviewed.

    - REJECTED. A reviewer rejected the record.

    - ACCEPTED. Approved by a reviewer.

    WORKFLOW_ID

    The workflow that contains the Human task.

    USER_COMMENT

    Any comment added to the record in the Analyst tool.

    UPDATED_STATUS

    The update status of the record in the task. The status can be one of the following values:

    - UPDATED. You added a comment to the record, or you marked the record as reviewed.

    - ACCEPTED. You accepted the record for storage in the database table that contains valid data.

    - REJECTED. You rejected the record as unsuitable for storage in the database table that contains valid data.

    - REPROCESS. You indicated that the record needs further processing in another application.

    - NULL. You did not update the record.

    RECORD_STATUS

    The record status that the workflow sets. The workflow sets the status value when it writes the record to an exception table for analysis in a Human task. The default value is INVALID.
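    After you export a correct exceptions task, you might summarize the metadata columns outside the Analyst tool. The following Python sketch counts records by UPDATED_STATUS. It assumes a comma delimiter, field names in the first row, and an empty field for NULL. The sample rows, the NAME data column, and the `count_updated_status` function are hypothetical.

```python
import csv
import io
from collections import Counter
from typing import Iterable

# Hypothetical export content. A real export contains your own data columns.
SAMPLE_EXPORT = """\
ROW_IDENTIFIER,NAME,REVIEW_STATUS,WORKFLOW_ID,USER_COMMENT,UPDATED_STATUS,RECORD_STATUS
1,Ann Lee,,wf-001,,ACCEPTED,INVALID
2,Bob Roy,,wf-001,check address,REPROCESS,INVALID
3,Cy Poe,,wf-001,,,INVALID
4,Di Fox,,wf-001,not a customer,REJECTED,INVALID
"""


def count_updated_status(lines: Iterable[str], delimiter: str = ",") -> Counter:
    """Count records per UPDATED_STATUS value, treating an empty field as NULL."""
    reader = csv.DictReader(lines, delimiter=delimiter)
    return Counter(row["UPDATED_STATUS"] or "NULL" for row in reader)


counts = count_updated_status(io.StringIO(SAMPLE_EXPORT))
```

    To read a real export file, pass an open file object instead of the `io.StringIO` wrapper and set `delimiter` to match the file.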

    Export Data from a Correct Duplicates Task

    When you export data from a correct duplicates task, you export all cluster data and status data from the database.

    The following table describes the metadata columns that you export with the task data:

    ROW_IDENTIFIER

    Identifies the record row in the database table.

    SEQUENTIAL_CLUSTER_ID

    Unique identifier for the cluster in the database table.

    CLUSTER_ID

    Identifies the cluster that the record belongs to. The Mapping task assigns a cluster ID value to each record in the table.

    MATCH_SCORE

    Decimal value between 0 and 1. Identifies the degree of similarity between two records in the cluster.

    IS_MASTER

    Indicates if the record is the preferred record in the cluster. The possible values are Y and N.

    UPDATED_STATUS

    The update status of the record. The status can be one of the following values:

    - UPDATED. You updated a value in the record.

    - NULL. You did not update the record.

    - EXTRACTED. You removed the record from a cluster.

    USER_COMMENT

    Any comment added to the cluster in the Analyst tool.

    REVIEW_STATUS

    The status assigned to the cluster in the Analyst tool. The status can be one of the following values:

    - REVIEWED. You confirmed the record as reviewed.

    - NULL. You did not mark the record as reviewed.

    - REJECTED. A reviewer rejected the record.

    - ACCEPTED. Approved by a reviewer.

    WORKFLOW_ID

    Identifies the workflow that contains the Human task associated with the task.
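    The CLUSTER_ID and IS_MASTER columns make it possible to check an exported correct duplicates task for clusters that do not have exactly one preferred record. The Python sketch below shows one way to do so. It assumes a comma delimiter and field names in the first row. The sample rows, the NAME data column, and the expectation of exactly one preferred record per cluster are assumptions for illustration.

```python
import csv
import io
from collections import defaultdict
from typing import Iterable

# Hypothetical export content with three clusters.
SAMPLE_EXPORT = """\
ROW_IDENTIFIER,CLUSTER_ID,MATCH_SCORE,IS_MASTER,NAME
1,10,1.0,Y,Ann Lee
2,10,0.92,N,Anne Lee
3,11,1.0,Y,Bob Roy
4,12,0.88,N,Cy Poe
5,12,0.85,N,Cy Po
"""


def clusters_without_single_master(
    lines: Iterable[str], delimiter: str = ","
) -> list[str]:
    """Return the IDs of clusters that do not have exactly one IS_MASTER = Y row."""
    master_counts: dict[str, int] = defaultdict(int)
    for row in csv.DictReader(lines, delimiter=delimiter):
        master_counts[row["CLUSTER_ID"]] += 1 if row["IS_MASTER"] == "Y" else 0
    return sorted(cid for cid, count in master_counts.items() if count != 1)


unresolved_clusters = clusters_without_single_master(io.StringIO(SAMPLE_EXPORT))
```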

    Exporting Task Data

    Export task data to a delimited file. You can choose whether to output the field names in the delimited file.

    1. Open the task in the My Tasks panel.

    2. Click Actions > Export Data.

    The Export Data dialog box appears.

    3. Optionally, change the export file name. By default, the file name is the same as the task name.

    4. Choose whether to export field names in the first row of the export data.

    5. Click OK.

    6. Choose to open or save the export file.

    CHAPTER 3

    Exception Records

    This chapter includes the following topics:

    • Exception Records Overview

    • Data Editing Panel for Exception Records

    • Correct Exceptions

    • Review Exceptions

    Exception Records Overview

    An exception is a record that may contain one or more data errors. A workflow adds a record to a correct exceptions task when a Mapping task cannot determine if the record is correct.

    Records in a correct exceptions task might contain errors or empty fields. When you correct exception records, you examine the records and attempt to update each record with correct and complete data. The Analyst tool indicates which fields might contain errors in the records. You can select a field and correct the error.

    After you review a record, you can update the record status. You can update the status of the records in one of the following ways:

    • If a record is valid, update the table metadata to confirm that the record is acceptable for persistent storage in the database.

    • If a record is not valid, update the table metadata to confirm that you reject the record for persistent storage in the database.

    • If you cannot confirm the record status, update the table metadata so that the record returns to the workflow for further processing in a Mapping task.

    When you review all the records in a task, you can indicate that you completed work on the task. The task menu options define the next step for the task data. The workflow determines the options that you can choose from.

    After a correct exceptions task, the next step might be to review exceptions. In this step, a user reviews the work that another user performed. In the review exceptions step, a user can change the actions of the other user. The reviewer can approve the record changes, or clear the record status to reject the changes. The review exceptions step is an optional step.

    Data Editing Panel for Exception Records

    The Data Editing panel shows the records in a task. When you open an exception task, the Data Editing panel lists the records in the task and provides a set of options you can use to complete the task.

    To edit records on the Data Editing panel, click Edit.

    You can configure what appears in the Data Editing panel for a task. You can configure which columns appear on the panel. Use the Configure option to select the data columns. Use the Filter option to filter the records that appear in the Data Editing panel. To search for records with specific values, use the Find option.

    To change the task status, select options on the Task Actions menu.

    To perform actions on rows, select options on the Record Actions menu.

    1. Filter Exceptions

    2. Configure

    3. Edit

    4. Find

    Configuring the Columns to Display

    You can organize the data columns that appear in the Data Editing panel. You can choose which columns to view in the panel.

    1. In the My Tasks panel, open a correct exceptions task.

    2. In the Data Editing panel, click Configure.

    The Configure dialog box opens.

    3. Use the Left and Right arrows to add or remove columns from the Data Editing panel.

    4. Use the Up and Down arrows to sort the data columns. The dialog box lists the column names as they appear from left to right in the Data Editing panel.

    Note: You cannot organize the metadata columns.

    5. Click Save.

    Filter Exceptions

    You can filter the records by the types of issues they contain, the priority assigned to them, and the current status of the records.

    The following table describes the filter options:

    Type of issue

    Indicates the type of data quality issue that the workflow identified in the record data. The data quality issue indicates that the record is an exception in the database. Move the pointer over the red icon in a field to view the issue name.

    Priority

    Indicates the priority that the workflow assigned to the data quality issue in the record.

    Status

    Indicates the status of the record in the current task, based on the data quality of the record. You can choose from the following status options:

    - Accepted. Records accepted for storage in the database.

    - Rejected. Records are not suitable for storage in the database.

    - Reprocessed. Records that need further analysis in another application.

    - Empty. Records with no current status.

    Review

    Indicates the review status of the record in the current task. You can choose from the following review options:

    - Reviewed. The record is reviewed.

    - Empty. The record is not reviewed.

    - Rejected. A reviewer rejected the record.

    - Accepted. Approved by a reviewer.

    Filtering Exception Records

    You can filter the data records by the types of issues they contain, the priority of the data quality issue in the record, and the status of the record. By default, the task does not apply a filter to the data.

    To filter the records that appear in the Data Editing panel, use the Filter option.

    1. In the My Tasks panel, open a correct exceptions task.

    2. In the Data Editing panel, click Filter.

    The Filter dialog box opens.

    3. Select the filter criteria to apply to the task data.

    4. Click Apply to apply the filter to the records in the task.
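    The effect of combining filter criteria can be sketched in Python. The example treats each selected option as a condition that a record must meet and ignores options that are not set. The `filter_records` function, the record field names, and the priority values are hypothetical. The Analyst tool applies the filter for you in the Filter dialog box.

```python
from typing import Any, Optional

Record = dict[str, Any]


def filter_records(
    records: list[Record],
    issue: Optional[str] = None,
    priority: Optional[str] = None,
    status: Optional[str] = None,
) -> list[Record]:
    """Return the records that match every criterion that is not None."""

    def matches(record: Record) -> bool:
        return (
            (issue is None or issue in record["issues"])
            and (priority is None or record["priority"] == priority)
            and (status is None or record["status"] == status)
        )

    return [record for record in records if matches(record)]


# Hypothetical exception records. "Empty" means that no status is set.
records = [
    {"row": 1, "issues": ["Invalid postcode"], "priority": "High", "status": "Empty"},
    {"row": 2, "issues": ["Missing phone"], "priority": "Low", "status": "Accepted"},
    {"row": 3, "issues": ["Invalid postcode", "Missing phone"], "priority": "High", "status": "Empty"},
]
open_high_priority = filter_records(records, priority="High", status="Empty")
missing_phone = filter_records(records, issue="Missing phone")
```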

    Correct Exceptions

    The records in a correct exceptions task contain data quality issues that a Mapping task discovers in a workflow. The data quality issues may or may not indicate an error in the data.

    To correct exceptions, complete one or more of the following steps:

    • Verify that each record contains an error. If the record does not contain an error, you can accept the record for storage in the table without changing it.

    • Update the records with correct data. When you can update a record with correct data, you can accept the record for storage in the table.

    • Update the status of a record. Set the status to determine how the workflow processes the record when the task completes. If you cannot update a record to a usable form, you can reject the record or you can indicate that the record needs reprocessing.

    Edit Exception Records

    Edit exception record fields in the Data Editing panel.

    When you open a task, the Exceptions workspace appears and the Data Editing panel shows the records in the task. The panel shows the fields that contain data quality issues. The error fields contain red icons. You can select an error field and then edit the value in the field. You cannot change a field that is not an error.

    Editing Exception Records

    To edit exception records, open a task and enable the task for editing in the Data Editing panel.

    1. Open the task in the My Tasks panel.

    2. Click Edit to enable the task for editing.

    3. Select a field that has an error.

    4. Enter the correct data value.

    5. Click Save.

    The error icon in the field changes from red to green.

    6. If the record is correct, change the record status to Accept Record. The status indicates that the record is acceptable for storage in the database.

    Repeat the steps for other records in the task.

    Note: You might not have the correct data values for every record in the task. If you cannot edit a record, set the status to Reprocess Record. If you determine that the record is not acceptable, set the status to Reject Record.

    Exception Record Status

    When you complete work on a record, you set the record status. The record status indicates whether the record contains good quality data. The status determines whether to return the record to the database, delete the record from the database, or pass the record to another process for examination.

    You can perform the following actions to set the record status:

    Accept a record for storage in the database.

    You can mark a record as acceptable for storage in the database. You can accept the record without changing the data or you can edit the record before you accept the record.

    Reject a record from the database.

    You can decide that a record does not belong in the database table. You mark the record for deletion from the table.

    Note: The task does not remove records from the table. Another task in the workflow or another application removes the records from the database.

    Return a record for further processing.

    You can decide that you cannot fix the record in the manual review. For example, you cannot determine the correct value of a field in the record. You mark the record for additional processing in another application.

    Clear the status of a record.

    You can undo the status that you set for a record. For example, if you mark a record for rejection from the database, but later you change your mind, you can clear the status and select a different status. You can change a record status at any time before you complete the task.
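The status actions above can be pictured as a small state model. The following Python sketch is illustrative only: the class, enum, and function names are hypothetical and are not part of the Analyst tool or any Informatica API. It assumes that a record has at most one status at a time and that clearing the status returns the record to a state with no status.

```python
from enum import Enum
from typing import Optional


class RecordStatus(Enum):
    """Hypothetical labels for the three statuses described above."""
    ACCEPT = "Accept Record"
    REJECT = "Reject Record"
    REPROCESS = "Reprocess Record"


class ExceptionRecord:
    """Illustrative stand-in for one record in a task (not a product class)."""

    def __init__(self, record_id: int) -> None:
        self.record_id = record_id
        self.status: Optional[RecordStatus] = None  # None means no status is set

    def set_status(self, status: RecordStatus) -> None:
        self.status = status

    def clear_status(self) -> None:
        # Clearing undoes the earlier choice so that a different status can be set.
        self.status = None


def downstream_action(record: ExceptionRecord) -> str:
    """Describe what a later workflow step might do with the record."""
    if record.status is RecordStatus.ACCEPT:
        return "store in database"
    if record.status is RecordStatus.REJECT:
        return "mark for deletion by a later step"
    if record.status is RecordStatus.REPROCESS:
        return "pass to another process"
    return "no status set"


record = ExceptionRecord(1)
record.set_status(RecordStatus.REJECT)
record.clear_status()                    # change of mind before the task is complete
record.set_status(RecordStatus.ACCEPT)
print(downstream_action(record))
```

The sketch keeps the rejection as a marker only, which mirrors the note that the task itself does not remove records from the table.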

    Setting the Status of an Exception Record

    When you complete work on a record, you set the record status to indicate the data quality of the record. If you later decide that the work is not complete, you can clear the status that you set.

    1. To enable the task for editing, click Edit.

    2. Select the record to update.

    3. Click Record Actions and select the status to apply to the record.

    • To indicate that a record contains correct business information and can remain in the database, select Accept Record.

    • To indicate that a record does not contain usable information and can be deleted from the database, select Reject Record.

    • To indicate that the record needs further processing before it can be returned to the database, select Reprocess Record.

    • To clear the status, select Clear Record Status.

    Task Actions

    When you complete a task step, you can select the next step to receive the task data. Select the next step on the Task Actions menu.

    When you complete a task step, the Analyst tool passes the task to the next step in the Human task. When you complete a task that corrects exception records, the records might pass to another step to review the changes. The workflow defines the options that you can choose on the Task Actions menu. The Task Actions menu can have one or more options.

    Review Exceptions

    Validate the work done by another user in a review exceptions task.

    The steps to complete the review exceptions task are similar to the steps to correct the data. In a review task, you can verify or undo the work another user performed. Your Inbox can include data correction tasks and review tasks.

    When you review exception records, you examine the changes made by the previous user and the status assigned to each record. Verify that the changes to the records are correct. Verify that the record status is correct.

    The accuracy of the data in the record must match the record status. For example, if a record contains errors, the record status must indicate that the record requires additional processing, or that the record is not valid for storage in the database.

    Reviewing Exception Records

    When you review the output of a task that corrects exceptions, you validate the status of each record. The status determines how the next stage of the workflow processes the records. The review task ends when you review all records and set the task status.

    Perform the following steps to review exceptions:

    1. Open the task in the My Tasks panel.

    The records in the task appear in the Exceptions workspace.

    2. Verify that the status of each record represents the information in the record.

    You can change the record status.

    • If a user changed the record, but you do not accept the changes, click Edit and then click Record Actions > Clear Record Status.

    • If you identify an error in the record, click Edit and update the record. When the record is correct, you can accept the record.

    3. After you review all records, set the task status.

    CHAPTER 4

    Duplicate Records

    This chapter includes the following topics:

    • Duplicate Records Overview

    • Data Editing Panel for Duplicate Records

    • Correct Duplicates

    • Duplicate Record Task Filters

    • Review Duplicates Task

    Duplicate Records Overview

    To correct duplicate records, examine a cluster of duplicate records and determine which record to store in the database and which records to discard. You can update the preferred record with values from other records in the cluster.

    A cluster is a set of records in which each record matches at least one other record based on a match score. Each cluster has a preferred record. The preferred record contains the most accurate representation of the information in the cluster. The preferred record is the record you want to store in the database. The other records in the cluster are redundant. By default, the Analyst tool selects the first record in the cluster as the preferred record. When you edit the cluster, you update the preferred record with the most accurate field values from the other duplicate records in the cluster.

    If a record is not a duplicate of another record in the cluster, you can remove it from the cluster. You can move a record from one cluster to another cluster. You can create a cluster with one record if the record is unique.

    Complete the task after you review all the duplicate record clusters and accept one preferred record for each cluster of duplicate records.

    Note: Two or more records are duplicates when they contain the same business information. Records can contain similar data but not represent the same information to the business. Your organization must determine the business rules that define duplicate data. For example, your organization might maintain more than one account record for the same customer.
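To make the cluster definition concrete, the following Python sketch groups records so that each record in a multi-record cluster matches at least one other record in that cluster. It is an illustration under stated assumptions, not the matching logic that the product uses: the function name, the pairwise score list, and the fixed threshold are all hypothetical. The sketch uses a union-find structure and then takes the first record in each cluster as the default preferred record.

```python
from typing import Dict, List, Tuple


def build_clusters(
    record_ids: List[int],
    scored_pairs: List[Tuple[int, int, float]],
    threshold: float,
) -> List[List[int]]:
    """Group records that are linked by a match score at or above the threshold."""
    parent: Dict[int, int] = {rid: rid for rid in record_ids}

    def find(rid: int) -> int:
        # Follow parent links to the cluster representative (with path halving).
        while parent[rid] != rid:
            parent[rid] = parent[parent[rid]]
            rid = parent[rid]
        return rid

    for left, right, score in scored_pairs:
        if score >= threshold:
            parent[find(left)] = find(right)  # merge the two groups

    clusters: Dict[int, List[int]] = {}
    for rid in record_ids:  # input order is kept inside each cluster
        clusters.setdefault(find(rid), []).append(rid)
    return list(clusters.values())


# Hypothetical match scores between pairs of record IDs.
pairs = [(1, 2, 0.92), (2, 3, 0.88), (4, 5, 0.40)]
clusters = build_clusters([1, 2, 3, 4, 5], pairs, threshold=0.85)
preferred = [cluster[0] for cluster in clusters]  # default: first record in each cluster
print(clusters, preferred)
```

Records 1 and 3 land in the same cluster even though they are not scored against each other, because each matches record 2. Records 4 and 5 fall below the assumed threshold and remain unique single-record clusters.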

    Data Editing Panel for Duplicate Records

    Complete a duplicate record correction task in the Data Editing panel of the Exceptions workspace. The Data Editing panel organizes the duplicate record clusters in a series of tabs. Each tab has a number. Click a tab to display the records in one cluster.

    You can limit the amount of data that you view in the Data Editing panel. Configure which data columns to display in the Data Editing panel. You can filter which clusters appear in the Data Editing panel.

    The following figure shows clusters on the Data Editing panel:

    The Data Editing panel has the following options:

    1. Configure

    2. Filter

    3. Edit

    4. Undo and redo

    The panel also contains the Task Actions menu and the Cluster Actions menu. The Task Actions menu contains the options to change the status of the task. The Cluster Actions menu contains the options that you can perform on a cluster.

    Correct Duplicates

    To correct duplicate records, examine all the records in a cluster. Add the most accurate data from each record in the cluster to the preferred record. Remove any record that is not a duplicate record.

    Perform the following actions to correct duplicates:

    Edit the cluster.

    The Data Editing panel displays the records in a cluster and selects a record as the preferred record. Examine the field values in each record in the cluster. Verify that the records in the cluster represent different versions of the same record. If you find values that contain more accurate information than the preferred record values, replace the preferred record values with the values from the other records.

    Create clusters.

    A cluster might contain a record that is not a duplicate of any other record in the cluster. In this case, create a cluster and move the record to it. Create a cluster to keep the unique record in the database table. The cluster contains a single record.

    Find records in other clusters.

    If you expect that the data contains duplicate records across more than one cluster, search for records in the other clusters. You can open multiple clusters at the same time in the Analyst tool. If duplicate records occur between clusters, move records from one cluster to another.

    Set review status.

    When you complete the preferred record in a cluster, set the cluster review status to REVIEWED. The Analyst tool marks the preferred record for storage in the database table and it marks the remaining records as redundant. A Mapping task can delete the redundant records in a later stage of the workflow. You can clear the cluster status if you want to change the preferred record again. The review status becomes NULL.

    Comment.

    Opens comments for a cluster. You can view and add comments for a specific cluster. When you add comments, the comments appear in the audit trail for the cluster.

    Complete the task.

    After you review all the clusters in your task, you can indicate that you completed work on the task. The workflow defines the options that you can select to complete the task.

    Editing a Cluster

    Examine the records in the cluster and select any data value that you want to add to a field in the preferred record. You can select fields from multiple records to add to the preferred record.

    The clusters in the task appear in a series of tabs across the Data Editing panel. The first cluster is open by default.

    Use the following steps to edit a cluster:

    1. Open a correct duplicates task.

    The clusters in the task appear in the Exceptions workspace.

    2. Click a tab to open the cluster.

    The duplicate records appear beneath the preferred record.

    3. Compare the preferred record with the other records in the cluster.

    Identify the most accurate values in each column in the cluster.

    4. Click Edit.

    5. Click a value from a duplicate record that you want to move to the preferred record.

    Repeat the steps for all values that you want to move to the preferred record. When you complete work in a cluster, confirm that you reviewed the cluster.
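The effect of the editing steps can be sketched as copying selected field values from duplicate records into the preferred record. The Python below is a hypothetical model with invented names and sample data. It does not represent how the Analyst tool stores or updates records.

```python
from typing import Dict, List, Tuple

Record = Dict[str, str]


def apply_selected_values(
    preferred: Record,
    duplicates: List[Record],
    selections: List[Tuple[int, str]],
) -> Record:
    """Return a copy of the preferred record with the selected values applied.

    Each selection is (index of the duplicate record, column name).
    """
    updated = dict(preferred)  # leave the original preferred record untouched
    for duplicate_index, column in selections:
        updated[column] = duplicates[duplicate_index][column]
    return updated


preferred = {"name": "J. Smith", "city": "Boston", "phone": ""}
duplicates = [
    {"name": "John Smith", "city": "Boston", "phone": ""},
    {"name": "J Smith", "city": "Bostn", "phone": "555-0100"},
]
# Take the fuller name from the first duplicate and the phone from the second.
merged = apply_selected_values(preferred, duplicates, [(0, "name"), (1, "phone")])
print(merged)
```

Values that are not selected, such as the misspelled city in the second duplicate, never reach the preferred record.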

    Creating a Cluster

    Create a cluster when the current cluster contains a unique record. Move the unique record to the new cluster.

    1. Open a cluster in the Data Editing panel.

    2. Click Cluster Actions > Create Cluster.

    An empty cluster appears in the Data Editing panel below the current cluster.

    3. Select the record to add to the cluster.

    4. Click Move Records.

    The record becomes the preferred record of the cluster you created.

    5. Move any other record that matches the preferred record to the cluster you created.

    If the cluster you created contains one record, the preferred record is a unique record.

    Finding Duplicate Records in Multiple Clusters

    You can find records in other clusters that may match records in the current cluster. Enter a data value to search for and enter the record column that contains the data value.

    1. In the Data Editing panel, click Cluster Actions > Find Clusters.

    The Find dialog box appears.

    2. Enter the data value you want to find. You can enter the full data value as it appears in the record column, or you can use an asterisk (*) or percent sign (%) wildcard character in a text pattern.

    3. Select the column that contains the data value to search for.

    4. Click Find.

    The search operation returns all records that contain the value you searched for.

    5. Select any record in the search results that matches a record in the open cluster. You can use the CTRL key to select multiple records.

    The Data Editing panel displays the clusters that contain the records you select. Select a record and click Move Records to move a record from one cluster to the other.
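The wildcard search in step 2 can be approximated outside the tool as follows. This Python sketch assumes that both the asterisk (*) and the percent sign (%) stand for any run of characters, that all other characters match literally, and that matching is case-sensitive. The guide does not state these details, so treat them as assumptions. The function names and the sample records are hypothetical.

```python
import re
from typing import Dict, List


def wildcard_to_regex(pattern: str) -> re.Pattern:
    """Treat * and % as 'any run of characters'; match everything else literally."""
    parts = re.split(r"[*%]", pattern)
    return re.compile(".*".join(re.escape(part) for part in parts), re.DOTALL)


def find_records(
    records: List[Dict[str, str]], column: str, pattern: str
) -> List[Dict[str, str]]:
    """Return the records whose value in the given column matches the pattern."""
    regex = wildcard_to_regex(pattern)
    return [rec for rec in records if regex.fullmatch(rec.get(column, ""))]


records = [
    {"id": "1", "last_name": "Smith"},
    {"id": "2", "last_name": "Smythe"},
    {"id": "3", "last_name": "Jones"},
]
print([rec["id"] for rec in find_records(records, "last_name", "Sm*")])
print([rec["id"] for rec in find_records(records, "last_name", "%th")])
```

A pattern without a wildcard, such as `Smith`, matches only the full value as it appears in the column.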

    Setting the Status of a Cluster

    When you complete work on a cluster, confirm that you reviewed the cluster. When you confirm the review, you set the cluster review status to REVIEWED.

    1. Open a cluster in the Data Editing panel.

    2. Click Edit.

    3. Click Cluster Actions > Confirm Cluster Review.

    4. To clear the REVIEWED status, click Cluster Actions > Clear Cluster Status.

    Adding Comments to a Cluster

    You can add comments to a cluster to save notes or information about the data. Comments appear in the data audit trail.

    1. Open a cluster in the Data Editing panel.

    2. Click Edit.

    3. Click Cluster Actions > Comment.

    4. Enter text in the Comment dialog box.

    Duplicate Record Task Filters

    You can sort the clusters that appear in the Data Editing panel. You can also apply a filter to display only the clusters that contain a data value that you specify.

    Use the Filter options to filter and sort the clusters. You sort the clusters by the status assigned to each cluster by the user who worked on the task.

    The following table describes the status options:

    Status      Description
    Accepted    Clusters accepted for the database in a review duplicates task.
    Rejected    Clusters rejected for the database in a review duplicates task.
    Reviewed    Clusters set to reviewed in a correct duplicates task.

    Filtering Clusters in the Data Editing Panel

    You can filter the clusters that display in the Data Editing panel. Select a data column and a value to filter the column by. You can also sort the clusters by status.

    1. Open a correct duplicates task in the My Tasks panel.

    2. In the Data Editing panel, click Filter. The Filter dialog box opens.

    3. Select a data column and then enter a value for the column.

    When you apply the filter, the Data Editing panel displays the clusters that contain the data you searched for.

    4. Use the Up and Down arrows to sort the clusters by status.

    If you do not sort the clusters, the Data Editing panel displays the clusters in numeric order by cluster ID.

    5. Click Apply to apply the filter to the clusters in the task.
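The filter and sort behavior described in these steps can be sketched in Python. This is a hypothetical model: the `Cluster` class, the exact-match comparison on the column value, and the way the status order is supplied are assumptions made for illustration. The fallback to cluster ID order follows the note in step 4.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional, Sequence


@dataclass
class Cluster:
    cluster_id: int
    status: Optional[str]  # "Accepted", "Rejected", "Reviewed", or None
    records: List[Dict[str, str]] = field(default_factory=list)


def filter_clusters(
    clusters: List[Cluster],
    column: str,
    value: str,
    status_order: Optional[Sequence[str]] = None,
) -> List[Cluster]:
    """Keep clusters with a record whose column equals value, then order them."""
    matching = [
        c for c in clusters if any(rec.get(column) == value for rec in c.records)
    ]
    if status_order is None:
        # No sort requested: numeric order by cluster ID.
        return sorted(matching, key=lambda c: c.cluster_id)
    rank = {status: position for position, status in enumerate(status_order)}
    # Statuses not listed sort last; ties fall back to cluster ID.
    return sorted(
        matching, key=lambda c: (rank.get(c.status, len(rank)), c.cluster_id)
    )


clusters = [
    Cluster(3, "Reviewed", [{"city": "Boston"}]),
    Cluster(1, "Accepted", [{"city": "Boston"}, {"city": "Bostn"}]),
    Cluster(2, None, [{"city": "Chicago"}]),
    Cluster(4, "Reviewed", [{"city": "Boston"}]),
]
by_id = [c.cluster_id for c in filter_clusters(clusters, "city", "Boston")]
by_status = [
    c.cluster_id
    for c in filter_clusters(clusters, "city", "Boston", ["Reviewed", "Accepted"])
]
print(by_id, by_status)
```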

    Review Duplicates Task

    When you review the records in a cluster, you validate the work done by another user in a previous step. You can perform the same processes in the review duplicates task as in a correct duplicates task.

    When you review cluster data, examine the preferred record defined by the previous user and the other records in the cluster. Verify that the preferred record represents the most accurate version of the records in the cluster. Update the preferred record if you find more accurate data in another record in the cluster.

    Verify that the other records in the cluster do not include any record that the business might want to keep. You can create a cluster and add the record to the cluster.

    Note: Your Inbox might contain both correct duplicates tasks and review duplicates tasks.

    Reviewing Duplicate Records

    When you review the output of a correct duplicates task, you validate that the preferred records represent the best version of the data in the clusters. You review one cluster at a time. The review task ends when you review all clusters and set the task status.

    Perform the following steps for all clusters in the task:

    1. Open a cluster in the Data Editing panel.

    2. Compare the preferred record with the other records in the cluster.

    3. Verify that the preferred record contains the most accurate version of the data in the cluster.

    • If a cluster status is reviewed but needs additional work, click Edit and clear the cluster status.

    • If you find an error in the preferred record and you do not want to keep the cluster, click Edit and click Cluster Actions > Reject Record.

    • If you want to update the preferred record, click Edit and change the preferred record.

    • When a record is correct, click Cluster Actions > Accept Record.

    After you review the records in all clusters, you can change the task status to the next step in the workflow.

    CHAPTER 5

    Audit Trail Operations

    This chapter includes the following topics:

    • Audit Trail Operations Overview

    • Audit Trail Data

    • Data Audit Panel Filters

    Audit Trail Operations Overview

    The Analyst tool stores an audit trail of all changes that you make to the records in a task. Review the changes to the records in the Data Audit panel of the Exceptions workspace.

    You can perform the following operations in the Data Audit panel:

    • View the list of task updates since the task was created.

    • Filter the audit trail by the date, the user name, and the type of change.

    • Change which columns appear in the Data Audit panel.

    Note: When you view the Data Audit panel for a duplicate task, the audit trail lists the records that users updated in the task. The audit trail does not display all the cluster data.

    Audit Trail Data

    Each row in the audit trail represents an update to a record. If you make multiple updates to a record, the audit trail adds an entry for each update. The audit trail organizes record changes in chronological order. If a task contains no updates, the audit trail is empty.

    An audit trail displays the fields that you change in the task. When you change a field in a record, the Data Audit panel shows an edit icon next to the field. Place the pointer over the icon to see the previous value of the field. An audit trail also identifies the user that changed the record, the date of the change, and the type of change.

    The following table describes the metadata columns in an audit trail:

    Column Name    Description
    Updated By     The user that updated the record.
    Updated        The date of the record update.
    Comment        Any comment that a user adds.
    Status         Any status that a user sets.
    Review         Any review status that a user sets.

    Use the Configure option to organize the data columns that display in the Data Audit panel.

    Data Audit Panel Filters

    You can filter the records that display in the Data Audit panel.

    The following table describes the filter options:

    Option      Description
    From, To    The date range for the updates you want to view.
    User        The user who performed the updates you want to view.
    Status      The status of the record in the current task, based on the data quality of the record.
    Review      The review status of the record in the current task. You can choose from the following review options:
                - Reviewed. The record is reviewed in a review task.
                - Approved. A reviewer approved the record.
                - Rejected. A reviewer rejected the record.
                - Empty. The record is not reviewed.
                - Cleared. Records that a user changed from a reviewed state to an unreviewed state.

    Status Options

    You can filter audit trail records based on the type of task you are viewing.

    The following table describes the status options you can set as filters in the Audit view:

    Status                  Task Type                                 Description
    Accepted                Correct exceptions                        Records accepted for storage in the database.
    Cleared                 Correct exceptions                        Records with a status update that a user deleted.
    Empty                   Correct duplicates, Correct exceptions    Records with no status update.
    Moved into cluster      Correct duplicates                        Records that moved into the specified cluster.
    Moved out of cluster    Correct duplicates                        Records that were moved out of the specified cluster.
    Rejected                Correct exceptions                        Records rejected as unsuitable for storage in the database.
    Reprocessed             Correct exceptions                        Records that need further analysis in another application.

    Filtering Records in the Data Audit Panel

    You can filter the records that display in the Data Audit panel.

    Use the following steps to filter data audit records:

    1. Open a task in the Data Audit panel, and click Filter. The Filter dialog box opens.

    2. Select the filter criteria to apply to the task data.

    3. Click Apply to apply the filter to the audit records in the task.
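As a rough model of the filter criteria, the following Python sketch filters a list of audit entries by date range, user, and status, and returns the result in chronological order. The `AuditEntry` class and its field names are hypothetical and loosely follow the metadata columns of the audit trail. The inclusive date range is an assumption, and nothing here reflects how the Analyst tool stores audit data.

```python
from dataclasses import dataclass
from datetime import date
from typing import List, Optional


@dataclass
class AuditEntry:
    updated_by: str
    updated: date
    status: Optional[str] = None
    review: Optional[str] = None
    comment: str = ""


def filter_audit_trail(
    entries: List[AuditEntry],
    date_from: Optional[date] = None,
    date_to: Optional[date] = None,
    user: Optional[str] = None,
    status: Optional[str] = None,
) -> List[AuditEntry]:
    """Apply every criterion that is set; a criterion left as None is ignored."""
    selected = [
        entry
        for entry in entries
        if (date_from is None or entry.updated >= date_from)
        and (date_to is None or entry.updated <= date_to)
        and (user is None or entry.updated_by == user)
        and (status is None or entry.status == status)
    ]
    return sorted(selected, key=lambda entry: entry.updated)  # chronological order


trail = [
    AuditEntry("maria", date(2014, 6, 3), status="Rejected"),
    AuditEntry("li", date(2014, 6, 1), status="Accepted"),
    AuditEntry("maria", date(2014, 6, 2), status="Accepted", comment="Fixed ZIP"),
]
accepted = filter_audit_trail(trail, status="Accepted")
marias = filter_audit_trail(trail, date_from=date(2014, 6, 2), user="maria")
print([entry.updated_by for entry in accepted], len(marias))
```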

