of 307 /307
Informatica PowerCenter (Version 9.0.1) Advanced Workflow Guide

Informatica PowerCenter (Version 9.0.1) - Gerardnico€¦ · Informatica PowerCenter Advanced ... This product includes software developed by the OpenSSL Project for use in the OpenSSL

  • Author
    dinhque

  • View
    240

  • Download
    1

Embed Size (px)

Text of Informatica PowerCenter (Version 9.0.1) - Gerardnico€¦ · Informatica PowerCenter Advanced ......

  • Informatica PowerCenter (Version 9.0.1)

    Advanced Workflow Guide

  • Informatica PowerCenter Advanced Workflow Guide

    Version 9.0.1June 2010

    Copyright (c) 1998-2010 Informatica. All rights reserved.

    This software and documentation contain proprietary information of Informatica Corporation and are provided under a license agreement containing restrictions on use anddisclosure and are also protected by copyright law. Reverse engineering of the software is prohibited. No part of this document may be reproduced or transmitted in any form,by any means (electronic, photocopying, recording or otherwise) without prior consent of Informatica Corporation. This Software may be protected by U.S. and/or internationalPatents and other Patents Pending.

    Use, duplication, or disclosure of the Software by the U.S. Government is subject to the restrictions set forth in the applicable software license agreement and as provided inDFARS 227.7202-1(a) and 227.7702-3(a) (1995), DFARS 252.227-7013(1)(ii) (OCT 1988), FAR 12.212(a) (1995), FAR 52.227-19, or FAR 52.227-14 (ALT III), as applicable.

    The information in this product or documentation is subject to change without notice. If you find any problems in this product or documentation, please report them to us inwriting.

    Informatica, Informatica Platform, Informatica Data Services, PowerCenter, PowerCenterRT, PowerCenter Connect, PowerCenter Data Analyzer, PowerExchange,PowerMart, Metadata Manager, Informatica Data Quality, Informatica Data Explorer, Informatica B2B Data Transformation, Informatica B2B Data Exchange and InformaticaOn Demand are trademarks or registered trademarks of Informatica Corporation in the United States and in jurisdictions throughout the world. All other company and productnames may be trade names or trademarks of their respective owners.

    Portions of this software and/or documentation are subject to copyright held by third parties, including without limitation: Copyright DataDirect Technologies. All rightsreserved. Copyright Sun Microsystems. All rights reserved. Copyright RSA Security Inc. All Rights Reserved. Copyright Ordinal Technology Corp. All rightsreserved.Copyright Aandacht c.v. All rights reserved. Copyright Genivia, Inc. All rights reserved. Copyright 2007 Isomorphic Software. All rights reserved. Copyright MetaIntegration Technology, Inc. All rights reserved. Copyright Intalio. All rights reserved. Copyright Oracle. All rights reserved. Copyright Adobe Systems Incorporated. Allrights reserved. Copyright DataArt, Inc. All rights reserved. Copyright ComponentSource. All rights reserved. Copyright Microsoft Corporation. All rights reserved.Copyright Rouge Wave Software, Inc. All rights reserved. Copyright Teradata Corporation. All rights reserved. Copyright Yahoo! Inc. All rights reserved. Copyright Glyph & Cog, LLC. All rights reserved.

    This product includes software developed by the Apache Software Foundation (http://www.apache.org/), and other software which is licensed under the Apache License,Version 2.0 (the "License"). You may obtain a copy of the License at http://www.apache.org/licenses/LICENSE-2.0. Unless required by applicable law or agreed to in writing,software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See theLicense for the specific language governing permissions and limitations under the License.

    This product includes software which was developed by Mozilla (http://www.mozilla.org/), software copyright The JBoss Group, LLC, all rights reserved; software copyright 1999-2006 by Bruno Lowagie and Paulo Soares and other software which is licensed under the GNU Lesser General Public License Agreement, which may be found at http://www.gnu.org/licenses/lgpl.html. The materials are provided free of charge by Informatica, "as-is", without warranty of any kind, either express or implied, including but notlimited to the implied warranties of merchantability and fitness for a particular purpose.

    The product includes ACE(TM) and TAO(TM) software copyrighted by Douglas C. Schmidt and his research group at Washington University, University of California, Irvine,and Vanderbilt University, Copyright () 1993-2006, all rights reserved.

    This product includes software developed by the OpenSSL Project for use in the OpenSSL Toolkit (copyright The OpenSSL Project. All Rights Reserved) and redistribution ofthis software is subject to terms available at http://www.openssl.org.

    This product includes Curl software which is Copyright 1996-2007, Daniel Stenberg, . All Rights Reserved. Permissions and limitations regarding thissoftware are subject to terms available at http://curl.haxx.se/docs/copyright.html. Permission to use, copy, modify, and distribute this software for any purpose with or withoutfee is hereby granted, provided that the above copyright notice and this permission notice appear in all copies.

    The product includes software copyright 2001-2005 () MetaStuff, Ltd. All Rights Reserved. Permissions and limitations regarding this software are subject to terms availableat http://www.dom4j.org/ license.html.

    The product includes software copyright 2004-2007, The Dojo Foundation. All Rights Reserved. Permissions and limitations regarding this software are subject to termsavailable at http:// svn.dojotoolkit.org/dojo/trunk/LICENSE.

    This product includes ICU software which is copyright International Business Machines Corporation and others. All rights reserved. Permissions and limitations regarding thissoftware are subject to terms available at http://source.icu-project.org/repos/icu/icu/trunk/license.html.

    This product includes software copyright 1996-2006 Per Bothner. All rights reserved. Your right to use such materials is set forth in the license which may be found at http://www.gnu.org/software/ kawa/Software-License.html.

    This product includes OSSP UUID software which is Copyright 2002 Ralf S. Engelschall, Copyright 2002 The OSSP Project Copyright 2002 Cable & WirelessDeutschland. Permissions and limitations regarding this software are subject to terms available at http://www.opensource.org/licenses/mit-license.php.

    This product includes software developed by Boost (http://www.boost.org/) or under the Boost software license. Permissions and limitations regarding this software are subjectto terms available at http:/ /www.boost.org/LICENSE_1_0.txt.

    This product includes software copyright 1997-2007 University of Cambridge. Permissions and limitations regarding this software are subject to terms available at http://www.pcre.org/license.txt.

    This product includes software copyright 2007 The Eclipse Foundation. All Rights Reserved. Permissions and limitations regarding this software are subject to termsavailable at http:// www.eclipse.org/org/documents/epl-v10.php.

    This product includes software licensed under the terms at http://www.tcl.tk/software/tcltk/license.html, http://www.bosrup.com/web/overlib/?License, http://www.stlport.org/doc/license.html, http://www.asm.ow2.org/license.html, http://www.cryptix.org/LICENSE.TXT, http://hsqldb.org/web/hsqlLicense.html, http://httpunit.sourceforge.net/doc/license.html, http://jung.sourceforge.net/license.txt , http://www.gzip.org/zlib/zlib_license.html, http://www.openldap.org/software/release/license.html, http://www.libssh2.org,http://slf4j.org/license.html, http://www.sente.ch/software/OpenSourceLicense.html, and http://fusesource.com/downloads/license-agreements/fuse-message-broker-v-5-3-license-agreement.

    This product includes software licensed under the Academic Free License (http://www.opensource.org/licenses/afl-3.0.php), the Common Development and DistributionLicense (http://www.opensource.org/licenses/cddl1.php) the Common Public License (http://www.opensource.org/licenses/cpl1.0.php) and the BSD License (http://www.opensource.org/licenses/bsd-license.php).

    This product includes software copyright 2003-2006 Joe WaInes, 2006-2007 XStream Committers. All rights reserved. Permissions and limitations regarding this softwareare subject to terms available at http://xstream.codehaus.org/license.html. This product includes software developed by the Indiana University Extreme! Lab. For furtherinformation please visit http://www.extreme.indiana.edu/.

    This Software is protected by U.S. Patent Numbers 5,794,246; 6,014,670; 6,016,501; 6,029,178; 6,032,158; 6,035,307; 6,044,374; 6,092,086; 6,208,990; 6,339,775;6,640,226; 6,789,096; 6,820,077; 6,823,373; 6,850,947; 6,895,471; 7,117,215; 7,162,643; 7,254,590; 7,281,001; 7,421,458; and 7,584,422, international Patents and otherPatents Pending..

  • DISCLAIMER: Informatica Corporation provides this documentation "as is" without warranty of any kind, either express or implied, including, but not limited to, the impliedwarranties of non-infringement, merchantability, or use for a particular purpose. Informatica Corporation does not warrant that this software or documentation is error free. Theinformation provided in this software or documentation may include technical inaccuracies or typographical errors. The information in this software and documentation issubject to change at any time without notice.

    NOTICES

    This Informatica product (the Software) includes certain drivers (the DataDirect Drivers) from DataDirect Technologies, an operating company of Progress SoftwareCorporation (DataDirect) which are subject to the following terms and conditions:

    1.THE DATADIRECT DRIVERS ARE PROVIDED AS IS WITHOUT WARRANTY OF ANY KIND, EITHER EXPRESSED OR IMPLIED, INCLUDING BUT NOTLIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NON-INFRINGEMENT.

    2. IN NO EVENT WILL DATADIRECT OR ITS THIRD PARTY SUPPLIERS BE LIABLE TO THE END-USER CUSTOMER FOR ANY DIRECT, INDIRECT,INCIDENTAL, SPECIAL, CONSEQUENTIAL OR OTHER DAMAGES ARISING OUT OF THE USE OF THE ODBC DRIVERS, WHETHER OR NOT INFORMED OFTHE POSSIBILITIES OF DAMAGES IN ADVANCE. THESE LIMITATIONS APPLY TO ALL CAUSES OF ACTION, INCLUDING, WITHOUT LIMITATION, BREACHOF CONTRACT, BREACH OF WARRANTY, NEGLIGENCE, STRICT LIABILITY, MISREPRESENTATION AND OTHER TORTS.

    Part Number: PC-AWG-90100-0001

  • Table of Contents

    Preface . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xiiInformatica Resources. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xii

    Informatica Customer Portal. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xii

    Informatica Documentation. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xii

    Informatica Web Site. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xii

    Informatica How-To Library. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xii

    Informatica Knowledge Base. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xiii

    Informatica Multimedia Knowledge Base. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xiii

    Informatica Global Customer Support. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xiii

    Chapter 1: Understanding Pipeline Partitioning. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1Understanding Pipeline Partitioning Overview. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1

    Partitioning Attributes. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2

    Partition Points. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2

    Number of Partitions. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3

    Partition Types. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4

    Dynamic Partitioning. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4

    Configuring Dynamic Partitioning. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5

    Rules and Guidelines for Dynamic Partitioning. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5

    Using Dynamic Partitioning with Partition Types. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6

    Configuring Partition-Level Attributes. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6

    Cache Partitioning. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6

    Mapping Variables in Partitioned Pipelines. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7

    Partitioning Rules. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8

    Partition Restrictions for Editing Objects. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8

    Partition Restrictions for PowerExchange. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9

    Configuring Partitioning. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9

    Adding Partition Points to a Pipeline. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10

    Configuring a Partition Point. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10

    Partition Points Node. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10

    Non-Partition Points Node. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11

    Chapter 2: Partition Points. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12Partition Points Overview. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12

    Adding and Deleting Partition Points. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13

    Rules and Guidelines for Adding and Deleting Partition Points. . . . . . . . . . . . . . . . . . . . . . . . 13

    Partitioning Relational Sources. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15

    Entering an SQL Query. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15

    Table of Contents i

  • Entering a Filter Condition. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15

    Partitioning File Sources. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16

    Rules and Guidelines for Partitioning File Sources. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16

    Using One Thread to Read a File Source. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17

    Using Multiple Threads to Read a File Source. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17

    Configuring for File Partitioning. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18

    Partitioning Relational Targets. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21

    Database Compatibility. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21

    Partitioning File Targets. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21

    Configuring Connection Settings. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21

    Configuring File Properties. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 22

    Partitioning Custom Transformations. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24

    Working with Multiple Partitions. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24

    Creating Partition Points. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 25

    Working with Threads. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 25

    Partitioning Joiner Transformations. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 26

    Partitioning Sorted Joiner Transformations. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 26

    Using Sorted Flat Files. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 27

    Using Sorted Relational Data. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 28

    Using Sorter Transformations. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 28

    Optimizing Sorted Joiner Transformations with Partitions. . . . . . . . . . . . . . . . . . . . . . . . . . . 29

    Partitioning Lookup Transformations. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 29

    Cache Partitioning Lookup Transformations. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 29

    Partitioning Pipeline Lookup Transformation Cache. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 30

    Partitioning Sequence Generator Transformations. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31

    Partitioning Sorter Transformations. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31

    Configuring Sorter Transformation Work Directories. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 32

    Partitioning XML Generator Transformations. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 32

    Restrictions for Transformations. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 32

    Restrictions for Numerical Functions. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 33

    Chapter 3: Partition Types. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 34Partition Types Overview. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 34

    Setting Partition Types in the Pipeline. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 35

    Setting Partition Types. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 35

    Database Partitioning Partition Type. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 38

    Partitioning Database Sources. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 38

    Target Database Partitioning. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 40

    Hash Auto-Keys Partition Type. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 41

    Hash User Keys Partition Type. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 41

    Key Range Partition Type. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 42

    Adding a Partition Key. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 43

    Adding Key Ranges. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 43

    ii Table of Contents

  • Pass-Through Partition Type. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 44

    Round-Robin Partition Type. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 46

    Chapter 4: Pushdown Optimization. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 47Pushdown Optimization Overview. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 47

    Pushdown Optimization Types. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 48

    Running Source-Side Pushdown Optimization Sessions. . . . . . . . . . . . . . . . . . . . . . . . . . . . 48

    Running Target-Side Pushdown Optimization Sessions. . . . . . . . . . . . . . . . . . . . . . . . . . . . 48

    Running Full Pushdown Optimization Sessions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 48

    Active and Idle Databases. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 49

    Working with Databases. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 50

    Comparing the Output of the Integration Service and Databases. . . . . . . . . . . . . . . . . . . . . . 50

    Using ODBC Drivers. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 51

    Rules and Guidelines for Netezza. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 52

    Pushdown Compatibility. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 52

    Incompatible Users for Database Connections. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 53

    Qualifying Names of Tables in Idle Databases. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 54

    Working with Dates. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 54

    Working with Expressions. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 55

    Operators. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 55

    Variables. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 56

    Functions. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 56

    Rules and Guidelines for Functions in Pushdown Optimization. . . . . . . . . . . . . . . . . . . . . . . . 60

    Error Handling, Logging, and Recovery. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 61

    Error Handling. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 61

    Logging. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 61

    Recovery. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 61

    Working with Slowly Changing Dimensions. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 62

    Working with Sequences and Views. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 62

    Sequences. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 62

    Views. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 63

    Troubleshooting Orphaned Sequences and Views. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 64

    Using the $$PushdownConfig Mapping Parameter. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 66

    Configuring Sessions for Pushdown Optimization. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 67

    Pushdown Options. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 67

    Partitioning. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 68

    Target Load Rules. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 70

    Viewing Pushdown Groups. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 70

    Chapter 5: Pushdown Optimization Transformations. . . . . . . . . . . . . . . . . . . . . . . . . . . . . 73Pushdown Optimization and Transformations Overview. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 73

    Rules and Guidelines for Pushdown Optimization and Transformations. . . . . . . . . . . . . . . . . . 74

    Aggregator Transformation. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 75

    Table of Contents iii

  • Expression Transformation. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 75

    Filter Transformation. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 76

    Joiner Transformation. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 76

    Lookup Transformation. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 77

    Unconnected Lookup Transformation. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 78

    Lookup Transformation with an SQL Override. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 79

    Router Transformation. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 79

    Sequence Generator Transformation. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 80

    Sorter Transformation. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 81

    Source Qualifier Transformation. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 81

    Source Qualifier Transformation with an SQL Override. . . . . . . . . . . . . . . . . . . . . . . . . . . . . 82

    Target. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 83

    Union Transformation. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 84

    Update Strategy Transformation. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 84

    Chapter 6: Real-time Processing. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 86Real-time Processing Overview. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 86

    Understanding Real-time Data. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 87

    Messages and Message Queues. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 87

    Web Service Messages. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 88

    Change Data from PowerExchange Change Data Capture Sources. . . . . . . . . . . . . . . . . . . . 88

    Configuring Real-time Sessions. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 88

    Terminating Conditions. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 89

    Idle Time. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 89

    Message Count. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 89

    Reader Time Limit. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 89

    Flush Latency. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 90

    Commit Type . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 90

    Message Recovery. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 91

    Prerequisites. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 91

    Steps to Enable Message Recovery. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 92

    Recovery File. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 92

    Message Processing. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 92

    Message Recovery. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 92

    Session Recovery Data Flush. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 93

    Recovery Table. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 93

    PM_REC_STATE Table. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 93

    Message Processing. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 93

    Message Recovery. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 94

    Recovery Queue and Recovery Topic. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 94

    Message Processing. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 95

    Message Recovery. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 95

    Recovery Ignore List. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 95

    iv Table of Contents

  • Stopping Real-time Sessions. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 95

    Restarting and Recovering Real-time Sessions. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 96

    Restarting Real-time Sessions. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 96

    Recovering Real-time Sessions. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 96

    Restart and Recover Commands. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 97

    Rules and Guidelines for Real-time Sessions. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 97

    Rules and Guidelines for Message Recovery. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 98

    Real-time Processing Example. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 98

    Informatica Real-time Products. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 100

    Chapter 7: Commit Points. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 101Commit Points Overview. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 101

    Target-Based Commits. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 102

    Source-Based Commits. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 102

    Determining the Commit Source. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 103

    Switching from Source-Based to Target-Based Commit. . . . . . . . . . . . . . . . . . . . . . . . . . . 104

    User-Defined Commits. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 106

    Rolling Back Transactions. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 107

    Understanding Transaction Control. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 109

    Transformation Scope. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 110

    Understanding Transaction Control Units. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 112

    Rules and Guidelines for Working with Transaction Control. . . . . . . . . . . . . . . . . . . . . . . . . 112

    Creating Target Files by Transaction. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 113

    Setting Commit Properties. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 113

    Chapter 8: Row Error Logging. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 115Row Error Logging Overview. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 115

    Error Log Code Pages. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 116

    Understanding the Error Log Tables. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 116

    PMERR_DATA. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 116

    PMERR_MSG. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 118

    PMERR_SESS. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 119

    PMERR_TRANS. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 120

    Understanding the Error Log File. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 121

    Configuring Error Log Options. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 123

    Chapter 9: Workflow Recovery. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 125Workflow Recovery Overview. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 125

    State of Operation. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 126

    Workflow State of Operation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 126

    Session State of Operation. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 126

    Target Recovery Tables. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 127

    Recovery Options. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 129

    Table of Contents v

  • Suspending the Workflow. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 130

    Configuring Suspension Email. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 131

    Configuring Workflow Recovery. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 131

    Recovering Stopped, Aborted, and Terminated Workflows. . . . . . . . . . . . . . . . . . . . . . . . . . 132

    Recovering Suspended Workflows. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 132

    Configuring Task Recovery. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 132

    Task Recovery Strategies. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 133

    Automatically Recovering Terminated Tasks. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 135

    Resuming Sessions. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 135

    Working with Repeatable Data. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 136

    Source Repeatability. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 136

    Transformation Repeatability. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 137

    Configuring a Mapping for Recovery. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 137

    Steps to Recover Workflows and Tasks. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 140

    Recovering a Workflow. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 140

    Recovering a Session. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 141

    Recovering a Workflow From a Session. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 141

    Rules and Guidelines for Session Recovery. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 141

    Configuring Recovery to Resume from the Last Checkpoint. . . . . . . . . . . . . . . . . . . . . . . . . 141

    Unrecoverable Workflows or Tasks. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 142

    Chapter 10: Stopping and Aborting. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 143Stopping and Aborting Overview. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 143

    Types of Errors. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 144

    Threshold Errors. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 144

    Fatal Errors. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 144

    Integration Service Handling for Session Failure. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 145

    Stopping or Aborting the Workflow. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 145

    Stopping or Aborting a Task. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 145

    Steps to Stop or Abort. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 146

    Chapter 11: Concurrent Workflows. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 147Concurrent Workflows Overview. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 147

    Configuring Unique Workflow Instances. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 148

    Recovering Workflow Instances by Instance Name. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 148

    Rules and Guidelines for Running Concurrent Instances of the Same Instance Name. . . . . . . . 148

    Configuring Concurrent Workflows of the Same Name. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 149

    Running Concurrent Web Service Workflows. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 149

    Configuring Workflow Instances of the Same Name. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 149

    Recovering Workflow Instances of the Same Name. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 149

    Rules and Guidelines for Running Concurrent Instances of the Same Instance Name. . . . . . . . 150

    Using Parameters and Variables. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 150

    Accessing the Run Instance Name or Run ID. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 150

    vi Table of Contents

  • Steps to Configure Concurrent Workflows. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 151

    Starting and Stopping Concurrent Workflows. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 151

    Starting Workflow Instances from Workflow Designer. . . . . . . . . . . . . . . . . . . . . . . . . . . . . 151

    Starting One Concurrent Workflow. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 152

    Starting Concurrent Workflows from the Command Line. . . . . . . . . . . . . . . . . . . . . . . . . . . 152

    Stopping or Aborting Concurrent Workflows. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 152

    Monitoring Concurrent Workflows. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 153

    Viewing Session and Workflow Logs. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 153

    Log Files for Unique Workflow Instances. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 153

    Log Files for Workflow Instances of the Same Name. . . . . . . . . . . . . . . . . . . . . . . . . . . . . 154

    Rules and Guidelines for Concurrent Workflows. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 154

    Chapter 12: Grid Processing. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 156Grid Processing Overview. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 156

    Running Workflows on a Grid. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 156

    Running Sessions on a Grid. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 157

    Working with Partition Groups. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 157

    Forming Partition Groups Without Resource Requirements. . . . . . . . . . . . . . . . . . . . . . . . . 158

    Forming Partition Groups With Resource Requirements. . . . . . . . . . . . . . . . . . . . . . . . . . . 158

    Rules and Guidelines for Creating Partition Groups. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 158

    Working with Caches. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 158

    Grid Connectivity and Recovery. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 159

    Configuring a Workflow or Session to Run on a Grid. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 159

    Rules and Guidelines for Configuring a Workflow or Session to Run on a Grid. . . . . . . . . . . . . 160

    Chapter 13: Load Balancer. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 161Load Balancer Overview. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 161

    Assigning Service Levels to Workflows. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 161

    Assigning Resources to Tasks. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 162

    Chapter 14: Workflow Variables. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 164Workflow Variables Overview. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 164

    Predefined Workflow Variables. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 165

    Using Predefined Workflow Variables in Expressions. . . . . . . . . . . . . . . . . . . . . . . . . . . . . 167

    Evaluating Condition in a Workflow. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 167

    Evaluating Task Status in a Workflow. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 168

    Evaluating Previous Task Status in a Workflow. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 168

    User-Defined Workflow Variables. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 169

    Workflow Variable Start and Current Values. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 170

    Datatype Default Values. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 170

    Creating User-Defined Workflow Variables. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 171

    Using Worklet Variables. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 172

    Persistent Worklet Variables. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 172

    Table of Contents vii

  • Overriding the Initial Value. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 173

    Rules and Guidelines for Using Worklet Variables. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 173

    Assigning Variable Values in a Worklet. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 173

    Passing Variable Values between Worklets. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 174

    Configuring Variable Assignments. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 174

    Chapter 15: Parameters and Variables in Sessions. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 176Working with Session Parameters. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 176

    Changing the Session Log Name. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 179

    Changing the Target File and Directory. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 179

    Changing Source Parameters in a File. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 179

    Changing Connection Parameters. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 180

    Getting Run-Time Information. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 180

    Rules and Guidelines for Creating File Parameters and Database Connection Parameters. . . . . 180

    Mapping Parameters and Variables in Sessions. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 181

    Assigning Parameter and Variable Values in a Session. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 181

    Passing Parameter and Variable Values between Sessions. . . . . . . . . . . . . . . . . . . . . . . . . 182

    Configuring Variable Assignments. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 183

    Chapter 16: Parameter Files. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 184Parameter Files Overview. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 184

    Parameter and Variable Types. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 185

    Where to Use Parameters and Variables. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 186

    Overriding Connection Attributes in the Parameter File. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 193

    Parameter File Structure. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 194

    Parameter File Sections. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 194

    Comments. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 196

    Null Values. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 196

    Sample Parameter File. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 196

    Configuring the Parameter File Name and Location. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 197

    Using a Parameter File with Workflows or Sessions. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 197

    Using a Parameter File with pmcmd. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 199

    Parameter File Example. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 199

    Guidelines for Creating Parameter Files. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 200

    Troubleshooting Parameters and Parameter Files. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 201

    Tips for Parameters and Parameter Files. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 201

    Chapter 17: FastExport. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 203Using FastExport Overview. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 203

    Step 1. Create a FastExport Connection. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 204

    Verifying the Code Page Mapping File. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 205

    Step 2. Change the Reader. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 206

    Step 3. Change the Source Connection. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 206

    viii Table of Contents

  • Step 4. Override the Control File (Optional). . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 206

    Rules and Guidelines for Using FastExport. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 207

    Chapter 18: External Loading. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 208External Loading Overview. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 208

    Before You Begin. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 208

    External Loader Behavior. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 209

    Loading Data to a Named Pipe. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 209

    Staging Data to a Flat File. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 209

    Partitioning Sessions with External Loaders. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 210

    Loading to IBM DB2. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 210

    IBM DB2 EE External Loader. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 210

    IBM DB2 EEE External Loader. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 211

    Rules and Guidelines for IBM DB2 EEE External Loaders. . . . . . . . . . . . . . . . . . . . . . . . . . 211

    Setting Operation Modes. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 212

    Configuring Authorities, Privileges, and Permissions. . . . . . . . . . . . . . . . . . . . . . . . . . . . . 212

    Configuring IBM DB2 EE External Loader Attributes. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 212

    Configuring IBM DB2 EEE External Loader Attributes. . . . . . . . . . . . . . . . . . . . . . . . . . . . . 214

    Loading to Oracle. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 216

    Rules and Guidelines for Oracle External Loaders. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 216

    Loading Multibyte Data to Oracle. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 216

    Configuring Oracle External Loader Attributes. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 217

    Loading to Sybase IQ. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 217

    Rules and Guidelines for Sybase IQ External Loaders. . . . . . . . . . . . . . . . . . . . . . . . . . . . 218

    Loading Multibyte Data to Sybase IQ. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 218

    Configuring Sybase IQ External Loader Attributes. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 218

    Loading to Teradata. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 219

    Rules and Guidelines for Teradata External Loaders. . . . . . . . . . . . . . . . . . . . . . . . . . . . . 220

    Overriding the Control File. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 220

    Creating User Variables in the Control File. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 221

    Configuring Teradata MultiLoad External Loader Attributes. . . . . . . . . . . . . . . . . . . . . . . . . 222

    Configuring Teradata TPump External Loader Attributes. . . . . . . . . . . . . . . . . . . . . . . . . . . 224

    Configuring Teradata FastLoad External Loader Attributes. . . . . . . . . . . . . . . . . . . . . . . . . 226

    Configuring Teradata Warehouse Builder Attributes. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 228

    Configuring External Loading in a Session. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 231

    Configuring a Session to Write to a File. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 231

    Configuring File Properties. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 232

    Selecting an External Loader Connection. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 232

    Troubleshooting External Loading. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 233

    Chapter 19: FTP. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 235FTP Overview. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 235

    Rules and Guidelines for Using FTP. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 235

    Table of Contents ix

  • Integration Service Behavior. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 236

    Using FTP with Source Files. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 236

    Using FTP with Target Files. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 236

    Configuring FTP in a Session. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 237

    Configuring SFTP in a Session. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 237

    Selecting an FTP Connection. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 237

    Configuring Source File Properties. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 238

    Configuring Target File Properties. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 239

    Chapter 20: Session Caches. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 241Session Caches Overview. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 241

    Cache Memory. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 242

    Cache Files. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 243

    Naming Convention for Cache Files. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 243

    Cache File Directory. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 245

    Configuring the Cache Size. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 245

    Calculating the Cache Size. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 246

    Using Auto Memory Size. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 246

    Configuring a Numeric Cache Size. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 247

    Steps to Configure the Cache Size. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 247

    Cache Partitioning. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 248

    Configuring the Cache Size for Cache Partitioning. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 248

    Aggregator Caches. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 249

    Incremental Aggregation. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 249

    Configuring the Cache Sizes for an Aggregator Transformation. . . . . . . . . . . . . . . . . . . . . . 249

    Troubleshooting Aggregator Caches. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 250

    Joiner Caches. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 250

    1:n Partitioning. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 251

    n:n Partitioning. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 251

    Configuring the Cache Sizes for a Joiner Transformation. . . . . . . . . . . . . . . . . . . . . . . . . . 252

    Troubleshooting Joiner Caches. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 252

    Lookup Caches. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 253

    Sharing Caches. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 254

    Configuring the Cache Sizes for a Lookup Transformation. . . . . . . . . . . . . . . . . . . . . . . . . . 254

    Rank Caches. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 255

    Configuring the Cache Sizes for a Rank Transformation. . . . . . . . . . . . . . . . . . . . . . . . . . . 255

    Troubleshooting Rank Caches. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 256

    Sorter Caches. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 256

    Configuring the Cache Size for a Sorter Transformation. . . . . . . . . . . . . . . . . . . . . . . . . . . 257

    XML Target Caches. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 257

    Configuring the Cache Size for an XML Target. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 257

    Optimizing the Cache Size. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 258

    x Table of Contents

  • Chapter 21: Incremental Aggregation. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 259Incremental Aggregation Overview. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 259

    Integration Service Processing for Incremental Aggregation. . . . . . . . . . . . . . . . . . . . . . . . . . . . 260

    Reinitializing the Aggregate Files. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 260

    Moving or Deleting the Aggregate Files. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 261

    Finding Index and Data Files. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 261

    Partitioning Guidelines with Incremental Aggregation. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 261

    Preparing for Incremental Aggregation. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 262

    Configuring the Mapping. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 262

    Configuring the Session. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 263

    Chapter 22: Session Log Interface. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 264Session Log Interface Overview. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 264

    Implementing the Session Log Interface. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 264

    The Integration Service and the Session Log Interface. . . . . . . . . . . . . . . . . . . . . . . . . . . . 264

    Rules and Guidelines for Implementing the Session Log Interface. . . . . . . . . . . . . . . . . . . . . 265

    Functions in the Session Log Interface. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 265

    INFA_InitSessionLog. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 266

    INFA_OutputSessionLogMsg. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 266

    INFA_OutputSessionLogFatalMsg. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 267

    INFA_EndSessionLog. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 268

    INFA_AbnormalSessionTermination. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 268

    Session Log Interface Example. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 269

    Building the External Session Log Library. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 269

    Using the External Session Log Library. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 269

    Chapter 23: Understanding Buffer Memory. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 270Understanding Buffer Memory Overview. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 270

    Configuring Automatic Memory Settings. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 271

    Configuring Automatic Memory Settings for Multiple Sessions. . . . . . . . . . . . . . . . . . . . . . . 271

    Configuring Buffer Memory. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 272

    Configuring Session Cache Memory. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 272

    Configuring Maximum Memory Limits. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 273

    Configuring Automatic Memory Settings for Session Caches. . . . . . . . . . . . . . . . . . . . . . . . 273

    Chapter 24: High Precision Data. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 275High Precision Data Overview. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 275

    Bigint. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 275

    Decimal. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 276

    Index. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 277

    Table of Contents xi

  • PrefaceThe PowerCenter Advanced Workflow Guide is written for developers and administrators who are responsible forcreating workflows and sessions, and running workflows. This guide assumes you have knowledge of youroperating systems, relational database concepts, and the database engines, flat files or mainframe system in yourenvironment. This guide also assumes you are familiar with the interface requirements for your supportingapplications.

    Informatica Resources

    Informatica Customer PortalAs an Informatica customer, you can access the Informatica Customer Portal site at http://mysupport.informatica.com. The site contains product information, user group information, newsletters,access to the Informatica customer support case management system (ATLAS), the Informatica How-To Library,the Informatica Knowledge Base, the Informatica Multimedia Knowledge Base, Informatica ProductDocumentation, and access to the Informatica user community.

    Informatica DocumentationThe Informatica Documentation team takes every effort to create accurate, usable documentation. If you havequestions, comments, or ideas about this documentation, contact the Informatica Documentation team throughemail at [email protected] We will use your feedback to improve our documentation. Let usknow if we can contact you regarding your comments.

    The Documentation team updates documentation as needed. To get the latest documentation for your product,navigate to Product Documentation from http://mysupport.informatica.com.

    Informatica Web SiteYou can access the Informatica corporate web site at http://www.informatica.com. The site contains informationabout Informatica, its background, upcoming events, and sales offices. You will also find product and partnerinformation. The services area of the site includes important information about technical support, training andeducation, and implementation services.

    Informatica How-To LibraryAs an Informatica customer, you can access the Informatica How-To Library at http://mysupport.informatica.com.The How-To Library is a collection of resources to help you learn more about Informatica products and features. It

    xii

    http://mysupport.informatica.commailto:[email protected]://mysupport.informatica.comhttp://www.informatica.comhttp://mysupport.informatica.com

  • includes articles and interactive demonstrations that provide solutions to common problems, compare features andbehaviors, and guide you through performing specific real-world tasks.

    Informatica Knowledge BaseAs an Informatica customer, you can access the Informatica Knowledge Base at http://mysupport.informatica.com.Use the Knowledge Base to search for documented solutions to known technical issues about Informaticaproducts. You can also find answers to frequently asked questions, technical white papers, and technical tips. Ifyou have questions, comments, or ideas about the Knowledge Base, contact the Informatica Knowledge Baseteam through email at [email protected]

    Informatica Multimedia Knowledge BaseAs an Informatica customer, you can access the Informatica Multimedia Knowledge Base at http://mysupport.informatica.com. The Multimedia Knowledge Base is a collection of instructional multimedia filesthat help you learn about common concepts and guide you through performing specific tasks. If you havequestions, comments, or ideas about the Multimedia Knowledge Base, contact the Informatica Knowledge Baseteam through email at [email protected]

    Informatica Global Customer SupportYou can contact a Customer Support Center by telephone or through the Online Support. Online Support requiresa user name and password. You can request a user name and password at http://mysupport.informatica.com.

    Use the following telephone numbers to contact Informatica Global Customer Support:

    North America / South America Europe / Middle East / Africa Asia / Australia

    Toll Free+1 877 463 2435 Standard RateBrazil: +55 11 3523 7761Mexico: +52 55 1168 9763United States: +1 650 385 5800

    Toll Free00 800 4632 4357 Standard RateBelgium: +32 15 281 702France: +33 1 41 38 92 26Germany: +49 1805 702 702Netherlands: +31 306 022 797Spain and Portugal: +34 93 480 3760United Kingdom: +44 1628 511 445

    Toll FreeAustralia: 1 800 151 830Singapore: 001 800 4632 4357 Standard RateIndia: +91 80 4112 5738

    Preface xiii

    http://mysupport.informatica.commailto:[email protected]://mysupport.informatica.commailto:[email protected]://mysupport.informatica.com

  • xiv

  • C H A P T E R 1

    Understanding Pipeline PartitioningThis chapter includes the following topics:

    Understanding Pipeline Partitioning Overview, 1

    Partitioning Attributes, 2

    Dynamic Partitioning, 4

    Cache Partitioning, 6

    Mapping Variables in Partitioned Pipelines, 7

    Partitioning Rules, 8

    Configuring Partitioning, 9

    Understanding Pipeline Partitioning OverviewYou create a session for each mapping you want the Integration Service to run. Each mapping contains one ormore pipelines. A pipeline consists of a source qualifier and all the transformations and targets that receive datafrom that source qualifier. When the Integration Service runs the session, it can achieve higher performance bypartitioning the pipeline and performing the extract, transformation, and load for each partition in parallel.

    A partition is a pipeline stage that executes in a single reader, transformation, or writer thread. The number ofpartitions in any pipeline stage equals the number of threads in the stage. By default, the Integration Servicecreates one partition in every pipeline stage.

    If you have the Partitioning option, you can configure multiple partitions for a single pipeline stage. You canconfigure partitioning information that controls the number of reader, transformation, and writer threads that themaster thread creates for the pipeline. You can configure how the Integration Service reads data from the source,distributes rows of data to each transformation, and writes data to the target. You can configure the number ofsource and target connections to use.

    Complete the following tasks to configure partitions for a session:

    Set partition attributes including partition points, the number of partitions, and the partition types.

    You can enable the Integration Service to set partitioning at run time. When you enable dynamic partitioning,the Integration Service scales the number of session partitions based on factors such as the source databasepartitions or the number of nodes in a grid.

    After you configure a session for partitioning, you can configure memory requirements and cache directories foreach transformation.

    The Integration Service evaluates mapping variables for each partition in a target load order group. You canuse variable functions in the mapping to set the variable values.

    1

  • When you create multiple partitions in a pipeline, the Workflow Manager verifies that the Integration Servicecan maintain data consistency in the session using the partitions. When you edit object properties in thesession, you can impact partitioning and cause a session to fail.

    You add or edit partition points in the session properties. When you change partition points you can define thepartition type and add or delete partitions.

    Partitioning AttributesYou can set the following attributes to partition a pipeline:

    Partition points. Partition points mark thread boundaries and divide the pipeline into stages. The IntegrationService redistributes rows of data at partition points.

    Number of partitions. A partition is a pipeline stage that executes in a single thread. If you purchase thePartitioning option, you can set the number of partitions at any partition point. When you add partitions, youincrease the number of processing threads, which can improve session performance.

    Partition types. The Integration Service creates a default partition type at each partition point. If you have thePartitioning option, you can change the partition type. The partition type controls how the Integration Servicedistributes data among partitions at partition points.

    Partition PointsBy default, the Integration Service sets partition points at various transformations in the pipeline. Partition pointsmark thread boundaries and divide the pipeline into stages. A stage is a section of a pipeline between any twopartition points. When you set a partition point at a transformation, the new pipeline stage includes thattransformation.

    The following figure shows the default partition points and pipeline stages for a mapping with one pipeline:

    When you add a partition point, you increase the number of pipeline stages by one. Similarly, when you delete apartition point, you reduce the number of stages by one. Partition points mark the points in the pipeline where theIntegration Service can redistribute data across partitions.

    For example, if you place a partition point at a Filter transformation and define multiple partitions, the IntegrationService can redistribute rows of data among the partitions before the Filter transformation processes the data. Thepartition type you set at this partition point controls the way in which the Integration Service passes rows of data toeach partition.

    2 Chapter 1: Understanding Pipeline Partitioning

  • RELATED TOPICS: Partition Points on page 12

    Number of PartitionsThe number of threads that process each pipeline stage depends on the number of partitions. A partition is apipeline stage that executes in a single reader, transformation, or writer thread. The number of partitions in anypipeline stage equals the number of threads in that stage.

    You can define up to 64 partitions at any partition point in a pipeline. When you increase or decrease the numberof partitions at any partition point, the Workflow Manager increases or decreases the number of partitions at allpartition points in the pipeline. The number of partitions remains consistent throughout the pipeline. If you definethree partitions at any partition point, the Workflow Manager creates three partitions at all other partition points inthe pipeline. In certain circumstances, the number of partitions in the pipeline must be set to one.

    Increasing the number of partitions or partition points increases the number of threads. Therefore, increasing thenumber of partitions or partition points also increases the load on the node. If the node contains enough CPUbandwidth, processing rows of data in a session concurrently can increase session performance. However, if youcreate a large number of partitions or partition points in a session that processes large amounts of data, you canoverload the system.

    The number of partitions you create equals the number of connections to the source or target. If the pipelinecontains a relational source or target, the number of partitions at the source qualifier or target instance equals thenumber of connections to the database. If the pipeline contains file sources, you can configure the session to readthe source with one thread or with multiple threads.

    For example, when you define three partitions across the mapping, the master thread creates three threads ateach pipeline stage, for a total of 12 threads.

    The Integration Service runs the partition threads concurrently. When you run a session with multiple partitions,the threads run as follows:

    1. The reader threads run concurrently to extract data from the source.

    2. The transformation threads run concurrently in each transformation stage to process data. The IntegrationService redistributes data among the partitions at each partition point.

    3. The writer threads run concurrently to write data to the target.

    Partitioning Multiple Input Group TransformationsThe master thread creates a reader and transformation thread for each pipeline in the target load order group. Atarget load order group has multiple pipelines when it contains a transformation with multiple input groups.

    When you connect more than one pipeline to a multiple input group transformation, the Integration Servicemaintains the transformation threads or creates a new transformation thread depending on whether or not themultiple input group transformation is a partition point:

    Partition point does not exist at multiple input group transformation. When a partition point does not existat a multiple input group transformation, the Integration Service processes one thread at a time for the multipleinput group transformation and all downstream transformations in the stage.

    Partition point exists at multiple input group transformation. When a partition point exists at a multipleinput group transformation, the Integration Service creates a new pipeline stage and processes the stage withone thread for each partition. The Integration Service creates one transformation thread for each partitionregardless of the number of output groups the transformation contains.

    Partitioning Attributes 3

  • Partition TypesWhen you configure the partitioning information for a pipeline, you must define a partition type at each partitionpoint in the pipeline. The partition type determines how the Integration Service redistributes data across partitionpoints.

    The Integration Services creates a default partition type at each partition point. If you have the Partitioning option,you can change the partition type. The partition type controls how the Integration Service distributes data amongpartitions at partition points. You can create different partition types at different points in the pipeline.

    You can define the following partition types in the Workflow Manager:

    Database partitioning. The Integration Service queries the IBM DB2 or Oracle database system for tablepartition information. It reads partitioned data from the corresponding nodes in the database. You can usedatabase partitioning with Oracle or IBM DB2 source instances on a multi-node tablespace. You can usedatabase partitioning with DB2 targets.

    Hash auto-keys. The Integration Service uses a hash function to group rows of data among partitions. TheIntegration Service groups the data based on a partition key. The Integration Service uses all grouped or sortedports as a compound partition key. You may need to use hash auto-keys partitioning at Rank, Sorter, andunsorted Aggregator transformations.

    Hash user keys. The Integration Service uses a hash function to group rows of data among partitions. Youdefine the number of ports to generate the partition key.

    Key range. With key range partitioning, the Integration Service distributes rows of data based on a port or setof ports that you define as the partition key. For each port, you define a range of values. The IntegrationService uses the key and ranges to send rows to the appropriate partition. Use key range partitioning when thesources or targets in the pipeline are partitioned by key range.

    Pass-through. In pass-through partitioning, the Integration Service processes data without redistributing rowsamong partitions. All rows in a single partition stay in the partition after crossing a pass-through partition point.Choose pass-through partitioning when you want to create an additional pipeline stage to improveperformance, but do not want to change the distribution of data across partitions.

    Round-robin. The Integration Service distributes data evenly among all partitions. Use round-robin partitioningwhere you want each partition to process approximately the same number of rows.

    Dynamic PartitioningIf the volume of data grows or you add more CPUs, you might need to adjust partitioning so the session run timedoes not increase. When you use dynamic partitioning, you can configure the partition information so theIntegration Service determines the number of partitions to create at run time.

    The Integration Service scales the number of session partitions at run time based on factors such as sourcedatabase partitions or the number of nodes in a grid.

    If any transformation in a stage does not support partitioning, or if the partition configuration does not supportdynamic partitioning, the Integration Service does not scale partitions in the pipeline. The data passes through onepartition.

    4 Chapter 1: Understanding Pipeline Partitioning

  • Complete the following tasks to scale session partitions with dynamic partitioning:

    Set the partitioning. The Integration Service increases the number of partitions based on the partitioningmethod you choose.

    Set session attributes for dynamic partitions. You can set session attributes that identify source and targetfile names and directories. The session uses the session attributes to create the partition-level attributes foreach partition it creates at run time.

    Configure partition types. You can edit partition points and partition types using the Partitions view on theMapping tab of session properties.

    Note: Do not configure dynamic partitioning for a session that contains manual partitions. If you set dynamicpartitioning to a value other than disabled and you manually partition the session, the session is invalid.

    Configuring Dynamic PartitioningConfigure dynamic partitioning on the Config Object tab of session properties. Configure dynamic partitioningusing one of the following methods:

    Disabled. Do not use dynamic partitioning. Defines the number of partitions on the Mapping tab.

    Based on number of partitions. Sets the partitions to a number that you define in the Number of Partitionsattribute. Use the $DynamicPartitionCount session parameter, or enter a number greater than 1.

    Based on number of nodes in grid. Sets the partitions to the number of nodes in the grid running the session.If you configure this option for sessions that do not run on a grid, the session runs in one partition and logs amessage in the session log.

    Based on source partitioning. Determines the number of partitions using database partition information. Thenumber of partitions is the maximum of the number of partitions at the source. For Oracle sources that usecomposite partitioning, the number of partitions is the maximum of the number of subpartitions at the source.

    Based on number of CPUs. Sets the number of partitions equal to the number of CPUs on the node thatprepares the session. If the session is configured to run on a grid, dynamic partitioning sets the number ofpartitions equal to the number of CPUs on the node that prepares the session multiplied by the number ofnodes in the grid.

    RELATED TOPICS: Database Partitioning Partition Type on page 38

    Rules and Guidelines for Dynamic PartitioningUse the following rules and guidelines with dynamic partitioning:

    Dynamic partitioning uses the same connection for each partition.

    You cannot use dynamic partitioning with XML sources and targets.

    You cannot use dynamic partitioning with the Debugger.

    Sessions that use SFTP fail if you enable dynamic partitioning.

    When you set dynamic partitioning to a value other than disabled, and you manually partition the session onthe Mapping tab, you invalidate the session.

    Dynamic Partitioning 5

  • The session fails if you use a parameter other than $DynamicPartitionCount to set the number of partitions.

    The following dynamic partitioning configurations cause a session to run with one partition:

    - You override the default cache directory for an Aggregator, Joiner, Lookup, or Rank transformation. TheIntegration Service partitions a transformation cache directory when the default is $PMCacheDir.

    - You override the Sorter transformation default work directory. The Integration Service partitions the Sortertransformation work directory when the default is $PMTempDir.

    - You use an open-ended range of numbers or date keys with a key range partition type.

    - You use datatypes other than numbers or dates as keys in key range partitioning.

    - You use key range relational target partitioning.

    - You create a user-defined SQL statement or a user-defined source filter.

    - You set dynamic partitioning to the number of nodes in the grid, and the session does not run on a grid.

    - You use pass-through relational source partitioning.

    - You use dynamic partitioning with an Application Source Qualifier.

    - You use SDK or PowerConnect sources and targets with dynamic partitioning.

    Using Dynamic Partitioning with Partition TypesThe following rules apply to using dynamic partitioning with different partition types:

    Pass-through partitioning. If you change the number of partitions at a partition point, the number of partitionsin each pipeline stage changes. If you use pass-through partitioning with a relational source, the session runsin one partition in the stage.

    Key range partitioning. You must define a closed range of numbers or date keys to use dynamic partitioning.The keys must be numeric or date datatypes. Dynamic partitioning does not scale partitions with key rangepartitioning on relational targets.

    Database partitioning. When you use database partitioning, the Integration Service creates session partitionsbased on the source database partitions. Use database partitioning with Oracle and IBM DB2 sources.

    Hash auto-keys, hash user keys, or round-robin. Use hash user keys, hash auto-keys, and round-robinpartition types to distribute rows with dynamic partitioning. Use hash user keys and hash auto-keys partitioningwhen you want the Integration Service to distribute rows to the partitions by group. Use round-robin partitioningwhen you want the Integration Service to distribute rows evenly to partitions.

    Configuring Partition-Level AttributesWhen you use dynamic partitioning, the Integration Service defines the partition-level attributes for each partition itcreates at run time. It names the file and directory attributes based on session-level attribute names that youdefine in the session properties.

    For example, you define the session reject file name as accting_detail.bad. When the Integration Service createspartitions at run time, it creates a reject file for each partition, such as accting_detail1.bad, acctin